Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What are the main methods of data transferring in hadoop sqoop?
Agent communicate with other Agents?
Compare Hadoop and RDBMS?
Name different types of NoSQL database?
How is the processing of streaming data achieved in Apache Spark? Explain.
If reducers do not start before all mappers finish then why does the progress on mapreduce job shows something like map(50%) reduce(10%)? Why reducers progress percentage is displayed when mapper is not finished yet?
What is a column family in Cassandra?
What is serialization in spark?
What are the 2 types of table in hive?
Explain the overview of hadoop history breifly?
Why we cannot do aggregation (addition) in a mapper? Why we require reducer for that?
How does hdfs get a good throughput?
Mention the difference between hbase and relational database?
What are the different types of Znodes?
What is the InputFormat ?