Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What is a reliable and unreliable receiver in Spark?
What is the relation between job and task in hadoop?
Apache Flume support third-party plugins also?
Compare apache pig and sql?
Which one will you decide for an undertaking – Hadoop MapReduce or Apache Spark?
How to format the HDFS? How frequently it will be done?
What is the use of expand cqlsh command in Cassandra?
Explain the benefits of block transfer?
When is it not recommended to use MapReduce paradigm for large scale data processing?
Explain the key benefits of using storm for real time processing?
Which technique can you use in hbase to access hfile directly without the help of hbase?
What are combiners? When should I use a combiner in my MapReduce Job?
What is data pipeline in spark?
What are the key features of Apache Spark that you like?
What is identity mapper and identity reducer?