If no custom partitioner is defined in Hadoop, how is data partitioned before it is sent to the reducer?
Explain the terms memtable, commit log, and SSTable.
How do I know if a Flume agent is running?
How does groupByKey work in Spark?
What are the important features of Hadoop?
How does a NameNode handle the failure of DataNodes?
What is a Spark shuffle?
What is safe mode in Hadoop?
Are there any problems that can only be solved by MapReduce and cannot be solved by Pig? In what kinds of scenarios are MapReduce jobs more useful than Pig?
Why is the number of splits equal to the number of maps?
What is a single-node cluster in Hadoop? For what purposes is Hadoop run on a single-node cluster?
Is Spark good for machine learning?
What is the difference between Apache Sqoop and Flume?
Explain the SMB (sort-merge-bucket) join in Hive.
How can Flume be used with HBase?
What tools are used in Big Data?