Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) Does mapreduce programming model provide a way for reducers to communicate with each other? In a mapreduce job can a reducer communicate with another reducer?
679How would you tackle calculating the number of unique visitors for each hour by mining a huge apache log? You can use post processing on the output of the mapreduce job.
826If reducers do not start before all mappers finish then why does the progress on mapreduce job shows something like map(50%) reduce(10%)? Why reducers progress percentage is displayed when mapper is not finished yet?
712
Can you define rdd lineage?
Explain HCatReader?
If no custom partitioner is defined in Hadoop then how is data partitioned before it is sent to the reducer?
Define HDFS and talk about their respective components?
What is fluming?
What is sqoop in Hadoop ?
Explain avrostorage function?
Differentiate between static and dynamic cql tables.
What is the sequencefileinputformat in hadoop?
What is the Use of Cassandra Database ?
Did edmond berger invent the spark plug?
How can one copy a file into HDFS with a different block size to that of existing block size configuration?
Define a daemon?
What is the full form of fsck?
Can you tell us how many daemon processes run on a hadoop system?