Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) What alternate way does HDFS provides to recover data in case a Namenode, without backup, fails and cannot be recovered?
35If I create a folder in HDFS, will there be metadata created corresponding to the folder? If yes, what will be the size of metadata created for a directory?
38What is a block in Hadoop HDFS? What should be the block size to get optimum performance from the Hadoop cluster?
36
How do we create rdds in spark?
How do I start flume agent?
Input Split & Record Reader and what they do?
Why is block size set to 128 MB in HDFS?
Where is hadoop-env.sh file present?
How do you organize the pig latin statements?
Is spark faster than hadoop?
Explain apache kafka?
What is ZooKeeper Atomic Broadcast (ZAB) protocol?
Define taskinstance?
Name the two types of shared variable available in Apache Spark?
What is zookeeper in hadoop?
Explain how you can improve the throughput of a remote consumer?
What are core components of Flume?
How does hdfs get a good throughput?