Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What is the local repository and where it is useful while using ambari environment?
What is Implicit Type conversion in Hive?
How does HDFS ensure Data Integrity of data blocks stored in HDFS?
What is the problem with the small file in Hadoop?
What is the function of mapreducer partitioner?
What do you know by storage and compute node?
What are ‘maps’ and ‘reduces’?
What is a checkpoint?
Why Mapper runs in heavy weight process and not in a thread in MapReduce?
Why HDFS stores data using commodity hardware despite the higher chance of failures?
What is difference between flume and sqoop?
Mention what are the main components of cassandra data model?
Why we use intwritable instead of int? Why we use longwritable instead of long?
Define fold() operation in Apache Spark?
How namenode handles data node failures?