Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What is a flume agent?
What are the data formats supported by apache tajo?
What do we mean by Paraquet?
What are all stats classes in the java api package available?
What are the various configuration parameters required to run a mapreduce job?
How NameNode tackle Datanode failures in Hadoop?
What is apache spark for beginners?
Is it necessary to install spark on all the nodes of a YARN cluster while running Apache Spark on YARN ?
Can you join multiple fields in Apache
What is difference between coalesce and repartition?
What problems can be addressed by using Zookeeper?
Are results returned as they become available, or all at once when a query completes?
Name different types of the data model?
Explain the difference between NameNode
Mention what is the difference between Hbase and Hive?