What are combiners, and when should you use a combiner in a MapReduce job?
What is the procedure for creating users in HDFS, and how do you allocate quotas to them?
What are the purposes of using the Ambari shell?
What do you understand by the term snitch in Cassandra? Give some examples.
What happens to the JobTracker when the NameNode is down?
What is the JobTracker in Hadoop? What actions does Hadoop follow?
What are partitions and tokens in Cassandra?
Is Spark part of the Hadoop ecosystem?
How do you set up Spark?
What are the different methods of Hive?
What is the port number for the NameNode?
What are the different ways of representing data in Spark?
Define partitions.
Why do we use BloomMapFile?
How does NameNode tackle DataNode failures?