What are the types of tables in Hive?
What is the use of the Context object?
How many instances of the JobTracker run on a Hadoop cluster?
Explain the first() operation in Spark.
How does Cassandra write data?
On what basis is data stored on a rack?
What are the benefits of DataFrames in Spark?
Describe the join() operation. How is an outer join supported?
What is fluming?
What happens to the NameNode when the JobTracker is down?
What are Hive's default read and write classes?
What are sink processors?
How does Cassandra perform a read operation? Explain.
What tools are used in Big Data?
What does hadoop-env.sh do?