Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What all tasks you can perform for managing host using Ambari host tab?
What is a rack awareness algorithm and why is it used in hadoop?
When we are using queries instead of scripting?
What is the difference between python and spark?
What is Text Input Format?
Can you explain bloommapfile.
Explain how is hadoop different from other data processing tools?
What database does spark use?
Can you explain broadcast variables?
What is spark and what is its purpose?
What do you understand by column family?
How do you write your own custom SerDe ?
What is data skew and how do you fix it?
is HQL case sensitive?
How do Hadoop MapReduce works?