Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is hadoop pig?
What do you mean by meta information in hdfs?
What is check pointing in hadoop?
What are Pig Execution modes?
How can you overwrite the replication factors in HDFS?
what does the shell commands “Capture” and “Consistency” determines?
Why scala is used in spark?
Can we change the data type of a column in a hive table?
On what basis data will be stored on a rack?
Is apache flume real time processing framework?
What are the different types of tables available in Hive?
Explain what is “map” and what is "reducer" in hadoop?
Explain the shuffle?
What are the benefits of apache kafka over the traditional technique?
How to add the partition in existing table without the partition table?