Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What does illustrate do in Apache Pig?
What are the most commonly defined input formats in Hadoop?
What are the DDL commands used in hbase?
What will you do when NameNode is down?
How can you transfer data from hive to hdfs?
What is aws spark?
Is hadoop a memory?
What are the applications of Apache ZooKeeper?
when to choose “internal table” and “external table” in hive?
What is the role of the offset.
How to overwrite an existing output file/dir during execution of Hadoop MapReduce jobs?
How is HCatalog different from Hive?
What are the various data sources available in SparkSQL?
How hbase uses zookeeper?
What is pre-requisites for contributing to apache mahout ?