Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is Hadoop serialization?
Does 'ILLUSTRATE' run a MapReduce job?
How often do you need to reformat the namenode?
What are distinct operators in impala?
When Namenode is down what happens to job tracker?
Which are the two types of 'writes' in HDFS?
State the usage of 'filters', 'group' , 'orderBy', 'distinct' keywords in pig scripts?
shouldn't DFS be able to handle large volumes of data already?
What is the need for Spark DAG?
What does /etc /init.d do?
Explain the process for starting a kafka server?
Use of import command in hadoop sqoop?
What is a partition in spark?
What is difference between hive and hdfs?
When to choose "External Table" in Hive?