Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
what are relational operations in pig latin?
What are the complex datatypes in pig?
What is Data Log in Kafka?
What are best features of Apache Avro?
Is it possible to provide multiple input to Hadoop? If yes then how?
When executing Hive queries in different directories, why is metastore_db created in all places from where Hive is launched?
What is hadoop, hbase, hive and cassandra? Specify similarities and differences among them.
What are the difference between of the “HDFS Block” and “Input Split”?
Explain Spark countByKey() operation?
What is Sparse Vector?
Explain lineage graph
Does 'ILLUSTRATE' run a MapReduce job?
What is compaction in hbase?
List the various types of "Cluster Managers" in Spark.
What is a worker node in Apache Spark?