Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain Catalyst framework?
Which language is better for spark?
What is memtable?
Does Apache Sqoop have a default database?
What is broadcast variable?
Where is spark rdd?
What is difference between hive and hdfs?
What are the different functions available in pig latin language?
What is a "map" in Hadoop?
What is a “Distributed Cache” in Apache Hadoop?
Explain about the smb join in hive?
List down the languages supported by Apache Spark?
What is the difference between RDBMS with Hadoop MapReduce?
How can we import data from particular row or column? What is the destination types allowed in Sqoop import command?
What is the problem in having lots of small files in hdfs?