Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Which storage level does the cache () function use?
What is a generic udf in hive?
Can we run spark without hadoop?
Explain what is heartbeat in hdfs?
Explain what happens if, during the PUT operation, HDFS block is assigned a replication factor 1 instead of the default value 3?
When is it not recommended to use MapReduce paradigm for large
what is JobTracker in Hadoop? What are the actions followed by Hadoop?
What is heartbeat in hadoop?
What is struct and explain its purpose?
What is Hive Data Definition language?
What is the Internal Architecture of the Cassandra Database ?
What is the non dfs used?
What are the limitations of Apache Spark?
How do I optimize my spark code?
Where the mapper's intermediate data will be stored?