Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the prerequisite for Apache Hive installation?
Is spark part of hadoop ecosystem?
how is a file of the size 1 GB uncompressed
What is HBase?
Explain pipe() operation. How it writes the result to the standard output?
Explain different transformation on DStream?
Map reduce jobs take too long. What can be done to improve the performance of the cluster?
What is the advantage of hadoop over java serialization?
Explain HCatReader?
Define sparksession in apache spark? Why is it needed?
Can you modify the file present in hdfs?
Define replication factor?
What is shuffle spill in spark?
How to start zookeeper server?
What is BloomMapFile used for?