Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Is it necessary to start Hadoop to run any Apache Spark Application ?
Replication causes data redundancy then why is is pursued in HDFS?
What is apache mahout?
What do you know about Partition in Kafka?
Does the archiving of hive tables give any space saving in hdfs?
Explain how cassandra writes data?
What do you understand by Commit log in Cassandra?
What is heap memory in spark?
Is it possible to rename the output file, and if so, how?
Name some features of Apache Cassandra?
What does ambari shell can provide?
Explain the input type/format in mapreduce by default?
What is lineage graph?
What is a Task instance in Hadoop? Where does it run?1
How does pipe operation writes the result to standard output in Apache Spark?