Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What are nodes and ephemeral nodes?
What is hive installation path?
what is partitions in hive?
Does hdfs enable a customer to peruse a record, which is already opened for writing?
Which Sorting algorithm is used in Hadoop MapReduce?
Does Apache Spark provide checkpoints?
Define Thrift in Apache Cassandra?
what is "map" and what is "reducer" in Hadoop?
Clarify what is sqoop in hadoop?
What do you mean by Free Form Import in Sqoop?
What do you understand by Executor Memory in a Spark application?
When should you use sequencefileinputformat?
Explain the role of the offset?
Define a udf?
What is driver memory and executor memory in spark?