Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What do we mean by Paraquet?
Rack awareness of Namenode?
Describe REVERSE function in Hive with example?
Discuss writeahead logging in Apache Spark Streaming?
Tell something about the query language used in Cassandra Database?
What are best features of Apache Avro?
Mention what does the shell commands “capture” and “consistency” determines?
What is tungsten engine in spark?
How do I get apache spark on windows 10?
What is kafka Producer?
What does rdd stand for in logistics?
What is a UDF in Pig?
When should you use hbase?
How to set mappers and reducers for Hadoop jobs?
What are the differences between Caching and Persistence method in Apache Spark?