Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is PageRank in Spark?
What are paired RDDs?
When to avoid secondary indexes?
What is Hector?
Why is an RDD immutable?
What do you use Spark for?
What are the best features of Apache Sqoop?
How do you change the number of mappers running on a slave in MapReduce?
What are Flume and Sqoop?
How do you copy a file into HDFS with a block size different from the existing block size configuration?
What is the maximum size of a message that a Kafka server can receive?
Is Databricks a database?
What are the core methods of the reducer?
List a few differences between Apache Kafka and RabbitMQ.
What is the use of Cloudera?