Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is Thrift?
What is the use of Spark SQL?
How would you check whether your NameNode is working or not?
How do you set which framework is used to run a MapReduce program?
What is the default partitioner in Hadoop?
What tools are needed to build Ambari?
What are the benefits of Apache Kafka over traditional techniques?
What are shuffle read and shuffle write in Spark?
What are the driver and executor in Spark?
Is Avro supported?
What is the use of HCatalog?
What is the Spark driver?
What is shuffling in MapReduce?
What are the limitations of importing RDBMS tables into HCatalog directly?
Does Apache Spark provide checkpoints?