What is the basic difference between Cassandra and HBase?
What are the different modes in which Pig can run? Explain each.
In what ways is SparkSession different from SparkContext?
What are the NameNode and DataNode in HDFS?
Write a short note on the disadvantages of MapReduce.
How can you store data in Spark?
How can it help avoid costly modeling?
What is Apache Flume used for?
What is speculative execution?
What do you understand by receivers in Spark Streaming?
What is an output format in Hadoop?
What is Azure Spark?
What does RDD stand for?
How do you delete directories and files recursively from HDFS?
What are partitions in Cassandra?