Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the difference between hadoop and spark?
Mention when you can use alter keyspace?
What is the problem with the small file in Hadoop?
Since the data is replicated thrice in hdfs, does it mean that any calculation done on one node will also be replicated on the other two?
In Hive, can you overwrite Hadoop MapReduce configuration in Hive?
What do you understand by standalone (or local) mode?
How do we create rdds in spark?
How can you remove the elements with a key present in any other RDD?
What is bag data type in Pig?
What is Hadoop serialization?
What are the steps to submit a Hadoop job?
What are the different Data Types available in Hive?
What is a "Spark Driver"?
Explain JobConf in MapReduce.
Explain what is logging in Cassandra?