Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are different types of filesystem?
What is spark in python?
What do you understand by an inner bag and outer bag in Pig?
What is Directed Acyclic Graph in Apache Spark?
Which type of data HBase can store?
In how many ways RDDs can be created? Explain.
Apache Flume support third-party plugins also?
Discuss about the different tombstone markers used for deletion purposes in HBase.?
What do you mean by data locality?
What is spark repartition?
What are the core apis in kafka?
Mention the difference between hbase and relational database?
Explain what is memtable in cassandra?
What happens when two clients try to access the same file in the hdfs?
On what basis name node distribute blocks across the data nodes in HDFS?