Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What are the two main components of ResourceManager?
What are the differences between hadoop 1 and hadoop 2?
What kind of applications is supported by Apache Hive?
How is the splitting of file invoked in Hadoop ?
What is scala spark?
Is client the end user in HDFS?
Explain how message is consumed by consumer in Kafka?
What do you know by storage and compute node?
What is the process of changing the split size if there is limited storage space on Commodity Hardware?
List some use cases of apache kafka?
What are the different compaction types in hbase?
How can you use streams api?
What is throughput in HDFS?
What are the different file permissions in the HDFS for files or directory levels?
What are shared variables in Apache Spark?