Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is CQL?
Define streaming access?
What are different types of filesystem?
List commonly used machine learning algorithm?
What is the Job interface in MapReduce framework?
What do you mean by column family?
Which language is not supported by spark?
Can I install spark on windows?
Why HDFS performs replication, although it results in data redundancy in Hadoop?
What is master node in spark?
What are the different components of a Hive query processor?
Which technique can you use in hbase to access hfile directly without the help of hbase?
what is a sequence file in Hadoop?
How do users interact with the shell in apache pig?
What are the ways in which Apache Spark handles accumulated Metadata?