Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is a bloom filter?
Discuss how you can use filters in apache hbase
Can the region server will be located on all datanodes?
What is Geo-Replication in Kafka?
Mention some basic tajo shell commands?
In how many ways RDDs can be created? Explain.
Different ways of debugging a job in MapReduce?
What is the difference between Hive CLI and Beeline?
hbase support syntax structure like sql. Yes or no?
Can we run Apache Spark without Hadoop?
What is executor in spark?
Mention what is the use of Context Object?
What is Cassandra Database Software ?
Explain the use of .mecia class?
When running Spark applications, is it necessary to install Spark on all the nodes of YARN cluster?