Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How can we see all the hosts that are available in Ambari?
What do you mean by “data centre” in cassandra?
What are the additional benefits YARN brings in to Hadoop?
What apache spark is used for?
What is column store db? Explain with an example.
What is apache spark good for?
Define ttl in hbase?
If you run a select * query in hive, why does it not run mapreduce?
What is flatmap?
What is Hadoop HDFS – Hadoop Distributed File System?
What is a Cluster, Node and Key space in Cassandra ?
What are the main configuration parameters in a MapReduce program?
What is a JobTracker in Hadoop? How many instances of JobTracker run on a Hadoop Cluster?
Explain the flatMap operation on Apache Spark RDD?
What are spark stages?