Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Can you explain record reader?
What are the steps involved in MapReduce framework?
Do we need to install spark in all nodes?
Name few companies that are the uses of apache spark?
What is a scarce system resource?
what is Memtable in Cassandra?
Describe Accumulator in detail in Apache Spark?
Are results returned as they become available, or all at once when a query completes?
Explain some important features of hadoop?
Explain about the common workflow of a Spark program?
What are the four modules that make up the Apache Hadoop framework?
What database are supported by Hive?
What is the difference between Hadoop and Traditional RDBMS?
What is application master in spark?
Name the operations supported by rdd?