Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is a "reducer" in Hadoop?
Is bigger than spark.driver.maxResultSize?
Describe the distinct(), union(), intersection() and subtract() transformations in Apache Spark RDD?
Why do we need Apache Pig?
What is your favourite tool in the Hadoop ecosystem?
What are the various libraries available on top of Apache Spark?
Cloudera already provides a cluster, but can we set up a cluster ourselves on Ubuntu?
Define a worker node?
Is Hive similar to SQL?
Explain the current Hadoop configuration files?
What are views in Hive?
What is a Record Reader in Hadoop?
What is the core of the job in the MapReduce framework?
How can we launch a Tajo cluster?
What does sorting do?