Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Presto (15)
Apache Tajo (26)
Hadoop General (407)
Why does Hive not store metadata information in HDFS?
What are the DDL commands used in HBase?
How many instances of TaskTracker run on a Hadoop cluster?
What is a task instance in Hadoop? Where does it run?
What is a NoSQL database?
What problems can be addressed by using ZooKeeper?
What is the point of Apache Spark?
What is the difference between Flume and Kafka?
How many Mappers run for a MapReduce job?
What is Apache Spark good for?
Why does my insert statement fail?
How does HBase use ZooKeeper?
What are the types of transformations in Spark RDD operations?
What is cqlsh?
How do you control access to data in Impala?