What is TTL (time to live) in HBase?
What are the modules that constitute the Apache Hadoop 2.0 framework?
What is a bag?
What are Hadoop archives? What is the command for archiving a group of files in HDFS?
What is the difference between Hive and Impala?
How many JVMs run on a slave node?
What is a pair RDD in Apache Spark?
What are some instances where ZooKeeper is used?
What is the use of the map transformation?
What file systems does Spark support?
What is Hive on Spark?
Is it possible to have Hadoop job output in multiple directories?
How is 0xdata's H2O different from Apache Mahout?
What are the tools used in big data?
What is "Non DFS Used" in the HDFS web console?