Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How can Apache Spark be used alongside Hadoop?
What is partitioning?
What is Impala?
In Ambari, what are the different life cycle commands?
Does this lead to security issues?
How will you read and write HDFS files in Hive?
Can free-form SQL queries be used with the Sqoop import command?
Shouldn't DFS be able to handle large volumes of data already?
How is 0xdata's H2O different from Apache Mahout?
What is a partitioner, and how is it used?
How are tables managed in Apache Tajo?
What is parallelize in Spark?
What are views in Hive?
What is the purpose of TextInputFormat?
What is Secondary NameNode in Hadoop HDFS?