Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How do I change hive execution engine to spark?
What are some alternatives to apache kafka?
Can you define inputsplit in hadoop?
How to managed create a table in hive ?
What is the difference between spark and python?
What is big data spark?
What is the difference between coalesce and repartition in spark?
What is difference between spark and hadoop?
Can we have different replication factor of the existing files in hdfs?
What is the Cassandra Coefficient ?
What is the purpose of JConsole?
What does a Spark Engine do?
How to setup the local repository manually?
Explain Hadoop streaming?
How can you native libraries be included in yarn jobs?