Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Name the languages which are supported by apache spark and which one is most popular?
What is the benifit of Distributed cache, why can we just have the file in HDFS and have the application read it?
What does ambari shell can provide?
When to choose "External Table" in Hive?
What daemons run on master nodes?
What is a distributed cache in mapreduce framework?
What is row in hbase?
Can you explain apache spark?
Difference Between Hadoop and HDFS?
What is the inputsplit in map reduce software?
What are the complicated steps in Flume configurations?
What is sc textfile?
What is Bucketing and Clustering in Hive?
Please explain the sparse vector in Spark.
Can I run an ensemble cluster behind a load balancer?