Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Tell me about the execution modes of Apache Pig?
Can I use impala to query data already loaded into hive and hbase?
what is Zookeeper in Kafka? Can we use Kafka without Zookeeper?
What is the default spark executor memory?
What is spark executor cores?
What is Apache HBase?
How to setup the local repository manually?
What is rdd in spark with example?
Explain the Parquet File format in Apache Spark. When is it the best to choose this?
What does a 'MapReduce Partitioner' do?
Explain about the basic parameters of mapper and reducer function
How can you add a new partition for the month December in the above partitioned table?
How to write MapReduce Programs?
How can you manually partition the rdd?
What is the key difference between NameNode and DataNode in Hadoop?