Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Explain how to write the output into a file using storm?
What are broadcast variables in Apache Spark? Why do we need them?
Is hadoop a memory?
What is spark application?
Rack awareness of Namenode?
What is HDFS block size and what did you chose in your project?
What is bag data type in Pig?
What is Output Format in MapReduce?
Where is spark used?
What is spark code?
What is the difference between Hadoop and RDBMS?
What is Apache Pig?
What is column store db? Explain with an example.
What is pipelined rdd?
What is difference between split and block in hadoop?