Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the difference between NAS (network-attached storage) and HDFS?
What is a tuple in Spark?
In which scenarios are MapReduce jobs more useful than Pig in Hadoop?
Explain the scalar data types in Apache Pig.
Compare HBase and RDBMS.
What do you understand by Kundera?
What are the collection data types provided by CQL?
What are the configuration files in Hadoop?
What is flatMap?
What is a single point of failure in Hadoop 1 and how is it resolved in Hadoop 2?
What precautions should be taken when adding a column?
What is winutils in Hadoop?
Why Apache Spark?
Can you define a Parquet file?
What is lazy evaluation in Spark?