Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is Hive Data Definition language?
Is the following approach correct? Is the sqrt Of Sum Of Sq a valid reducer?
Clarify what is sqoop in hadoop?
Explain various cluster manager in Apache Spark?
Can we run Apache Spark without Hadoop?
How will you explain COGROUP in Pig?
Do I need to know scala to learn spark?
Highlight the key differences between MapReduce and Apache Pig?
What are the commonalities between pig and hive?
How can you see the list of stored jobs in sqoop metastore?
How to handle record boundaries in Text files or Sequence files in MapReduce InputSplits?
Is sqoop an etl tool?
What are the three components of Cassandra write?
What are shared variables in spark?
What is a block in Hadoop HDFS? What should be the block size to get optimum performance from the Hadoop cluster?