Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What do you mean by consistency in Cassandra?
Use of Help command in Hadoop sqoop?
Is HDFS utilized in Cassandra? If yes, where?
Explain jmx concerning hbse
Clarify about the smb join in hive?
Explain first() operation in Apache Spark RDD?
How to handle bad records during parsing?
Different ways of debugging a job in MapReduce?
What is the significance of the line set hive.mapred.mode = strict;?
What do you mean by replication strategy?
How can one increase replication factor to a desired value in Hadoop?
What is the purpose of button groups?
What are the side data distribution techniques?
Define cell in HBase?
Highlight the key differences between MapReduce and Apache Pig?