Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How can you achieve high availability in Apache Spark?
What do you understand by Consistency in Cassandra?
What is Apache Pig?
What is the best way to copy files between HDFS clusters?
Which one is default?
Explain what the distributed cache is in the MapReduce framework.
What are the data components used by Hadoop?
Explain the input and output data formats of the Hadoop framework.
What is the Metastore in Hive?
What is the difference between a split and a block in Hadoop?
How does the Pig platform handle relational systems data?
What are a block and a block scanner in HDFS?
What is a keyspace in Cassandra?
What is the relationship between Apache Hadoop, HBase, Hive, and Cassandra?
Explain deletion in HBase.