Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Can we broadcast an rdd?
Explain about the data model operations in HBase?
What types of costs are associated in creating index on hive tables?
What is project tungsten in spark?
What impala use for authentication?
Why do we need Hadoop Archives? How is it created?
What happen if a datanode loses network connection for a few minutes?
Explain the memtable in cassandra?
Who divides the file into Block while storing inside hdfs in hadoop?
Why there is need of pig language?
What is Spark Core?
What's rdd?
What is mahout hadoop?
What are the Advantages of using Cassandra ?
Explain InputSplit in Hadoop MapReduce?