Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What happens in a textinputformat?
What is Cassandra Database Software ?
Is a job split into maps?
What is SuperColumn in Cassandra?
Explain what are the tools used in Big Data?
What are the types of traditional method of message transfer?
What is difference between an input split and hdfs block?
Why is block size set to 128 MB in Hadoop HDFS?
Explain a scenario where you will be using spark streaming.
Whats is distributed cache in hadoop?
What is a metastore in hive?
Can we create a hadoop cluster from scratch?
What are the primary phases of a Reducer?
What does repartition do in spark?
Explain about the different cluster managers in Apache Spark