Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How many InputSplits will be made by hadoop framework?
Replication causes data redundancy and consume a lot of space, then why is it pursued in hdfs?
What are the possible Job roles?
How can Spark be connected to Apache Mesos?
What is Hive Present Version ?
Explain the difference between mapreduce engine and hdfs cluster?
The difference between GROUP and COGROUP operators in Pig?
Explain SparkContext in Apache Spark?
Do I need to know scala to learn spark?
What is Interceptor?
Explain Apache Ambari architecture?
Is spark sql a database?
How to overwrite an existing output file/dir during execution of Hadoop MapReduce jobs?
What do you understand by the super column in cassandra?
Give examples of some companies that are using Hadoop structure?