Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the key differences between Pig and MapReduce?
What is Apache Spark for beginners?
How can we control which reducer a particular key goes to?
How to write a Custom Key Class?
What is a node in Cassandra?
Explain the sequence of execution of all the components of MapReduce, such as map, reduce, RecordReader, split, combiner, partitioner, sort, and shuffle.
What is Pig used for?
What is the default replication factor in HDFS?
Explain InputSplit in Hadoop MapReduce.
What are the differences between relational databases and Impala?
Do we require two servers for the NameNode and the DataNodes?
What is the current version of Hive?
What is the difference between an RDBMS and Hadoop?
What are the four characteristics of Big Data?
What does map transformation do? Provide an example.