Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the use of the MySQL connector?
Explain the NameNode and DataNode in HDFS.
What is the role of Hive/HCatalog?
What is a reduce-side join in MapReduce?
How can a client interact with Hive?
If we want to copy 10 blocks from one machine to another, but the other machine can copy only 8.5 blocks, can the blocks be split during replication?
How will you implement joins in HBase?
What is the difference between NAS (network-attached storage) and HDFS?
Can you run Spark on Windows?
What are the differences between Pig and SQL?
What is Apache HBase?
Do we require two servers for the NameNode and the DataNodes?
What is a Spark RDD?
Can you explain how it is different from doing machine learning in R or SAS?
List Hadoop’s three configuration files.