Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the use of Apache Mahout?
What happens after you create a table in Hive?
How can you send messages in Kafka?
Since data is replicated three times in HDFS, does it mean that any calculation done on one node will also be replicated on the other two?
What do you mean by a bag in Pig?
What is the replication factor?
What is ZooKeeper's role in Kafka? Can we use Kafka without ZooKeeper?
What are the independent extensions that contributed to the Ambari codebase?
What is SequenceFileInputFormat?
What is a "Spark Executor"?
What is Spark code?
What do you mean by schema on read?
A partition of a Hive table has been modified to point to a new directory location. Do I have to move the data to the new location, or will the data be moved automatically?
How would you check whether your NameNode is working or not?
How does a Spark program work?