Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the task of the Spark engine?
Which companies use Hadoop?
What is a JobTracker in Hadoop? How many instances of JobTracker run on a Hadoop cluster?
What are Pig properties?
What host-management tasks can be performed from the Hosts tab in Ambari?
What are the differences between Pig and Hive?
How many instances of TaskTracker run on a Hadoop cluster?
What is a message broker?
Is there any difference between FileSink and FileRollSink?
What is the purpose of the DataNode block scanner?
What is the use of the version command in Hadoop Sqoop?
What is Apache Tajo?
Is it possible to join on multiple fields in Pig scripts?
What are partitions and tokens in Cassandra?
In which scenarios is Hive a good fit?