What port does Spark use?
What filters are available in Apache HBase?
How is a Mapper instantiated in a running job?
What are the identity mapper and reducer in MapReduce?
Is the keyword 'DEFINE' like a function name?
Can Ambari manage multiple clusters, and why?
What are brokers in Kafka?
What is the purpose of TextInputFormat?
Who created Spark?
Suppose Hadoop spawned 100 tasks for a job and one of the tasks failed. What will Hadoop do?
What is HiveServer2 (HS2)?
What is Spark and what is its purpose?
What is mandatory while creating a table in Cassandra?
Define "PageRank".
What is a skewed join?