Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the Hadoop MapReduce API contract for a key and value Class?
Can you join multiple fields in Apache
What are the ways in which Apache Spark handles accumulated Metadata?
What are nodes and ephemeral nodes?
List out the ways of creating RDD in Apache Spark?
What are the default record and field delimiter used for hive text files?
Does HBase support SQL like syntax?
What is spark good for?
Explain first() operation in Spark?
What is streaming?
Define a commodity hardware? Does commodity hardware include ram?
What do you understand by Executor Memory in a Spark application?
RLIKE in Hive?
Can you explain smb join in hive?
What is a tuple in pig?