Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is an accumulator in Spark?
How do you set mappers and reducers for Hadoop jobs?
What is the use of the exists command?
What is the difference between map and flatMap in Spark?
Is a JDBC driver enough to connect Sqoop to databases?
How does Apache Pig deal with schema and schema-less data?
What do you mean by NameNode High Availability in Hadoop?
What is lazy evaluation in Spark?
What is a Hadoop custom partitioner?
What is HBase Shell?
Can you define what an Event Serializer is in Flume?
Explain how data is partitioned before it is sent to the reducer if no custom partitioner is defined in Hadoop.
What is heap memory in Spark?
Can a view have the same name as a Hive table?
Is it possible for a job to have 0 reducers?