Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the difference between a Dataset and a DataFrame?
Is it necessary to learn Hadoop for Spark?
How is the CONCAT function used in Hive, with an example?
What is the difference between traditional RDBMS and Hadoop?
What is the use of the Cassandra database?
What is data skew in Spark?
What are SparkContext and SparkSession?
Explain the process of spilling in MapReduce?
What do you understand by an inner bag and outer bag in Pig?
What happens if an RDD partition is lost due to a worker node failure?
Give the command to see the indexes on a table?
What are combiners, and when should you use a combiner in a MapReduce job?
How does Spark work with Python?
When to use explode in Hive?
Explain first() operation in Apache Spark RDD?