Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How can you implement machine learning in Spark?
What do you understand by an inner bag and outer bag in Pig?
List the other components of Cassandra.
Differentiate between the Reducer and the Combiner in Hadoop MapReduce.
What are the configuration files in Hadoop?
Why does Spark skip stages?
What is Distributed Cache in Hadoop?
Which language is more suitable for text analytics: R or Python?
What does the file hadoop-metrics.properties do?
Hadoop uses replication to achieve fault tolerance. How is this achieved in Apache Spark?
What is the difference between DROP and TRUNCATE in CQLSH?
How can we drop a table in HCatalog?
What does HDFS mean?
What is Apache Avro?
Can we change the data type of a column in a Hive table?