Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) How to Delete directory from HDFS?
What is the difference between Caching and Persistence in Apache Spark?
How is indexing done in Hadoop HDFS?
What is secondary namenode? Is it a substitute or back up node for the namenode?
Can any impala query also be executed in hive?
how Hadoop is different from other data processing tools?
What are the various diagnostic operators available in Apache Pig?
Is it possible to create multiple table in hive for same data?
Explain sortbykey() operation?
Is spark distributed computing?
Tell any two features of flume?
What we need to be taken care while adding a column?
What is the latest version of ambari that is available in the present market?
Name the scalar data type and complex data types in Pig?
Map reduce jobs take too long. What can be done to improve the performance of the cluster?