Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Define data integrity?
Define a sequence file in hadoop?
What is a Column family in hbase?
How the write operation is performed on Cassandra node ?
Explain slot in Hadoop Map-Reduce v1?
Mention the difference between hbase and relational database?
What is spark slang for?
What is Hadoop streaming?
What is the difference between hive and spark?
What is standalone mode in spark?
How can I restart namenode?
How to set mappers and reducers for Hadoop jobs?
Why hive does not store metadata information in hdfs?
Map reduce jobs take too long. What can be done to improve the performance of the cluster?
Define Cluster?