Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is log compaction?
Hbase blocksize is configured on which level?
when to choose “internal table” and “external table” in hive?
What is the difference between hbase and hdfs?
What does ambari shell can provide?
What is anti-entropy?
What is the difference between map and reduce?
What is impala’s aggregation strategy?
Explain the general mapreduce algorithm
How to restart NameNode or all the daemons in Hadoop?
Explain what is namenode in hadoop?
What is Row Key?
Explain when using field grouping in storm, is there any time-out or limit to known field values?
What do you understand by nosql cap theorem?
What are the usage of different consistency levels for write operations ?