Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the best features of Apache Sqoop?
When do you have to avoid secondary indexes?
What is the CAP theorem?
What is the difference between Hadoop and HDFS?
What other technologies have you used in the Hadoop stack?
What are the cases where Apache Spark surpasses Hadoop?
When using field grouping in Storm, is there any timeout or limit on known field values?
How do you check if a particular partition exists?
Why HCatalog?
What happens to the NameNode when the JobTracker is down?
Which files are used by the startup and shutdown commands?
What happens if you alter the block size of a column family on an already populated database?
What is the difference between HDFS and NAS?
What is a ZooKeeper cluster?
What is the difference between the MapReduce engine and an HDFS cluster?