Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the latest version of spark?
How are large objects handled in Sqoop?
Why we use parallelize in spark?
How can you compare Hadoop and Spark in terms of ease of use?
Discuss the precautions that are needed to take care while adding a column?
Can you modify the file present in hdfs?
how can we change Replication Factor?
State some impala hadoop benefits?
Ideally what should be replication factor in a Hadoop cluster?
What is key-value store db? Explain with an example.
What is tasktracker in hadoop?
How hbase uses zookeeper?
How is impala metadata managed?
What is Internal and External table in Hive?
What is Chain Mapper?