Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the role of ZooKeeper?
How to stop a partition from being queried?
What are the different components of the Hive architecture?
What is the maximum number of JVMs that can run on a slave node?
What is a Spark database?
How can one set a space quota on a Hadoop (HDFS) directory?
What is the use of the create-hive-table command in Sqoop?
Explain the concept of a resilient distributed dataset (RDD).
How can you send messages in Kafka?
Suppose Hadoop spawned 100 tasks for a job and one of the tasks failed. What will Hadoop do?
How will you connect Apache Spark with Apache Mesos?
How can the columns of a table in Hive be written to a file?
What are the debugging tools used for Apache Pig scripts?
Can we deploy the JobTracker on a node other than the NameNode?
What is the role of the Secondary NameNode?