What is the use of ZooKeeper?
What is the default replication factor in HDFS?
Where is the mapper's intermediate data stored?
What are Flume's core components?
Can you define a UDF?
Which operating system(s) are supported for production Hadoop deployment?
Explain the different types of repairs.
How will you explain COGROUP in Pig?
When and how do you create a Hadoop archive?
How can I restart the NameNode?
What is structured and unstructured data?
Explain the partitioning, shuffle, and sort phases.
How does Cassandra perform write operations?
Please provide an explanation on DStream in Spark.
MapReduce jobs take too long. What can be done to improve the performance of the cluster?