What is the NameNode port number?
List the various HDFS daemons in an HDFS cluster.
Is the HDFS block size reduced to achieve faster query results?
What is a heartbeat in HDFS? Explain.
What is the use of the DISTRIBUTE BY clause in Hive?
Explain bucketing in Hive.
Explain accumulators in Apache Spark.
Explain Spark Streaming with a socket source.
Does the Partitioner run in its own JVM, or does it share one with another process?
What is an Agent?
What is a virtual node in Cassandra?
What is a Combiner?
What is the use of the shutdown command?
Before deploying a Hadoop instance, what checks should be performed?
What is standalone mode in a Spark cluster?