Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the main classes of Data Transfer API?
What is spark flatmap?
Can you explain spark streaming?
Why HCatalog?
What is the use of “void close()” method?
What is a hive in big data?
Name different elements of JConsole?
Where are hadoop’s configuration files located and list them?
Define HDFS and talk about their respective components?
Is it necessary to install spark on all the nodes of a YARN cluster while running Apache Spark on YARN ?
What is the difference between spark and python?
Is rdd type safe?
What is the best practice to deploy the secondary name node?
What is zookeper?
What database are supported by Hive?