Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How will you connect Apache Spark with Apache Mesos?
What is the role of Driver program in Spark Application?
How does apache spark engine work?
Is there any API available for implementing graphs in Spark?
Do we require two servers for the namenode and the datanodes?
Can you give a detailed overview about the Big Data being generated by Facebook?
Why are Replications critical in Kafka?
Which database is used in hadoop?
What are file permissions in HDFS and how HDFS check permissions for files or directory?
What do you mean by Stream Processing in Kafka?
Is it possible to add 100 more nodes when we already have 100 nodes in Hive?
What are the different Primitive Data Types available in Hive?
Why do the nodes are removed and added frequently in a hadoop cluster?
Why is Apache Spark faster than Apache Hadoop?
What are the common mistakes developers make when running Spark applications?