Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
what is gossip protocol?
What is paired rdd in spark?
What is prepare() method in Cassandra?
What is difference between spark and mapreduce?
How rdd can be created in spark?
Explain the Job OutputFormat?
Name types of Cluster Managers in Spark.
What is the benifit of Distributed cache, why can we just have the file in HDFS and have the application read it?
How spark is faster than hadoop?
Who invented spark?
What is a namenode?
What are the types of Apache Spark transformation?
What port does spark use?
List the benefits of using Cassandra.
Compare HBase vs RDBMS?