Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Difference between cassandra and mongodb?
What is kafka topic?
What is difference between client and cluster mode in spark?
Explain the Avro SASL Profile?
Explain briefly what is Action in Apache Spark? How is final result generated using an action?
Define speculative execution?
Does this lead to security issues?
What do you understand from Node redundancy and is it exist in hadoop cluster?
Do I need to know scala to learn spark?
Can copper cause a spark?
How do I optimize my spark code?
When to use hadoop, hbase, hive and pig?
How can we kill a topology?
Why do we need Hadoop Archives? How is it created?
What are the differences between PIG and SQL?