Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the use of MasterServer?
When would you use hbase?
What is apache hcatalog?
Can we run spark on windows?
Do we need to install scala for spark?
Discuss writeahead logging in Apache Spark Streaming?
Explain hbasestorage function?
Is there a module to implement sql in spark? How does it work?
Do we need to place 2nd and 3rd data in rack 2 only?
What are the filters are available in apache hbase?
What is Partioner in hadoop? Where does it run
Will various customers write into an hdfs record simultaneously?
Define “speculative execution” in hadoop?
What is the difference between cache and persist in spark?
What do you understand by data center in cassandra?