Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Does spark use java?
What are the network requirements for hadoop?
What is sc parallelize in spark?
What is the role of zookeeper in hbase?
Where is the output of Mapper written in Hadoop?
What are the features of Pseudo mode?
How Spark handles monitoring and logging in Standalone mode?
Why are we using Flume?
What is a hive in big data?
Can Flume can distribute data to multiple destinations?
What are possible types of Channel Selectors?
Mention the salient features of apache tajo ?
How does hdfs get a good throughput?
File permissions in HDFS?
Why are spark transformations lazy?