Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Define Partition and Partitioner in Apache Spark?
Since the data is replicated thrice in hdfs, does it mean that any calculation done on one node will also be replicated on the other two?
How does cassandra perform write operations?
What is difference between client and cluster mode in spark?
What is the problem with the small file in Hadoop?
What is SparkContext in Apache Spark?
Mention what is the meaning of broker in kafka?
What is the difference between like and rlike operators in hive?
State about ZooKeeper WebUI?
What is spark in big data?
What are the benefits of using Spark with Apache Mesos?
State some key Points about Apache Avro?
How to explain Bigdatadeveloper projects
Explain Reliability and Failure Handling in Apache Flume?
What does flatten do in pig?