Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Can you tell us how many daemon processes run on a hadoop system?
Explain how can we change the split size if our commodity hardware has less storage space?
What is replication in kafka?
What are the components of Apache Pig platform?
Difference between cassandra and mongodb?
what is distributed cache in mapreduce framework?
Compare Apache Hadoop and Apache Spark?
What are the key features of HDFS?
What is spark lineage?
How do you write comments in pig scripts?
Can you give some examples of Big Data?
Explain pipe() operation in Apache Spark?
What are different String functions available in PIG?
What is a reliable and unreliable receiver in Spark?
What does hbase consists of?