Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is anti-entropy?
There seem to be certain management tools in Cassandra. What are they?
What is the best way to copy files between HDFS clusters?
Explain what is logging in Cassandra?
How much Metadata will be created on NameNode in Hadoop?
What according to you is a common mistake apache spark developers make when using spark ?
What do you understand by Commit log in Cassandra?
How Pig differs from MapReduce?
Explain in which directory hadoop is installed?
What do you mean by logging in cassandra?
What is the function of Cluster.Builder class in Cassandra?
Can you explain about the indexing process in hdfs?
Why should I use spark?
How Pig programming gets converted into MapReduce jobs?
How can you use streams api?