Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Can you explain clustering in Mahout?
What is a cluster in Cassandra?
What is Fault Tolerance in HDFS?
What are the four characteristics of Big Data?
If the number of DataNodes increases, do we need to upgrade the NameNode in Hadoop?
What does illustrate do in Apache Pig?
Explain the sum(), max(), and min() operations in Apache Spark.
Is Hadoop mandatory for Spark?
Does the Partitioner run in its own JVM or share one with another process?
What is the process of creating an Ambari client?
What are the steps involved in decommissioning removing
What is the reason for using HBase?
Hadoop uses replication to achieve fault tolerance. How is this achieved in Apache Spark?
Does Cassandra support ACID transactions?
How many JVMs run on a slave node?