Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Why is Apache Spark faster than Hadoop?
How can we scale Apache Mahout in the cloud?
What are the steps to submit a Hadoop job?
What do you understand by logging in Cassandra?
Can you define RDD?
Why is block size set to 128 MB in Hadoop HDFS?
What are the scalar data types in Pig?
Can you define a Parquet file?
What is a scarce system resource?
How do you read a file in HDFS?
If a DataNode is full, how is it identified?
What are the libraries of Spark SQL?
What must we know to work well with ZooKeeper?
What are the three modes in which Hadoop can be run?
What is Apache Kafka?