Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain what is the role of the zookeeper?
Why is block size large in Hadoop?
What are the relational operators available related to combining and splitting in pig language?
How do I start a spark cluster?
How do you stop a running job gracefully?
What is the Data model, and the hierarchical namespace?
What is a "Parquet" in Spark?
When should you use hbase?
After increasing the replication level, I still see that data is under replicated. What could be wrong?
What happens if number of reducers are 0?
What is the difference between a Hadoop and Relational Database and Nosql?
What is spark mapvalues?
What are the collection data types provided by CQL?
How to resolve small file problem in hdfs?
What is the role of Connector API?