Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How can I improve my spark performance?
What are the side data distribution techniques?
What is the default block size in hdfs?
How will you read and write HDFS files in Hive?
What does reduce action do?
Specify some uses of HBase?
What is the key- value pair in Hadoop MapReduce?
what is the traditional method of message trfer?
What do you understand by column family?
Which java class handles the output record encoding into files which result from Hive queries?
What is the purpose of retention period in Kafka cluster?
how is a file of the size 1 GB uncompressed
How one can change Replication factor when Data is already stored in HDFS
Can we have multiple entries in the master files?
Are spark dataframes distributed?