Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Hadoop sqoop is which type of tool?
What is the problem in having lots of small files in hdfs?
What are the other components of Cassandra?
What is Combiner in Hadoop?
What kind of applications is supported by Apache Hive?
Explain what is composite type in cassandra?
What is broadcast variable?
Is a log flume a roller coaster?
What is the benifit of Distributed cache, why can we just have the file in HDFS and have the application read it?
Why hbase is a schema-less database?
Can we have different replication factor of the existing files in hdfs?
What is the full form of MSLAB?
How to write 'foreach' statement for map datatype in pig scripts?
For a job in Hadoop, is it possible to change the number of mappers to be created?
Explain what does hbase consists of?