Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Can we say that COGROUP is a grouping of more than one data set?
What do you mean by consistency in Cassandra?
What port does Spark use?
Is it possible to rename the output file?
Where do you specify the Mapper implementation?
What is the difference between Hive and HBase?
How does Kafka communicate with clients and servers?
What is setMaster in Spark?
What is the syntax of the command used to drop a partition?
Can you explain rack awareness?
Can we change the replication factor on a live cluster?
Which languages does Apache Spark support?
What happens if one of the DataNodes has a much slower CPU?
What are the key features of Apache Spark that you like?
How does data type conversion work in Hive?