Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
When should you use HBase?
What is Sqoop?
How is data partitioned before it is sent to the reducer if no custom partitioner is defined in Hadoop?
What is the current version of Hive?
What are the basic commands in Apache Sqoop and their uses?
What is a UDF?
What is a Kafka message?
Name a few other popular column-oriented databases like HBase.
What are the main features of hdfs-site.xml?
What is Cassandra?
What is Apache Avro?
Give me an example of a document database.
Explain how indexing is done in HDFS.
What are some of the best tools for data analysis?
How does data transfer happen from HDFS to Hive?