Hadoop General (407)
What are the core benefits of Apache Ambari for Hadoop users?
What are the key components of the Hive architecture?
Which Scala library is used for functional programming?
Why is there a need for broadcast variables when working with Apache Spark?
On which port does SSH work?
Describe JMX.
How do you identify whether a given operation in your program is a transformation or an action?
Explain the AvroStorage function.
What is a Hadoop archive?
What operations does an RDD support?
Can you explain data versioning?
Illustrate a simple example of the working of MapReduce.
Is it possible to have Hadoop job output in multiple directories?
What is SequenceFileInputFormat in Hadoop MapReduce?
Elaborate on Cassandra CQL.