What is a combiner, and when should you use one in a MapReduce job?
What is a MapReduce algorithm?
Explain the use of the file system API in Apache Spark.
How is MapReduce related to cloud computing?
Explain BagToString in Pig.
What is the difference between Apache Pig and Hive?
What is the default port of Presto?
How is 0xdata's H2O different from Apache Mahout?
How many instances of TaskTracker run on a Hadoop cluster?
Differentiate between Pig Latin and HiveQL.
Who divides a file into blocks when it is stored in HDFS in Hadoop?
When does an application have high latency (high response time)?
What is a Directed Acyclic Graph (DAG)?
What is a cluster in the Cassandra data model?
Write a Hive UDF that returns a sentiment score. For example, if good = 1, bad = -1, and average = 0, and a restaurant review states "Good food, bad service," the score might be 1 - 1 = 0.
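The scoring rule in the last question can be sketched in a few lines. This is only the scoring logic, not a packaged Hive UDF: a real UDF is usually written as a Java class, and a Python script like this would more likely be called through Hive's TRANSFORM clause. The names `WEIGHTS` and `sentiment_score` are hypothetical, and the word weights are the three given in the question.

```python
import re

# Word weights taken from the example in the question (assumed, extend as needed).
WEIGHTS = {"good": 1, "bad": -1, "average": 0}


def sentiment_score(review):
    """Sum the weights of known words in a review; unknown words count as 0."""
    if review is None:
        return 0
    # Lowercase the text and split it into alphabetic words, dropping punctuation.
    words = re.findall(r"[a-z]+", review.lower())
    return sum(WEIGHTS.get(word, 0) for word in words)


print(sentiment_score("Good food, bad service"))  # → 0
```

Matching is case-insensitive and ignores punctuation, so "Good" and "good," both count as +1. A production version would likely need a larger lexicon and some handling of negation ("not good"), which this sketch does not attempt.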