Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What are the four basic parameters of a mapper?
What is shuffling and sorting in Hadoop MapReduce?
How we can take Hadoop out of Safe Mode?
Mention the date data type in hive. Name the hive data type collection.
State some impala hadoop benefits?
What is a bookkeeper client in bookkeeper?
What is Writable & WritableComparable interface?
Explain api create or replace tempview()?
How to show up details in pig ?
Can you explain about the cluster manager of apache spark?
Establish the difference between a node, cluster & data centres in Cassandra.
What is pair rdd in spark?
why should we use 'filters' in pig scripts?
Are there any problems which can only be solved by MapReduce and cannot be solved by PIG? In which kind of scenarios MR jobs will be more useful than PIG?
List out the some common problems faced by data analyst?