Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How can you avoid importing tables one-by-one when importing a large number of tables from a database?
What is meant by a transformation? Give some examples.
Explain the benefits of lazy evaluation of RDDs in Apache Spark.
Explain deletion in HBase.
How many Reducers run for a MapReduce job in Hadoop?
What are the relational operations in Pig? Explain any two with examples.
Can Hive run without Hadoop?
Name some components of Cassandra.
What is a sample query in Hive?
How can I speed up my Spark jobs?
Which would you choose for a project: Hadoop MapReduce or Apache Spark?
What are the basic commands in Apache Sqoop and their uses?
What is the difference between client and cluster mode in Spark?
How is indexing done in Hadoop HDFS?
Is there any point in learning MapReduce, then?