Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain process to access sub directories recursively in hive queries.
What happens to job tracker when namenode is down?
In MapReduce, ideally how many mappers should be configured on a slave?
how to proceed to write your first mapreducer program?
What is a hive on spark?
what job does the conf class do?
Explain slot in Hadoop Map-Reduce v1?
What is the relationship between Jobs and Tasks in Hadoop?
What is Mapper in Hadoop?
What is a "reducer" in Hadoop?
If we want to copy 10 blocks from one machine to another, but another machine can copy only 8.5 blocks, can the blocks be broken at the time of replication?
Mention what daemons run on a master node and slave nodes?
What is the difference between sort by and order by in hive?
What happen if the number of the reducer is 0 in MapReduce?
What operations does rdd support?