Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the difference between apache mahout and cloudera oryx ?
What are the characteristics of hadoop framework?
How to copy file from HDFS to local?
What is the default partition in spark?
Explain coalesce operation in Apache Spark?
What is Internal and External table in Hive?
What is hive metastore?
If reducers do not start before all mappers finish then why does the progress on mapreduce job shows something like map(50%) reduce(10%)? Why reducers progress percentage is displayed when mapper is not finished yet?
Is the keyword ‘FUNCTIONAL’ a User Defined Function (UDF)?
Explain what happens if you alter the block size of a column family on an already occupied database?
What is pig statistics?
What do you understand by unit and ()in scala?
Why is block size large in Hadoop?
Who creates dag in spark?
What is dataframe api?