Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Presto (15)
Apache Tajo (26)
Hadoop General (407)
What port does Spark use?
Describe the coalesce() operation. When can you coalesce to a larger number of partitions? Explain.
What do we need to take care of when adding a column?
What is meant by a DataNode?
What are the differences between Pig and SQL?
What is the use of SSH in Hadoop?
Do I need to learn Scala for Spark?
I have a relation r. How can I get the top 10 tuples from the relation r?
Does HDFS allow a client to read a file that is already open for writing in Hadoop?
What is the next step after the mapper or map task?
What is the difference between a reducer and a combiner?
Are Spark and Hadoop the same?
What is Apache Hadoop?
What are the data storage units in Cassandra?
What happens if the preferred replica is not in the ISR?