Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the main components of spark?’
How is the processing of streaming data achieved in Apache Spark? Explain.
In which kind of scenarios MapReduce jobs will be more useful than PIG in Hadoop?
What is a Sparse Vector?
What are the important features of hadoop?
What is the meaning of the term "non-DFS used" in Hadoop web-console?
What is the usage of "void close()" method?
How many daemon processes run on a hadoop cluster?
What is connection_loss error?
What is the advantage of cassandra?
What is Hive Database?
Explain the different types of repairs.
What are the scalar data types in Pig?
Why spark is faster than hadoop?
Mention what is rack awareness?