Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is spark vectorization?
How the read operation is performed on Cassandra node ?
Explain textloader function?
Is it necessary to install spark on all the nodes of a YARN cluster while running Apache Spark on YARN ?
In which scenario Hive is good fit?
Explain pigstorage function?
Does hadoop install spark?
Is it possible to use same metastore by multiple users, in case of embedded hive?
List the various types of "Cluster Managers" in Spark.
What are the components of presto architecture?
How does gossip protocol help in failure detection?
What happens if you get a ‘connection refused java exception’ when you type hadoop fsck /?
What does producer api in kafka?
Write a short note on the disadvantages of mapreduce
Explain the master class and the output class do?