Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain fullOuterJoin() operation in Apache Spark?
Where is kafka used?
Mention what is the next step after mapper or maptask?
Explain what is a cluster in cassandra?
Why do we need sparkcontext?
How to set the number of reducers?
What is difference between cache and persist in spark?
Explain the role of the offset?
What are the three components of Cassandra write?
What are the benefits of block transfer?
List some use cases where classification machine learning algorithms can be used.
Describe SPM?
Explain countByValue() operation in Apache Spark RDD?
What is the required action you need to perform if you opt for scheduled maintenance on the cluster nodes?
How can a developer utilize hive?