Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
List out some key features of apache cassandra?
What is cluster in Cassandra?
What is data replication in Cassandra?
What is Sqoop Validation?
What is the role of zookeeper in hbase?
How are sparks created?
What are the ways to create RDDs in Apache Spark? Explain.
What is the function of "MLlib"?
What do you mean by Free Form Import in Sqoop?
How to Write a UDF function in Hive?
Explain how are file systems checked in hdfs?
Differentiate between piglatin and hiveql?
Why is transformation lazy operation in Apache Spark RDD? How is it useful?
What are the various levels of persistence in Apache Spark?
Web-ui shows that half of the datanodes are in decommissioning mode. What does that mean? Is it safe to remove those nodes from the network?