Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) In ambari what are the different life cycle commands?
Define the common faults of the developer while using apache spark?
How many instances of JobTracker can run on a Hadoop Cluser?
Why are Replications critical in Kafka?
Hdfs stores data using commodity hardware which has higher chances of failures. So, how hdfs ensures the fault tolerance capability of the system?
What is the difference between Input Split and an HDFS Block?
What do you understand by cassandra?
Is avro supported?
What are the features of Pseudo mode?
How to process data using Transformation operation in Spark?
What is number of executors in spark?
What is mandatory while creating a table in cassandra?
What is Client API?
What is fluming?
What is the use of cloudera?