Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Can you explain worker node?
List the configuration parameters that have to be specified when running a MapReduce job.
Clarify what is sequence file input format?
What is a partition in spark?
Can Ambari manage multiple clusters?
What are the independent extensions that contributed to the ambari codebase?
TRIM function in Hive with example?
What are the benefits of Spark over MapReduce?
What ensures load balancing of the server in Kafka?
Define "Transformations" in Spark
Name the operating system(s) which are supported for production hadoop deployment?
What is a speculative execution in Apache Hadoop MapReduce?
What is the use of cassandra cql collection?
Name the scalar data type and complex data types in Pig?
Is spark good for machine learning?