Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the precedence order of hive configuration?
What are components of ambari tjat are important for automation and integration?
What is the command to start and stop the Spark in an interactive shell?
Query language is executed in Cassandra database. Clarify?
What do you understand by standalone (or local) mode?
LOWER or LCASE function in Hive with example?
What is Implicit Type conversion in Hive?
Explain Spark map() transformation?
MapReduce Types and Formats and Setting up a Hadoop Cluster?
What happens if rdd partition is lost due to worker node failure?
What is Resilient Distributed Dataset (RDD) in Apache Spark? How does it make spark operator rich?
Is piglatin a strongly typed language? If yes, then how did you come to the conclusion?
Define yum?
Replication causes data redundancy then why is pursued in hdfs?
Why we use parallelize in spark?