Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Explain about the execution pl of a pig script?
or
differentiate between the logical and physical plan of an apache pig script?
Is impala intended to handle real time queries in low-latency applications or is it for ad hoc queries for the purpose of data exploration?
43
Where is table data stored in Apache Hive by default?
What are the key elements in ZooKeeper Architecture?
Explain the term sstables?
What do shuffling do?
Describe SPM?
What is Disk Balancer in Hadoop?
What are the 2 modes used to run pig scripts?
Name some AVRO Reference APIs?
Explain how can we check whether namenode is working or not?
Explain textFile Vs wholeTextFile in Spark?
What is difference between coalesce and repartition?
Name the most common input formats defined in hadoop?
Define actions in spark.
Which one will you decide for an undertaking – Hadoop MapReduce or Apache Spark?
What are znodes?