Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the 4 V's of big data?
How much faster is Apache Spark than Hadoop?
What are the different Complex Data Types available in Hive?
What is shuffling in MapReduce?
What is a memtable?
What are pair RDDs?
Differentiate between the various types of primary keys in Cassandra.
How can we create a child znode (sub-znode)?
How do you format HDFS?
What do you know about speculative execution?
Explain the Kafka architecture.
What is the difference between logical and physical plans?
What is the default file format to import data using Apache Sqoop?
What is skewed data?
Is it possible to join on multiple fields in Pig scripts?