Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Tell any two feature Flume?
What are the scalar data types in Pig?
What is spark shuffle?
What is Erasure Coding in Hadoop?
What is the difference between TextinputFormat and KeyValueTextInputFormat class?
What is Derby database?
what needs to be taken care while adding a Column?
What is the difference between rdd and dataframe?
What is the use of flume in hadoop?
What are the transformations in spark?
Why do we need Pig?
What is a local repository and when it is useful while using ambari environment?
What are possible types of Channel Selectors?
How will you submit extra files or data ( like jars, static files, etc. ) For a mapreduce job during runtime?
What is job tracker role in hadoop?