Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What are the common hadoop PIG interview questions, that you have been asked in a Hadoop Job Interview?
621You have a file personal_data.txt in the HDFS directory with 100 records. You want to see only the first 5 records from the employee.txt file. How will you do this?
837
Is it possible to run Apache Spark on Apache Mesos?
What are the features and characteristics of Apache Spark?
What is parallelize in spark?
Why should we use presto?
Discuss about the different tombstone markers used for deletion purposes in HBase.?
When Namenode is down what happens to job tracker?
What is Hive Database?
What is a "Spark Executor"?
What is ColumnFamily?
Since the data is replicated thrice in hdfs, does it mean that any calculation done on one node will also be replicated on the other two?
How to drop database in apache tajo?
What is the purpose of RecordReader in hadoop?
What are the main features and Characteristics of Hadoop which makes it the most popular and powerful Big Data tool?
What are advantages of Spark over MapReduce?
You have a file personal_data.txt in the HDFS directory with 100 records. You want to see only the first 5 records from the employee.txt file. How will you do this?