Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is Hadoop?
What is the starvation scenario in Spark Streaming?
Can we run Spark without Hadoop?
Why does Hadoop perform replication, even though it results in data redundancy?
When should you use --target-dir and when should you use --warehouse-dir while importing data?
How do we represent data in Spark?
What is a data ingestion pipeline?
How do you categorize big data?
Who is the intended audience for learning HCatalog?
Which language is not supported by Spark?
How does HDFS communicate with the Linux native file system?
What does conf.setMapperClass do?
Should we use RAID in Hadoop or not?
What happens in text format?
How do you write your own custom SerDe?