Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) How do you write comments in pig scripts?
What do you understand by Executor Memory in a Spark application?
What are the main features of SPM in Cassandra?
In which kind of scenarios MapReduce jobs will be more useful than PIG in Hadoop?
What apache spark is used for?
What is purpose of RecordWriter in Hadoop?
What is rdd in spark with example?
What are the most commonly defined input formats in Hadoop?
What are common spark ecosystems?
The partition of hive table has been modified to point to a new directory location. Do I have to move the data to the new location or the data will be moved automatically to the new location?
What happens when two clients try to access the same file on HDFS?
How are large objects handled in Sqoop?
Explain what is the function of mapreduce partitioner?
Is kafka big data?
What is ObjectInspector functionality?