Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the important differences between apache and hadoop?
Does Partitioner run in its own JVM or shares with another process?
What is a "Parquet" in Spark?
Why is cqlsh used?
What a task tracker is in hadoop?
How do you write comments in pig scripts?
What are distinct operators in impala?
State some command line options?
What are the components of apache ambari architecture?
Mention what are the data components used by Hadoop?
How should you handle session_expired?
Is spark an etl?
What is KeyValueTextInputFormat in Hadoop MapReduce?
What is sc parallelize?
What is apache spark core?