Which values are stored in a Cassandra column?
Why does HDFS store data on commodity hardware despite the higher chance of failure?
What is the purpose of the dfsadmin tool?
What do you mean by cassandra-cqlsh?
What does conf.setMapperClass do?
How do you control access to data in Impala?
Write a Pig UDF example.
What is InputFormat in Hadoop?
Who invented Spark?
What is the difference between SparkSession and SparkContext in Apache Spark?
Name a few companies that use Apache Spark in production.
What types of costs are associated with creating an index on Hive tables?
What are some disadvantages of Avro?
What is the Spark driver?
What is a lineage graph in Spark?