Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is lineage graph in spark?
What are the configuration files in Hadoop?
What will be the consideration while we do Hardware Planning for Master in Hadoop architecture?
List the diagnostic operators in pig.
What are the functionalities of jobtracker?
Does spark use mapreduce?
What does the high availability of a name-node means?
What is the use of checkpoints in spark?
How can one increase replication factor to a desired value in Hadoop?
What does flatten do in pig?
how will you implement SQL in Spark?
How do users interact with the shell in apache pig?
mapper or reducer?
What is a MapFile?
Which one would you recommend for hbase table design approach – tall-narrow or flat wide?