Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain the execution plan of a Pig script, or differentiate between the logical and physical plans of an Apache Pig script.
Is Impala intended to handle real-time queries in low-latency applications, or is it for ad hoc queries for data exploration?
What is winutils in Hadoop?
What is Apache Spark and what are the benefits of Spark over MapReduce?
What are the different types of tombstone markers in HBase for deletion?
What is meant by high availability of the NameNode? How is it achieved?
What is Apache Flume?
Define TTL in HBase.
Explain the features of HBase.
How can you send messages in Kafka?
Explain the BloomMapFile.
How is anti-entropy associated with Merkle trees?
Define the primary key in Apache Cassandra.
Explain the data flow in Flume.
What are the libraries of Spark SQL?
Can you use Spark to access and analyze data stored in Cassandra databases?
How can you trigger automatic clean-ups in Spark to handle accumulated metadata?