Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Differentiate between piglatin and hiveql?
What is pig properties?
Explain Spark Streaming with Socket?
what is SPF?
Specify the different types of tables accessible in hive?
Define hadoop archives?
Do we need to install spark in all nodes?
What are the different ways of representing data in Spark?
How to start kafka server?
What does a split do?
What does serdes mean in apache kafka?
Explain what happens in text format?
On what basis name node distribute blocks across the data nodes in HDFS?
Establish the difference between a node, cluster & data centres in Cassandra.
Is hive suitable to be used for oltp systems? Why?