Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Describe Replication Factor?
Explain the difference between NameNode
What is pseudo-distributed mode?
What is the method to create a data frame?
How Cassandra stores data?
What relational operators can we use that are related to combining and splitting in Pig language?
Define column families?
what is the difference between order by and sort by in Hive?
Differentiate between describe and describe extended?
How Apache Pig deals with the schema and schema-less data?
Can you explain data versioning?
Explain SparkContext in Apache Spark?
Which scala library is used for functional programming?
What is meant by in-memory processing in Spark?
What are hive operators and its types?