Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How to use 'foreach' operation in pig scripts?
Define Cassandra?
What is executor in spark?
What are complex data types in pig?
Say when to pick “inward table” and “outside table” in hive?
What are combiners and its purpose?
What are the usage of different consistency levels for write operations ?
How does Mappers run method works?
Can you explain bloommapfile.
Highlight the difference between group and Cogroup operators in Pig?
What is dataframe api?
Explain the master class and the output class do?
How tasks are created in spark?
What are the key differences between cassandra and traditional rdbms?
Why is sqoop is used?