Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How do you use Hive from the command line and Beeline?
Explain the difference between an input split and an HDFS block.
What do you think about speculative execution?
Why should we use 'filters' in Pig scripts?
Explain task granularity
How can a user find the version of cqlsh?
What is a key-value pair in MapReduce?
Does HBase support an SQL-like syntax? Yes or no?
What are the data components used by Hadoop?
What are the different data types available in Hive?
Explain the LOAD keyword in a Pig script.
Explain what the distributed cache is in the MapReduce framework.
Define data centre.
For what purpose would an engineer use Spark?
Why is Spark faster than Hadoop?