Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain what do you understand by cassandra- cql collections?
Can we run spark without hadoop?
What is Sqoop Import? Explain its purpose?
How can you see the list of stored jobs in sqoop metastore?
How to overwrite an existing output file during execution of mapreduce jobs?
Why we need compression and what are the different compression format supported?
What is job tracker in Hadoop?
Explain the common input formats in hadoop?
What is the number of default partitioner in hadoop?
What are the important differences between apache and hadoop?
Mention what is the meaning of broker in kafka?
Explain HCatalog Architecture in Brief?
How many times combiner is called on a mapper node in Hadoop?
How is spark sql different from hql and sql?
Why do we perform partitioning in Hive?