Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Describe the run-time architecture of Spark.
How does Hadoop MapReduce work?
What is the meaning of compaction in HBase?
When should you use --target-dir and when should you use --warehouse-dir while importing data?
Define SimpleStrategy.
When can you use ALTER KEYSPACE?
Is MapReduce required for Impala? Will Impala continue to work as expected if MapReduce is stopped?
Define data cleansing.
Does Pig give any warning when there is a type mismatch or missing field?
What is Spark accreditation?
Explain the major libraries that constitute the Spark ecosystem.
Why does the Mapper run in a heavyweight process and not in a thread in MapReduce?
What are the design goals of ZooKeeper?
What is a document store database? Explain with an example.
What is a Sparse Vector?