Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Is it possible to leverage real time analysis on the big data collected by flume directly? If yes, then explain how?
77What does the following query do? Insert overwrite table employees partition (country, state) select ..., Se.cnty, se.st from staged_employees se;
962While loading data into a hive table using the load data clause, how do you specify it is a hdfs file and not a local file ?
770
What are the key elements in ZooKeeper Architecture?
State use cases of impala?
What is the difference between spark and hive?
What is Importance of Java in Apache Kafka?
Explain what is the purpose of RecordReader in Hadoop?
What is the purpose of button groups?
Can you list down the limitations of using Apache Spark?
Define standalone mode in hbase?
What are Paired RDD?
What are different modes of metastore deployment in Hive?
How does a log flume work?
What happens if the block in HDFS is corrupted?
What jobtracker is in hadoop? What are the activities followed by hadoop?
Clarify what jobtracker is in hadoop? What are the activities followed by hadoop?
What is the difference betwaeen mapreduce engine and hdfs cluster?