What is an accumulator in Spark?
What are the network requirements for using Hadoop?
What is a block in HDFS? What block size gives optimum performance on a Hadoop cluster?
Can the JobTracker be deployed on a node other than the NameNode?
What is LazyOutputFormat in Hadoop?
How many threads does Impala create?
Can you explain the SequenceFile format in Hadoop?
What is the difference between ORDER BY and SORT BY in Hive?
What is Azure Spark?
How do you parse XML data? Which kind of class do you use in Java to parse the data?
What is the HAVING clause in Apache Tajo?
What is the difference between Pig and Hive?
What is FLATTEN in Pig?
Is Spark a programming language?
Is Hive suitable for OLTP systems? Why?