Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Is it necessary to write jobs for hadoop in the java language?
List the advantage of Parquet files?
Why is space not freed up when I issue drop table?
List various commonly used machine learning algorithm?
What is a spark standalone cluster?
How does cassandra perform write operations?
Can I set the number of reducers to zero?
What do you mean by inputformat?
Tell any two feature Flume?
How can I delete the above index named index_bonuspay?
What is ZooKeeper Atomic Broadcast (ZAB) protocol?
Why the name ‘hadoop’?
What is a JobTracker in Hadoop? How many instances of JobTracker run on a Hadoop Cluster?
Which components are used for stream flow of data?
How would you use Map/Reduce to split a very large graph into smaller pieces and parallelize the computation of edges according to the fast/dynamic change of data?