Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is a Sparse Vector?
What are the different components that are available in kafka?
What are the data components used by Hadoop?
How can we create a hadoop cluster from scratch?
Do you need to install spark on all nodes of yarn cluster?
What are the great features of spark sql?
What are “Seed Nodes” in Cassandra?
What is hinted handoff?
What are the data formats supported by apache tajo?
What features from relational databases or hive are not available in impala?
What are mapreduce new and old apis while writing map reduce program?. Explain how it works
How can we see only top 15 records from the student.txt out of100 records in the HDFS directory?
Explain Spark saveAsTextFile() operation?
What is kafka in hadoop?
Can we rename the output file?