Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is a Secondary NameNode? Is it a substitute for the NameNode?
Which complex data types does Avro support?
What are the basic commands available in Apache Sqoop?
What is the purpose of RecordReader in Hadoop?
How can you get exactly-once messaging from Kafka during data production?
What is the difference between HBase and an RDBMS?
Which command is used to show the current HBase user?
What is HBase?
Why HDFS?
How is a file read in HDFS?
What is a pair RDD in Spark?
After increasing the replication factor, I still see that data is under-replicated. What could be wrong?
What is the use of Sqoop?
How does Spark work?
What is the difference between Caching and Persistence in Apache Spark?