Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the purpose of TextInputFormat?
Which companies mostly use Hive?
What is logging in Cassandra?
Why can we not create the directory /user/dataflair/inpdata001 when the NameNode is in safe mode?
What is a document store database?
What is the difference between an InputSplit and a block?
What is HBase?
Who should learn Apache Ambari?
What happens if you get a ‘Connection refused’ Java exception when you run hadoop fsck /?
What is Spark in big data?
What is the advantage of Cassandra?
How do you fetch particular columns in Pig?
What is setMaster in Spark?
How do Hadoop 2 and Hadoop 3 compare?
What is a NoSQL database?