Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Can we use the Ambari Python client to call the Ambari APIs?
What are the major areas in which Ambari helps system administrators?
Explain how HBase actually deletes a row.
What is a primary key, and what are its different types?
What do you understand by high availability?
What values are stored in a Cassandra column?
What are the three modes in which Hadoop can run?
What do you mean by ZNode?
For a job in Hadoop, is it possible to change the number of mappers to be created?
Can you explain Spark MLlib?
Is Spark SQL faster than Hive?
Does archiving Hive tables save any space in HDFS?
What are consumers in Kafka?
How can we view the NameNode in a browser?
Explain the Reducer's sort phase.