Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is a primary key? And what are it’s different types?
What is the relationship between Jobs and Tasks in Hadoop?
What is the future of apache spark?
Where are hadoop’s configuration files located and list them?
Explain Erasure Coding in Hadoop?
What can be optimum value for Reducer?
What are the types of cluster managers in spark?
Give me an example of document database ?
How multi-hop agent can be setup in Flume?
What is use of tools command?
How to Delete directory and files recursively from HDFS?
What do you mean by replication factor?
State some Ambari components which we can use for automation as well as integration?
Give key features of any NoSQL database?
How does hbase actually delete a row?