Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain what is the role of the zookeeper?
What type of data hadoop can handle ?
What is Rack Awareness? What is its need in Hadoop?
Is it possible to use same metastore by multiple users, in case of embedded hive?
What are the three layers where the hadoop components are actually supported by ambari?
How the SSTable is different from other relational tables?
While loading data into a hive table using the load data clause, how do you specify it is a hdfs file and not a local file ?
What are the particular functionalities of Ganglia in Ambari?
Can I run Apache Spark without Hadoop?
What is a "Parquet" in Spark?
what is the default replication factor in HDFS?
What happens to existing data in my cluster when I add new nodes?
Hadoop sqoop is which type of tool?
How does job tracker schedule a job for the task tracker?
Explain the difference between nas and hdfs?