Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are all the tasks we can perform to manage services using the Ambari Services tab?
Explain what the JobTracker is in Hadoop. What activities does Hadoop carry out?
What is fluming?
What language is Apache Kafka written in?
How can you start the Kafka server?
Is Hadoop required for Spark?
What are the limitations of importing RDBMS tables into HCatalog directly?
Can we run Unix shell commands from Hive? Can Hive queries be executed from script files? How? Give an example.
What is a write-ahead log (journaling)?
How is the distance between two nodes defined in Hadoop?
What precautions should be taken when adding a column?
Explain HLog and WAL in HBase.
What mechanism does the Hadoop framework provide to synchronize changes made in the distributed cache during the runtime of the application?
What do you understand by a data center in Cassandra?
What is a Cassandra CQL collection?