What is the most widely used API for writing data to Cassandra?
What is a column family in Cassandra?
What is Spark used for?
Is HQL case-sensitive?
How do you add a node to, or remove a node from, an existing cluster?
What is an accumulator in Spark?
What is a local repository, and when is it useful in an Ambari environment?
How do you compress mapper output in Hadoop?
Which file systems does Spark support?
What role does ZooKeeper play in a Kafka cluster?
What is data skew and how do you fix it?
How will you list all the columns of a table using Apache Sqoop?
What is the fundamental difference between Cassandra and Hadoop?
How do I clear my Spark cache?
What is the NameNode in Hadoop?