Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Give the difference between Column and SuperColumn?
List the configuration parameters that have to be specified when running a MapReduce job.
State the limitations of Apache Pig?
Does this lead to security issues?
What do you understand by snitches?
List some use cases where Spark outperforms Hadoop in processing.
What is the replica placement Strategy in Cassandra ?
How does Cassandra write?
What is spark checkpointing?
What is the role of JDBC driver in Sqoop?
What is master node in spark?
What are the side data distribution techniques?
Clarify what is sequence file input format?
What is lineage graph in spark?
Double type in Hive - Important points?