Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain Apache Spark Streaming. How is streaming data processed in Apache Spark?
What are the advantages of Datasets in Spark?
List the other components of Cassandra.
What happens if the JobTracker machine goes down?
How can we create child znodes (sub-znodes)?
What is the Repository?
What is the difference between Hadoop and RDBMS?
When can you use ALTER KEYSPACE?
Explain mapPartitions() and mapPartitionsWithIndex().
Explain SchemaRDD.
What are the identity mapper and the chain mapper?
Can you list some useful ZooKeeper tools?
What is Schema on Read and Schema on Write?
Explain the lineage graph.
What is the Sqoop import-mainframe tool, and what is its purpose?