Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) How will you calculate the number of executors required to do real-time processing using Apache Spark? What factors need to be considered for deciding on the number of nodes for real-time processing?
316In a given spark program, how will you identify whether a given operation is Transformation or Action ?
361
Explain cassandra data model?
What is CTE Table in Hive?
What is flume interceptor?
How will you submit extra files or data ( like jars, static files, etc. ) For a mapreduce job during runtime?
What is the use of coordinator node in read?
What is the use of get() method?
What is zookeper?
What is the basic difference between traditional RDBMS and Hadoop?
How to drop database in apache tajo?
What is apache presto?
Name the types of tunable consistency?
It can be possible that a Job has 0 reducers?
Do streamers make money from sparks?
How to compress mapper output in Hadoop?
What are the different collection type in Hive?