Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is spark executor cores?
Explain some Kafka Streams real-time Use Cases?
What does hdfs mean?
Explain the LOAD keyword in Pig script?
What are the common mistakes developers make when running Spark applications?
Define hadoop archives? What is the command for archiving a group of files in hdfs.
How hbase handles the write failure?
What happens if the block on Hadoop HDFS is corrupted?
Clarify what jobtracker is in hadoop? What are the activities followed by hadoop?
What happens if number of reducers are 0?
What are the different CQL data manipulation commands in Cassandra?
Explain what happens if, during the PUT operation, HDFS block is assigned a replication factor 1 instead of the default value 3?
Why to use hbase?
What is Block in HDFS?
Explain what are the basic parameters of a mapper?