Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Can you define udf?
Mention what are the main configuration parameters that user need to specify to run mapreduce job?
Is hive similar to sql?
What is the difference between spark and scala?
What is spark dynamic allocation?
What are the roles of the file system in any framework?
In how many ways RDDs can be created? Explain.
Explain the terms Spark Partitions and Partitioners?
What are the types of hive ddl commands?
How can you send some messages in kafka?
Kafka has written in which languages?
What happens if one hadoop client renames a file or a directory containing this file while another client is still writing into it?
What is combiner aggregator?
What is inputsplit in hadoop? Explain.
What is a topic in kafka?