Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How to set the number of reducers?
What is spark database?
What are the different types of data model?
What is sink processors?
Can you explain data versioning?
What is HDFS Federation?
What port does spark use?
What is the default extension of the files produced from a sqoop import using the –compress parameter?
List the network requirements for using Hadoop ?
What are the differences between PIG and SQL?
How blocks are distributed among all data nodes for a particular chunk of data?
What is decorating filters?
Discuss the various running mode of Apache Spark?
How data or file is written into HDFS?
What are the primitive data types in Pig?