What are input format, input split & record reader and what they do?
No Answer is Posted For this Question
Be the First to Post Answer
After increasing the replication level, I still see that data is under replicated. What could be wrong?
Define a sequence file in hadoop?
List Hadoop’s three configuration files?
What happens if one hadoop client renames a file or a directory containing this file while another client is still writing into it?
Define data cleansing?
What is the NameNode port number?
Is it possible to have hadoop job output in multiple directories? If yes, how?
What are the common types of NOSQL data bases ?
Explain the common input formats in hadoop?
What is the non dfs used?
How job tracker schedules an assignment?
Give me examples of unstructured data?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)