What Mapper does?
No Answer is Posted For this Question
Be the First to Post Answer
What is cloudera and why it is used?
Data Engineer Given a list of followers in the format:123, 345234, 678345, 123…Where column one is the ID of the follower and column two is the ID of the followee. Find all mutual following pairs (the pair 123, 345 in the example above). How would you use Map/Reduce to solve the problem when the list does not fit in memory?
What is HDFS ? How it is different from traditional file systems?
How to enable recycle bin or trash in hadoop?
Why do we need a password-less ssh in fully distributed environment?
Explain the features of stand alone (local) mode?
What are channel selectors?
What is compute and Storage nodes?
In cloudera there is already a cluster, but if I want to form a cluster on ubuntu can we do it?
What other technologies have you used in hadoop sta ck?
What is MapFile?
Did you ever built a production process in hadoop ? If yes then what was the process when your hadoop job fails due to any reason?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)