What infrastructure do we need to process 100 TB data using Hadoop?
No Answer is Posted For this Question
Be the First to Post Answer
How does an hadoop application look like or their basic components?
What is partioner in hadoop? Where does it run,mapper or reducer?
What is the Use of SSH in Hadoop ?
What are the default configuration files that are used in hadoop?
Mention what is the use of Context Object?
What is InputSplit and RecordReader?
What are the different types of Znodes?
Data Engineer Given a list of followers in the format:123, 345234, 678345, 123…Where column one is the ID of the follower and column two is the ID of the followee. Find all mutual following pairs (the pair 123, 345 in the example above). How would you use Map/Reduce to solve the problem when the list does not fit in memory?
What is the difference between Apache Hadoop and RDBMS?
What is a task instance in hadoop? Where does it run?
What is a secondary namenode?
What are the modules that constitute the Apache Hadoop 2.0 framework?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)