What type of data we should put in distributed cache? When to put the data in dc? How much volume we should put in?
No Answer is Posted For this Question
Be the First to Post Answer
Explain Hadoop streaming?
What are the side effects of not running a secondary name node?
Which one is default?
Explain what is hadoop?
Is it possible to provide multiple input to Hadoop? If yes then how can you give multiple directories as input to the Hadoop job?
What happen if number of reducer is set to 0 in Hadoop?
What are the side data distribution techniques?
How is the splitting of file invoked in Hadoop framework?
Why is checkpointing important in hadoop?
Why Hadoop performs replication, although it results in data redundancy?
Is nosql follow relational db model?
What does the mapred.job.tracker command do?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)