What type of data we should put in distributed cache? When to put the data in dc? How much volume we should put in?
No Answer is Posted For this Question
Be the First to Post Answer
Is it possible to provide multiple input to Hadoop? If yes then how?
Whats the default port that jobtrackers listens ?
Can you define inputsplit in hadoop?
How Big is ‘Big Data’?
What is Erasure Coding in Hadoop?
Which port does SSH work on?
What are the four basic parameters of a mapper?
Give me examples of unstructured data?
Did you ever ran into a lop sided job that resulted in out of memory error, if yes then how did you handled it ?
What is KeyValueTextInputFormat in Hadoop?
What are the tools used in big data?
How many daemon processes run on a hadoop cluster?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)