Why do we use HDFS for applications having large data sets and not when there are lot of small files?
Can Apache Kafka be used without Zookeeper?
What other technologies have you used in hadoop sta ck?
What is formatting of the dfs?
What is Schema on Read and Schema on Write?
Can you tell us more about ssh?
How Mapper is instantiated in a running job?
What is Derby database?
What is the difference between rdbms and hadoop?
Is map like a pointer?
What is InputSplit and RecordReader?
How to change Replication Factor For below cases ?
what factors the block size takes before creation?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)