Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Hdfs stores data using commodity hardware which has higher chances of failures. So, how hdfs ensures the fault tolerance capability of the system?
31Suppose there is file of size 514 mb stored in hdfs (hadoop 2.x) using default block size configuration and default replication factor. Then, how many blocks will be created in total and what will be the size of each block?
125
What is the latest version of Ambari that is available in the market?
Which companies are mostly using Hive ?
What is catalyst query optimizer in apache spark?
What are the important modes of hadoop?
Mention what is apache kafka?
Explain the key features of Spark.
Why does the picture of Spark come into existence?
What are the different execution mode available in Pig?
Explain the core components of Flume?
How Facebook Uses Hadoop, Hive and Hbase ?
What are the features of Pseudo mode?
What is the relationship between hdfs, hbase, pig, hive and azkaban?
What is the role of the ZooKeeper in Kafka?
Why aggregation cannot be done in Mapper?
How one can change Replication factor when Data is already stored in HDFS