Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Hdfs stores data using commodity hardware which has higher chances of failures. So, how hdfs ensures the fault tolerance capability of the system?
38Suppose there is file of size 514 mb stored in hdfs (hadoop 2.x) using default block size configuration and default replication factor. Then, how many blocks will be created in total and what will be the size of each block?
129
Define the management tools in Cassandra?
Differentiate HDFS & HBase?
What is column families? What happens if you alter the block size of ColumnFamily on an already populated database?
What is spark tool in big data?
Explain combiners.
What is the difference between rdd and dataframe in spark?
What port does spark use?
Define Writable data types in MapReduce?
How Sqoop can be used in a Java program?
What other technologies have you used in hadoop sta ck?
What is the purpose of JConsole?
What is Immutable?
What are the two main components of ResourceManager?
What do you mean by the High Availability of a NameNode in Hadoop HDFS?
what are the basic parameters of a Mapper?