Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What type of data hadoop can handle ?
What are the disadvantages of using Apache Spark over Hadoop MapReduce?
Explain future growth of Apache Ambari?
Hbase blocksize is configured on which level?
Mention what happens if the preferred replica is not in the ISR?
Does HDFS allow a client to read a file which is already opened for writing?
What are the disservices of utilizing Apache Spark over Hadoop MapReduce?
What is a Speculative Execution in Hadoop MapReduce?
What is the difference between dataset and dataframe in spark?
If no custom partitioner is defined in Hadoop then how is data partitioned before it is sent to the reducer?
What is a bloom filter?
What are the parameters used to create keyspace in cassandra?
Explain about the partitioning, shuffle and sort phase
How many maximum jvm can run on a slave node?
Types of Data Flow in Flume?