What is the default replication factor in Hadoop and how will you change it?
What is erasure coding in Apache Hadoop?
What are the common types of NoSQL databases?
What is throughput? How does HDFS provide good throughput?
What is the difference between a Dataset and a DataFrame in Spark?
Do we need Hadoop for Spark?
How many operational commands are there in HBase?
How can you manually partition an RDD?
What is spark.executor.memory in a Spark application?
How can you stop a partition from being queried?
By default, how many partitions are created in an RDD in Apache Spark?
How do you alter a Hive database?
What is WebDAV in Hadoop?
Can you explain the Spark RDD?
What do you know about the case sensitivity of Apache Pig?