Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the default replication factor?
What is Hive ?
How to change the column data type in hive? Explain rlike in hive.
What is an identity mapper and identity reducer?
How to format the HDFS? How frequently it will be done?
What are impala built-in functions?
What is jmx connector?
How does speculative execution work in Hadoop?
List some commonly used Machine Learning Algorithm Apache Spark?
What are the advantages and Disadvantages in archieving partition in Hive?
What is the difference between DAG and Lineage?
Explain why the name ‘hadoop’?
What is a rack awareness algorithm?
What is data ingestion pipeline?
Is spark a mapreduce?