Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Which Sorting algorithm is used in Hadoop MapReduce?
Explain why are replications critical in kafka?
Is apache spark a programming language?
Explain the maximum size of a message that can be received by the Kafka?
what factors the block size takes before creation?
In which scenario Hive is good fit?
Are sparks dangerous?
What is cqlsh? And why is it used?
What is impala?
Which port does SSH work on?
Define parquet file format? How to convert data to parquet format?
What do you mean by metadata in HDFS?
How to open a connection in hbase?
What are the great features of spark sql?
What is the data storage component used by Hadoop?