Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What port does spark use?
Can we use kafka without zookeeper?
What Avro offers?
What is tungsten engine in spark?
What is Shuffling and Sorting in a MapReduce?
why should we use 'filters' in pig scripts?
Can you change the block size of hdfs files?
Difference between hive and impala?
What is Thrift?
Clarify how ordering in hdfs is finished?
What is HDFS High Availability?
What is ColumnFamily?
How to identify that given operation is transformation/action in your program?
List the benefits of using Cassandra.
Explain why are replications critical in kafka?