Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Is a job split into maps?
Describe the REVERSE function in Hive with an example.
Define Writable data types in MapReduce.
What are compute and storage nodes?
Is the output of the mapper or the output of the partitioner written to local disk?
What are the tools you need to build Ambari?
What is the importance of the .hiverc file?
What is Pig useful for?
How to set the number of mappers for a MapReduce job?
What is Tungsten in Spark?
When should you use HBase?
What does Hadoop do in safe mode?
What is a Sparse Vector?
What is Spark in big data?
Why should we use the ‘distinct’ keyword in Pig scripts?