Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Why is Spark RDD immutable?
Is hadoop required for spark?
Explain Data Type Conversion in Hive?
Explain Spark countByKey() operation?
How is 0xdata's h2o different from apache mahout ?
What is JobTracker?
Why do we use Hadoop?
Characterize data integrity? How does hdfs ensure information integrity of data blocks squares kept in hdfs?
Clarify about the smb join in hive?
Explain api create or replace tempview()?
What is used to store data generally?
Explain how do ‘map’ and ‘reduce’ works?
What are the Data extraction tools in Hadoop?
what Hive is composed of ?
Can free-form SQL queries be used with Sqoop import command? If yes, then how can they be used?