Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
When can you use ALTER KEYSPACE?
What is Spark MLlib?
What are Scala and Spark?
How to change a column data type in Hive?
How to access HDFS?
Explain the Spark join() operation.
Do I need to install Hadoop for Spark?
What are the main components of Cassandra data models?
Explain HCatInputFormat.
How does HDFS communicate with the Linux native file system?
Describe the distinct(), union(), intersection() and subtract() transformations in Apache Spark RDD.
What do you mean by replication factor?
What is Pig Statistics? Which stats classes are available in the Java API package?
What is the difference between ORDER BY and SORT BY in Hive?
What tools are needed to build Ambari?