Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Mention what is data cleansing?
How to control access to data in impala?
How is a keyspace created in cassandra? & What are the parameters used?
What is Hadoop streaming?
Is it possible to run Apache Spark on Apache Mesos?
Define primary key in Apache Cassandra?
What is scala and spark?
Query language is executed in Cassandra database. Clarify?
What can be optimum value for Reducer?
How to setup the local repository manually?
What is spark checkpointing?
Differentiate between piglatin and hiveql?
What are the usage of different consistency levels for write operations ?
What are the different Primitive Data Types available in Hive?
What is the most widely used API Write Data to Cassandra ?