List the commands used to start, check the progress of, and stop the Ambari server.
Which query language is used in the Cassandra database? Explain.
How does YARN work with Spark?
How does Pig differ from MapReduce?
What is the use of explode in Hive?
Does the MapReduce programming model provide a way for reducers to communicate with each other? In a MapReduce job, can one reducer communicate with another?
What is hotspotting in HBase?
What is the use of the context object?
Is it possible to search for files using wildcards?
How does Cassandra delete data?
Why can we not create the directory /user/dataflair/inpdata001 when the NameNode is in safe mode?
What is a Reducer in Hadoop?
Explain the different DStream transformations in Apache Spark Streaming.
Explain data type conversion in Hive.
Compare RDBMS with HBase.