Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What do you understand by Data Replication in Cassandra?
What optimizations can a developer use during joins?
How do you organize Pig Latin statements?
What is the difference between Sqoop and DistCp?
Explain HDFS.
What is ColumnFamily?
What is the difference between Python and Spark?
Explain the different types of transformations on DStreams.
What is a MapFile?
What is Non DFS Used?
How is the read operation performed on a Cassandra node?
How is the HDFS block size different from a traditional file system block size?
Explain the Reducer's sort phase.
How do you rename a table in Hive?
Why does my select statement fail?