Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is column families? What happens if you alter the block size of ColumnFamily on an already populated database?
Name three data source available in SparkSQL
How can we change the split size if our commodity hardware has less storage space?
What is spark dynamic allocation?
what is JobTracker in Hadoop? What are the actions followed by Hadoop?
What is partitioning key?
What is troubleshooting for impala?
how JobTracker schedules a task ?
Can you explain spark rdd?
How do you organize the pig latin statements?
What are the key elements in ZooKeeper Architecture?
If no custom partitioner is defined in Hadoop then how is data partitioned before it is sent to the reducer?
What are the purposes of using Ambari shell?
What counter in Hadoop MapReduce?
What is different table structure available in the hive?