Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Define NoSQL Database?
What is nagios is used in ambari?
While loading data into a hive table using the load data clause, how do you specify it is a hdfs file and not a local file ?
how will you implement SQL in Spark?
List out the various advantages of dataframe over rdd in apache spark?
What is the difference between reducebykey and groupbykey?
What is the use of combiners in the hadoop framework?
What are the different elements of row in cassandra?
Why Do We Need Apache Pig?
When to use –target-dir and when to use –warehouse-dir while importing data?
When you point a partition of a hive table to a new directory, what happens to the data?
What is column families? What happens if you alter the block size of ColumnFamily on an already populated database?
How Hadoop’s CLASSPATH plays a vital role in starting or stopping in Hadoop daemons?
Why is block size set to 128 MB in Hadoop HDFS?
Explain about the core components of Flume?