Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What do you understand by column family?
What is the procedure to create users in HDFS and how to allocate Quota to them?
What do you understand by standalone (or local) mode?
Explain what combiners are and when you should use a combiner in a mapreduce job?
Can the region server will be located on all datanodes?
What is wal and hlog in hbase?
What is the default database provided by Apache Hive for metastore?
how Cassandra delete Data?
What are the core benefits for hadoop users by using apache ambari?
Mention some important components of cassandra data models?
What is Sqoop Import? Explain its purpose?
Define Actions.
What is Mapper in Hadoop MapReduce?
What happens when the data set exceeds available memory?
On which port does ssh work?