Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How is analysis of Big Data useful for organizations?
What is the use of the version command in Hadoop Sqoop?
Why would NoSQL be better than using a SQL database? And how much better is it?
What is Azure Spark?
What are the different ways to secure a cluster using Ambari?
What do you understand by standalone (or local) mode?
How to submit extra files (jars, static files) for a MapReduce job during runtime in Hadoop?
How can native libraries be included in YARN jobs?
What is compaction?
Can you explain the indexing process in HDFS?
Explain the key features of Apache Spark.
Is the HDFS block size reduced to achieve faster query results?
What is the significance of the ‘IF EXISTS’ clause while dropping a table?
How to debug Hadoop code?
What is the difference between Internal Table and External Table in Hive?