Why do we use HDFS for applications with large data sets, but not when there are a lot of small files?
Explain textFile vs. wholeTextFiles in Spark?
What is BloomMapFile?
Explain the lookup() operation in Spark?
Why do we need Hadoop for big data analytics?
What is a Task instance in Hadoop? Where does it run?
What does an SSTable consist of?
What is Apache Spark written in?
Explain Zero Consistency?
What are the different file permissions in HDFS at the file and directory levels?
Does HBase support an SQL-like syntax? Yes or no?
Is it possible to perform real-time analysis directly on the big data collected by Flume? If yes, explain how.
What is a Spark application?
How to create a user in Hadoop?
How to use a combiner in Hadoop?
How many Reducers should be configured?