Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What should be the ideal replication factor in Hadoop?
What are the configuration files in Hadoop?
What is the problem with small files in Hadoop?
What is a Cassandra CQL collection?
What is SequenceFileInputFormat in Hadoop MapReduce?
What are the uses of explode in Hive?
What is a NameNode in Hadoop?
Can you define InputSplit in Hadoop?
What is a Seed Node in Cassandra?
Which command is used to run the HBase shell?
Explain the concept of RDD (Resilient Distributed Dataset). Also, state how you can create RDDs in Apache Spark.
What is an "RDD Lineage"?
What is Spark configuration?
Can NameNode and DataNode run on commodity hardware?
When does Hadoop enter safe mode?