Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How you can remove the element with a critical present in any other Rdd is Apache spark?
when do reducers play their role in a mapreduce task?
Is spark streaming real time?
What is the best practice to deploy the secondary name node?
Is spark used for machine learning?
What do you understand by High availability?
What is a bloom filter?
What is row rdd in spark?
How can you set an arbitrary number of mappers to be created for a job in Hadoop?
When can you use ALTER KEYSPACE?
What is zeromq?
What are combiners and its purpose?
What is sc textfile?
What is difference between dataset and dataframe?
Clarify what combiners are and when you should utilize a combiner in a map reduce job?