Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What is the difference between an input split and hdfs block?
What are the different composite keys in Cassandra?
Why spark is faster than hive?
Name various types of Cluster Managers in Spark.
What do you understand by an inner bag and outer bag in Pig?
What is a Task instance in Hadoop? Where does it run?1
Define the term ‘sparse vector.’
Enlist the several components in Kafka?
Name some internal daemons used in spark?
How is HCatalog different from Hive?
Can we broadcast an rdd?
Ideally what should be replication factor in a Hadoop cluster?
What is NoSQL database?
What is winutils hadoop?
What is spark database?