Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Explain what is a Hive variable. What do we use it for?
Explain why to use hbase?
Mention when you can use alter keyspace?
Should the region server be located on all DataNodes?
What is DataFrames?
what is next step after mapper or maptask?
What is Cassandra-CQL collection?
What is difference between hive and spark?
When to use coalesce and repartition in spark?
Explain the process of spilling in Hadoop MapReduce?
Do we need to install spark in all nodes?
What are the different components of a Hive architecture?
What is presto verifier?
What is spark vcores?
Explain about the basic parameters of mapper and reducer function