Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Explain what happens if you alter the block size of a column family on an already occupied database?
Why does hive not store metadata information in hdfs?
Define data integrity?
What do you mean by logging in cassandra?
What is apache spark in big data?
What is flatmap in apache spark?
Differentiate between static and dynamic cql tables.
Why use hadoop?
Does spark use zookeeper?
How many ways we can create rdd in spark?
How does NameNode tackle DataNode failures?
What is Sqoop Import Mainframe Tool and its Purpose?
What was the design goal of Cassandra?
did you maintain the hadoop cluster in-house or used hadoop in the cloud?
How does reducebykey work in spark?