Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
How do you define "block" in HDFS?
What are the different Eval functions available in Pig?
Mention how hadoop is different from other data processing tools?
Explain how do ‘map’ and ‘reduce’ works?
what is Bloom Filter is used for in Cassandra?
Explain Spark Streaming with Socket?
Is client the end user in HDFS?
What is cassandra used for?
Write a query to insert a new column(new_col int) into a hiev table (htab) at a position before an existing column (x_col)
What do you mean by Schema Resolution?
If I create a folder in HDFS, will there be metadata created corresponding to the folder? If yes, what will be the size of metadata created for a directory?
What is jmx? And how is it useful in cassandra?
What are the most commonly defined input formats in Hadoop?
Does spark need yarn?
What do you mean by Schema Declaration?