Define streaming?
Why Apache Spark?
What is Spark configuration?
What is a MapFile?
Explain the write-ahead log (journaling) in Spark.
What is Kafka?
How can we take Hadoop out of safe mode?
Can you explain Apache Kafka?
What are the two ways to create an RDD in Spark?
Which Java class handles the encoding of output records into the files that result from Hive queries?
How does NameNode tackle DataNode failures?
What are schema-on-read and schema-on-write?
What is throughput in HDFS?
What is the ideal replication factor in Hadoop?
What are the basic steps for writing a UDF in Pig?