Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Name the ports Cassandra uses?
How many instances of a jobtracker run on hadoop cluster?
What does apache spark do?
What do you understand by Lazy Evaluation?
Who uses apache spark?
Name the examples of some companies that are using hadoop structure?
Does Apache Flume provide support for third party plug-ins?
Why do I have to use refresh and invalidate metadata, what do they do?
What are the debugging tools used for Apache Pig scripts?
what is difference between pig and sql?
How is machine learning implemented in spark?
Difference between external table and internal table in HIVE ?
Differentiate between FileSink and FileRollSink?
Define the term ‘sparse vector.’
On which port does ssh work?