Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) Explain the level of parallelism in spark streaming?
Is spark and hadoop same?
What is the latest version of spark?
What is the next step after Mapper or MapTask?
What are channel selectors?
Why HDFS stores data using commodity hardware despite the higher chance of failures in hadoop?
Which command do we use to run HBase Shell?
Name some features of Apache Cassandra?
List various commonly used machine learning algorithm?
Explain various level of persistence in Apache Spark?
Can I do transforms or add new functionality?
How to stop a partition form being queried?
how can we access the sub directories recursively?
Why is spark fast?
Can you define udf?