What is Python Spark (PySpark)?
What are the various libraries available on top of Apache Spark?
What are the config properties of Presto?
Which port does SSH work on?
What are the differences between MongoDB and Cassandra?
Does the Partitioner run in its own JVM, or does it share one with another process?
How can data transfer be minimized when working with Apache Spark?
Differentiate between the terms node, cluster, and data center in Cassandra.
What is the use of Spark in big data?
What are the identity mapper and the chain mapper?
Explain the write-ahead log (journaling) in Spark.
What is the best way to copy files between HDFS clusters?
Which kinds of applications can Kafka be used for?
What is the meaning of a broker in Kafka?
Can you define InputSplit in Hadoop?