Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What is SSTable?
State the difference between persist() and cache() functions.
What are the ways to create RDDs in Apache Spark? Explain.
Define sparkcontext in apache spark?
How to create and manage a view in HCatalog?
How to setup the local repository manually?
How to resolve ioexception: cannot create directory, while formatting namenode in hadoop?
What is a topic in kafka?
Can we have different replication factor of the existing files in hdfs?
What is scala and spark?
What is Rack Awareness in Apache Hadoop?
Why Do We Need Apache Pig?
Explain the features of pseudo mode?
Explain the default level of parallelism in Apache Spark
Define a commodity hardware? Does commodity hardware include ram?