Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) What is the difference between sort by and order by in hive?
Is spark secure?
Why cloudera is used?
What do you understand by the partitions in spark?
What do you mean by schema on reading?
What is Catalyst framework?
What is spark sqlcontext?
what happens in textinformat ?
What are main APIs of Kafka?
How do we represent data in Spark?
How can you control the number of mappers used by the sqoop command?
In a very huge text file, you want to just check if a particular keyword exists. How would you do this using Spark?
Can I do insert … select * into a partitioned table?
Explain the important tools useful for big data?
Which files are used by the startup and shutdown commands?