Explain transformations and actions in the context of RDDs.
What is the functionality of ObjectInspector in Hive?
Is it possible to use Apache Spark for accessing and analyzing data stored in Cassandra databases?
What is the difference between Internal Table and External Table in Hive?
What are the libraries of Spark SQL?
Define TaskTracker.
How can Apache Spark be used alongside Hadoop?
What is HiveServer2 (HS2)?
Mention some important features of spm in Cassandra.
What do you understand by compaction?
Explain ingestion in big data.
Explain the sum(), max(), and min() operations in Apache Spark.
How would you pipeline large amounts of data?
What is RLIKE in Hive?
Explain the difference between an HDFS block and an input split.