What are the common Hadoop Pig interview questions that you have been asked in a Hadoop job interview?
You have a file personal_data.txt in the HDFS directory with 100 records. You want to see only the first 5 records from this file. How will you do this?
What values are stored in a Cassandra column?
What is MLlib?
I have a row or key cache hit rate of 0.XX123456789 reported by JMX. Is that XX% or 0.XX%?
Does Apache Spark provide checkpoints?
Ideally, what should the block size be in Hadoop?
What are the main classes of the Data Transfer API?
Can you give a detailed overview of the big data generated by Facebook?
If MapReduce is inferior to Spark, is there any benefit to learning it?
Define the purpose of the partition function in the MapReduce framework.
List the languages supported by Apache Spark.
What is SparkSession in Apache Spark?
List the advantages of the Parquet file format in Apache Spark.
Describe the REVERSE function in Hive with an example.
In MapReduce, is sorting done on the mapper node or the reducer node?
What do sorting and shuffling do?