Explain bloom?

Answer Posted / Diwakar Verma

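A Bloom filter is a probabilistic data structure used for testing whether an element is a member of a set. It is space-efficient because it stores a fixed-size bit vector rather than the elements themselves. The trade-off is a small probability of false positives; it never produces false negatives. In Apache Pig it is exposed through the built-in UDFs BuildBloom (builds the filter from a relation and stores it to a file) and Bloom (tests values against a stored filter), and it is typically used to discard non-matching rows from a large relation before a join.

Example usage (assuming the filter file was already built with BuildBloom):
`DEFINE bloom Bloom('<bloom_filter_file>'); data = FOREACH input_data GENERATE col1; filtered_data = FILTER data BY bloom(col1);`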
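
To make the false-positive behaviour concrete, here is a minimal Python sketch of the data structure itself. It is not Pig's implementation: the class name, the sizes (1024 bits, 3 hashes), and the use of SHA-256 with a seed prefix are all assumptions chosen for illustration. It mirrors the Pig pattern above by building the filter from a small key set and using it to pre-filter a larger list of rows.

```python
import hashlib


class BloomFilter:
    """Illustrative Bloom filter (hypothetical, not Pig's implementation)."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        # One byte per bit keeps the sketch simple; a real filter packs bits.
        self.bits = bytearray(num_bits)

    def _positions(self, item):
        # Derive num_hashes positions by hashing the item with different seeds.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos] = 1

    def might_contain(self, item):
        # True means "possibly present"; False means "definitely absent".
        return all(self.bits[pos] for pos in self._positions(item))


# Build the filter from the keys of the small relation.
small_keys = ["a1", "b2", "c3"]
bf = BloomFilter()
for key in small_keys:
    bf.add(key)

# Pre-filter the large relation; rows whose key is definitely absent are dropped.
large_rows = [("a1", 10), ("zz", 20), ("c3", 30), ("q9", 40)]
filtered = [row for row in large_rows if bf.might_contain(row[0])]
print(filtered)
```

Every key that was added is guaranteed to pass the check. A key that was never added can occasionally pass as well, which is why the filtered rows still need an exact join or comparison afterwards.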


