Answer Posted / Diwakar Verma
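A Bloom filter is a probabilistic data structure for testing whether an element is a member of a set. It is very space-efficient, but it has a small probability of false positives (reporting an element as present when it is not); it never produces false negatives. In Apache Pig it is typically used to drop rows of a large relation that cannot match before an expensive join.

Example usage (the filter file must have been built beforehand, for example with Pig's built-in BuildBloom UDF):
`DEFINE bloom Bloom('<bloom_filter_file>'); data = FOREACH input_data GENERATE col1; filtered_data = FILTER data BY bloom(col1);`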
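
To see why false positives are possible but false negatives are not, here is a minimal Python sketch of the data structure itself. It is not Pig's implementation: the class name `BloomFilter`, the method `might_contain`, the sizes, and the use of salted SHA-256 to derive bit positions are all assumptions made for illustration.

```python
import hashlib


class BloomFilter:
    """Illustrative Bloom filter: a bit array plus k hash functions (not Pig's code)."""

    def __init__(self, num_bits: int, num_hashes: int) -> None:
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)  # all bits start at 0

    def _positions(self, item: str):
        # Derive k bit positions by salting one hash with the index i.
        # (Assumed scheme; real implementations often use murmur or jenkins hashes.)
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item: str) -> None:
        # Set every bit the item hashes to.
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, item: str) -> bool:
        # True only if all k bits are set. Bits set by other items can
        # collide, which is where false positives come from.
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(item)
        )


bf = BloomFilter(num_bits=1024, num_hashes=3)
for key in ["alice", "bob", "carol"]:
    bf.add(key)

print(bf.might_contain("alice"))    # always True: added items are never missed
print(bf.might_contain("mallory"))  # usually False, but a false positive is possible
```

An added key always finds its own bits set, so it can never be reported as absent. A key that was never added is reported as present only if other keys happened to set all of its bits, which becomes less likely as the bit array grows relative to the number of keys.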