WeSearch

Show HN: CuSBF – Faster GPU Bloom Filter for Sequence Data

·4 min read · 0 reactions · 0 comments · 13 views
#gpu#bioinformatics#software
Show HN: CuSBF – Faster GPU Bloom Filter for Sequence Data
⚡ TL;DR · AI summary

cuSBF is a high-performance GPU implementation of the Super Bloom filter designed for efficient processing of sequence data. It significantly improves the speed of k-mer insertion and querying, outperforming various existing methods. The tool is optimized for use on NVIDIA GPUs and supports a range of sequence types.

Key facts
Original article
GitHub
Read full at GitHub →
Opening excerpt (first ~120 words) tap to expand

cuSBF Overview cuSBF is a high-performance GPU implementation of the Super Bloom filter, optimized for high-throughput batch k-mer insertion and query on nucleotide (DNA) and protein sequences (or any other sequence type as long as a valid alphabet is provided). It exploits the streaming nature of sequence-derived k-mers by using minimizers to group consecutive k-mers sharing the same minimiser into super-k-mers, assigning all k-mers of a super-k-mer to the same 256-bit memory shard. This amortizes random memory accesses across consecutive k-mer queries, reducing memory-bandwidth pressure. The findere scheme further reduces false positives dramatically by inserting overlapping s-mers and requiring a full run of consecutive s-mer matches.

Excerpt limited to ~120 words for fair-use compliance. The full article is at GitHub.

Anonymous · no account needed
Share 𝕏 Facebook Reddit LinkedIn Threads WhatsApp Bluesky Mastodon Email

Discussion

0 comments

More from GitHub