fix: do not create default index bloom filter during digest #3

Open
utkonos wants to merge 2 commits into eciavatta:main from malwarology:issue1
utkonos commented Mar 27, 2026

The C++ reference runs the index deduplication check only when the caller provides an explicit index (info != NULL). The Go port unconditionally created a 64 MB bloom filter as a default index, so features that the C++ reference would keep were silently dropped as duplicates. The result is fewer bloom filters and fewer features per file, which causes hash mismatches across all file sizes.

Only create the index when explicitly provided by the caller.
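
The behavior described above can be illustrated with a minimal sketch. It is not the code from this PR or from the C++ reference: the names `index`, `newIndex`, `insertIfAbsent`, and `selectFeatures` are hypothetical, and a plain map stands in for the bloom filter to keep the example short. It only shows why a nil check on the index changes how many features survive.

```go
package main

import "fmt"

// index is a hypothetical stand-in for the bloom-filter index.
// A map is used here purely for illustration.
type index struct {
	seen map[uint32]struct{}
}

// newIndex returns an empty index.
func newIndex() *index {
	return &index{seen: make(map[uint32]struct{})}
}

// insertIfAbsent records f and reports whether it was new to the index.
func (ix *index) insertIfAbsent(f uint32) bool {
	if _, ok := ix.seen[f]; ok {
		return false
	}
	ix.seen[f] = struct{}{}
	return true
}

// selectFeatures keeps every feature when idx is nil. When an index is
// supplied, features already present in it are dropped as duplicates.
func selectFeatures(features []uint32, idx *index) []uint32 {
	kept := make([]uint32, 0, len(features))
	for _, f := range features {
		if idx != nil && !idx.insertIfAbsent(f) {
			continue // duplicate according to the caller's index
		}
		kept = append(kept, f)
	}
	return kept
}

func main() {
	features := []uint32{7, 7, 9}

	// No index supplied: nothing is deduplicated.
	fmt.Println(len(selectFeatures(features, nil))) // 3

	// An index that is always created deduplicates even when the
	// caller did not ask for it, so fewer features are kept.
	fmt.Println(len(selectFeatures(features, newIndex()))) // 2
}
```

With an always-present default index, the second call's behavior would apply to every digest, which matches the dropped features described above.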

Fixes #1

utkonos added 2 commits March 26, 2026 21:01
Development

Successfully merging this pull request may close these issues.

Hash Mismatch Between Reference Implementation and Go Implementation