Skip to content

Feature/string bucket transformation#70

Merged
koenvo merged 3 commits intomainfrom
feature/string-bucket-transformation
Apr 7, 2026
Merged

Feature/string bucket transformation#70
koenvo merged 3 commits intomainfrom
feature/string-bucket-transformation

Conversation

@koenvo
Copy link
Copy Markdown
Contributor

@koenvo koenvo commented Apr 7, 2026

No description provided.

koenvo added 3 commits April 7, 2026 10:26
Uses MD5 hash for stable distribution when the value cannot be cast to
int. Integer values continue to use direct modulo.
Prevents special characters, spaces, $, unicode etc. from causing
issues in GCS/S3 paths. Simple values like integers stay readable.
Always hashes the string representation via MD5 for consistent
bucketing — no special-casing for numeric-looking strings.
bucket_size path (used for integer ranges) is unchanged.
@koenvo koenvo force-pushed the feature/string-bucket-transformation branch from a347c43 to edd666c Compare April 7, 2026 09:38
@koenvo koenvo merged commit 185a138 into main Apr 7, 2026
13 checks passed
@koenvo koenvo deleted the feature/string-bucket-transformation branch April 7, 2026 09:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant