Fuzzy matching and more functionality for spaCy.
-
Updated
Jul 6, 2024 - Python
Fuzzy matching and more functionality for spaCy.
DuckDB Community Extension adding RapidFuzz algorithms for search, deduplication, and record linkage.
Fast Batch String Matching in Python (Levenshtein, Jaro-Winkler, Hamming) with Zero Cache Misses - made for Python, written in C++
Fast Scalable Dedupe - Fuzzy Matching With Opensearch + nmslib + Rapidfuzz
A simple and efficient spelling correction system that uses Python's rapidfuzz library to find and correct misspelled sentences by matching them with the closest correct ones from a given dataset.
Guts of FantasyNameSearch.com
✅ completed | Voices assistant for windows managing system applications
NovelNudge is a book recommendation engine that embeds titles, descriptions, authors, and genres using SentenceTransformers. It combines these vectors and ranks similar books with cosine similarity and fuzzy title matching.
The repository is a duplicate of the local folder which contains codes created by Yuanzhan Gao (yg8ch@virginia.edu) to conduct scaled fuzzy matching procedure on EIDL and PPP dataset. Please see the README file for more information.
Phoenix II Discord Bot
Performs OCR on a list of images using Tesseract and performs fuzzy string matching with a given list of strings.
DEMO: extract media tags with Spotify API to relational Docker backend
Cleaned and transformed Netflix dataset using Python (Pandas, RapidFuzz) for visual analysis in Power BI.
EIDOScopio: Una herramienta interactiva para explorar de forma masiva el estatus legal de la biodiversidad española a través de la API de EIDOS.
Automated business record matching using fuzzy algorithms (RapidFuzz) and browser automation (Playwright)
Add a description, image, and links to the rapidfuzz topic page so that developers can more easily learn about it.
To associate your repository with the rapidfuzz topic, visit your repo's landing page and select "manage topics."