This project demonstrates a complete, end-to-end Machine Learning deployment pipeline, serving personalized book recommendations in real-time via a FastAPI web service.
| Service | Link |
|---|---|
| Live API Endpoint | https://goodreads-recommender-418836917221.us-west3.run.app/ |
| Interactive Docs (Swagger UI) | https://goodreads-recommender-418836917221.us-west3.run.app/docs |
The core objective was to build a low-latency, resilient service that delivers real-time, personalized book recommendations from each user's reading history.
- Dataset: GoodReads 10k dataset, filtered to ensure high-quality, dense user-item interactions.
- Model Selection: Singular Value Decomposition (SVD) via the `Surprise` library, chosen for its balance of interpretability and performance on sparse matrices.
- Validation Strategy: 90/10 train-test split, achieving an RMSE of 0.8329 on a dense subset of 10,000 users, demonstrating the model's ability to minimize prediction error.
- Cold Start Handling: New users were recommended the top-rated books before the recommendations could become more personalized.
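The fallback logic above can be sketched in a few lines. This is an illustrative sketch only — the function and variable names (`recommend`, `TOP_RATED`, `svd_scores`) are not the project's actual identifiers, and the per-user scores stand in for real SVD predictions:

```python
# Cold-start fallback: known users get personalized SVD-ranked books,
# unseen users fall back to the globally top-rated list.

TOP_RATED = ["book_a", "book_b", "book_c"]  # precomputed global favorites

# Toy stand-in for per-user SVD predictions: user -> {book: predicted rating}.
svd_scores = {
    "user_1": {"book_x": 4.7, "book_y": 4.2, "book_a": 3.9},
}

def recommend(user_id, n=2):
    """Return the top-n books for a known user, else the global top-rated list."""
    scores = svd_scores.get(user_id)
    if scores is None:  # cold start: no interaction history for this user
        return TOP_RATED[:n]
    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked[:n]
```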
The system follows a decoupled architecture where the model training (offline) and inference (online) are separated to ensure low-latency responses.
The primary value of this project lies in the successful resolution of critical production deployment obstacles:
- Problem: Building the Docker image on a Mac M-series (ARM64) without targeting the necessary architecture caused the image to fail on Cloud Run (which runs AMD64/Linux).
- Solution: Used `docker buildx build --platform linux/amd64` to explicitly compile an image for the target architecture, ensuring compatibility.
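For reference, the full build-and-push flow might look like the following. The Artifact Registry path is a placeholder, not this project's actual repository:

```shell
# Build an AMD64 image on an ARM64 Mac and push it in one step.
# Substitute your own PROJECT_ID and repository name.
docker buildx build \
  --platform linux/amd64 \
  -t us-west3-docker.pkg.dev/PROJECT_ID/REPO/goodreads-recommender:latest \
  --push .
```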
- Problem: The core ML library (`scikit-surprise`) crashed during startup due to incompatibility with the newest version of NumPy installed in the Docker environment.
- Solution: Explicitly pinned the NumPy dependency to a compatible version in `requirements.txt`: `numpy<2.0.0`.
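The resulting pin sits alongside the other dependencies; a minimal sketch (only the NumPy constraint is from the source — the other entries are the stack named below, unversioned):

```
# requirements.txt
numpy<2.0.0
scikit-surprise
pandas
fastapi
gunicorn
uvicorn
```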
- Problem: The service consistently failed the health check (503 Service Unavailable) because of conflicts between the application's required port and the platform's environmental settings, compounded by insufficient memory for model loading.
- Solution:
  - Port Fix: Used the shell form of `CMD` in the `Dockerfile` to bind Gunicorn directly to the platform's injected variable: `--bind 0.0.0.0:$PORT`.
  - Resource Fix: Increased the Cloud Run service memory allocation to 2 GiB and reduced the Gunicorn worker count to overcome the initial model-loading memory spike.
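A sketch of the relevant `Dockerfile` lines, assuming a `python:3.11-slim` base and an `app.main:app` module path (both assumptions, not the project's exact values):

```dockerfile
FROM python:3.11-slim
WORKDIR /app
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt
COPY . .
# Shell form so $PORT (injected by Cloud Run) is expanded at runtime;
# the JSON-array exec form would pass the literal string "$PORT".
CMD exec gunicorn -k uvicorn.workers.UvicornWorker -w 1 --bind 0.0.0.0:$PORT app.main:app
```

The shell form matters here: Cloud Run sets `PORT` as an environment variable, and only a shell can substitute it into the command line.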
- Automated the extraction and transformation of the GoodReads 10k dataset, ensuring schema consistency before model training.
- Implemented a logging utility to track API performance and model inference latency, essential for monitoring production health.
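A minimal sketch of that kind of latency-logging utility, using only the standard library. The logger name, format, and `predict` function are illustrative, not the project's actual code:

```python
import logging
import time
from functools import wraps

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("recommender")

def log_latency(func):
    """Decorator that logs how long each call takes, in milliseconds."""
    @wraps(func)
    def wrapper(*args, **kwargs):
        start = time.perf_counter()
        try:
            return func(*args, **kwargs)
        finally:
            elapsed_ms = (time.perf_counter() - start) * 1000
            logger.info("%s took %.2f ms", func.__name__, elapsed_ms)
    return wrapper

@log_latency
def predict(user_id):
    time.sleep(0.01)  # stand-in for model inference
    return ["book_a", "book_b"]
```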
This API demonstrates how localized research models can be scaled into accessible tools. In a public health or social science context, this pipeline could be adapted to provide real-time resource recommendations or intervention matching for study participants.
| Component | Technology | Role |
|---|---|---|
| ML/Data | Python, scikit-surprise, Pandas | Model training and artifact persistence. |
| API Server | FastAPI, Gunicorn, Uvicorn | High-performance ASGI web service. |
| Containerization | Docker, Buildx | Packaging the application for cross-platform portability. |
| Deployment | Google Cloud Run (GCR), Artifact Registry | Serverless hosting, auto-scaling, and secure image hosting. |
- Clone the repository:

  ```bash
  git clone https://github.com/Anjamarie/Goodreads-recommender-api.git
  cd goodreads-recommender-api
  ```

- Build the image (requires Docker Buildx):

  ```bash
  docker build -t local-recommender:latest .
  ```

- Run the container:

  ```bash
  docker run -d -p 8000:8000 local-recommender:latest
  ```

- Access the API documentation at http://localhost:8000/docs
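As a quick smoke test once the container is up, you can check that the documented `/docs` endpoint responds (this assumes the container is already running on port 8000):

```shell
# Swagger UI should return HTTP 200 when the service is healthy.
curl -s -o /dev/null -w "%{http_code}\n" http://localhost:8000/docs
```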