generated from openproblems-bio/task_template
Open
Labels: bug (Something isn't working)
Description
Describe the bug
The current model loading logic fails because the repository structure and model versioning for Geneformer on Hugging Face have been updated. Specifically:
- The model identifier `gf-12L-95M-i4096` is no longer available or has been replaced.
- The `hf_hub_download` calls fail when the `subfolder` parameter is used, as the latest model weights and configs appear to be located in the root directory or organized differently in the `ctheodoris/Geneformer` repository.
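One way to confirm where the files currently live is to inspect the repository listing before downloading. The sketch below is a diagnostic aid, not part of the component: `find_file_locations` is a hypothetical helper that works on a plain list of paths, and `list_geneformer_files` assumes `huggingface_hub` is installed and relies on its `list_repo_files` function.

```python
from pathlib import PurePosixPath


def find_file_locations(repo_files, filenames=("model.safetensors", "config.json")):
    """Map each wanted filename to the repo folders that contain it.

    The repository root is reported as the empty string "".
    """
    locations = {name: [] for name in filenames}
    for path in repo_files:
        p = PurePosixPath(path)
        if p.name in locations:
            parent = str(p.parent)
            # PurePosixPath reports the root of a relative path as "."
            locations[p.name].append("" if parent == "." else parent)
    return locations


def list_geneformer_files(repo_id="ctheodoris/Geneformer"):
    """Fetch the current file listing from the Hugging Face Hub (needs network)."""
    # Imported lazily so the pure helper above works without huggingface_hub.
    from huggingface_hub import list_repo_files

    return list_repo_files(repo_id)
```

Calling `find_file_locations(list_geneformer_files())` would show whether a given subfolder still holds `model.safetensors` and `config.json`, or whether they are only present at the root.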
To Reproduce
Steps to reproduce the behavior:
- Attempt to initialize the model using the identifier `gf-12L-95M-i4096`.
- Run the script containing the `hf_hub_download` logic with the `subfolder` parameter enabled.
- See error: `404 Client Error: Not Found for url` or `Entry Not Found`.
Expected behavior
- The model identifier should be updated to `gf-12L-104M-i4096` to align with the current Hugging Face repository state.
- The file downloading logic should successfully fetch `model.safetensors` and `config.json` without being blocked by incorrect subfolder paths.
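The identifier update could be handled with a small alias table so that existing configurations using the old name keep working. This is only a sketch: `MODEL_ALIASES` and `resolve_model_name` are hypothetical names, and the single mapping reflects the replacement proposed in this issue, not a verified list of what the repository hosts.

```python
import warnings

# Hypothetical alias table: legacy identifier -> replacement proposed in this issue.
MODEL_ALIASES = {
    "gf-12L-95M-i4096": "gf-12L-104M-i4096",
}


def resolve_model_name(name):
    """Return the replacement for a legacy identifier, or the name unchanged."""
    if name in MODEL_ALIASES:
        new_name = MODEL_ALIASES[name]
        warnings.warn(
            f"Model identifier '{name}' is outdated; using '{new_name}' instead.",
            stacklevel=2,
        )
        return new_name
    return name
```

Emitting a warning instead of failing keeps old configurations usable while making the rename visible to the user.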
Proposed Fix
Update the model version and comment out/remove the subfolder argument in the model_files dictionary:
from huggingface_hub import hf_hub_download

model_files = {
    "model": hf_hub_download(
        repo_id="ctheodoris/Geneformer",
        # subfolder=par["model"],  # Remove or comment out to fix pathing
        filename="model.safetensors",
    ),
    "config": hf_hub_download(
        repo_id="ctheodoris/Geneformer",
        # subfolder=par["model"],  # Remove or comment out to fix pathing
        filename="config.json",
    ),
}
Additional context
It appears the ctheodoris/Geneformer repository has consolidated its files or updated the default weights to the 104M parameter version. Using the legacy 95M identifier or forcing a subfolder path that no longer exists breaks the initialization pipeline for new users.
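A more tolerant alternative to removing `subfolder` outright would be to try the configured subfolder first and fall back to the repository root. The following is a minimal sketch under stated assumptions: `download_with_fallback` and `hub_fetch` are hypothetical helpers, and the set of exceptions that signal a missing file is passed in by the caller instead of being hard-coded.

```python
def download_with_fallback(fetch, filename, subfolders, not_found=(Exception,)):
    """Try each candidate subfolder in order and return the first successful result.

    fetch      -- callable taking (filename, subfolder) and returning a local path
    subfolders -- candidate locations; None means the repository root
    not_found  -- exception types that mean "file is not at this location"
    """
    last_error = None
    for subfolder in subfolders:
        try:
            return fetch(filename, subfolder)
        except not_found as err:
            last_error = err  # remember the failure and try the next candidate
    raise FileNotFoundError(
        f"{filename} not found in any of {list(subfolders)}"
    ) from last_error


def hub_fetch(filename, subfolder, repo_id="ctheodoris/Geneformer"):
    """Adapter around hf_hub_download (needs huggingface_hub and network access)."""
    # Imported lazily so download_with_fallback can be used and tested offline.
    from huggingface_hub import hf_hub_download

    return hf_hub_download(repo_id=repo_id, filename=filename, subfolder=subfolder)
```

With `huggingface_hub` installed, a call such as `download_with_fallback(hub_fetch, "model.safetensors", [par["model"], None], not_found=(EntryNotFoundError,))` would try the configured subfolder and then the root, with `EntryNotFoundError` imported from `huggingface_hub.utils`. Whether that exception covers every failure reported above is an assumption that should be checked against the actual error.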