Popular repositories
- diffing-toolkit (Public, forked from science-of-finetuning/diffing-toolkit)
A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively.
- aisilab.github.io (Public)
Website of the AI Safety & Interpretability Lab at SDU
HTML 1
- Superadditive-cooperation-LLMs (Public, forked from pippot/Superadditive-cooperation-LLMs)
Study on superadditive cooperation between Large Language Model agents in an Iterated Prisoner's Dilemma tournament
Python
- Prolog-as-a-Tool (Public, forked from niklasmellgren/grpo-prolog-inference)
Reinforcement fine-tuning LLMs with GRPO to generate Prolog code for symbolic reasoning and inference
Jupyter Notebook