Building behavioral auditing and alignment tools for LLMs. Try the demo →
rho-eval: a drop-in behavioral audit for any LLM. It measures 8 behavioral dimensions, requires no internet connection, and runs on Apple Silicon (MLX), CUDA, and CPU.

```bash
pip install rho-eval

# Audit any model
rho-eval Qwen/Qwen2.5-7B-Instruct --behaviors all

# One-command behavioral repair
rho-surgery Qwen/Qwen2.5-7B-Instruct -o ./repaired-7b/
```

| Repo | What it does | Paper |
|---|---|---|
| knowledge-fidelity | Behavioral auditing + alignment toolkit. PyPI. | |
| confidence-cartography | Teacher-forced confidence as a false-belief sensor. | |
| intelligent-svd | Knowledge-preserving SVD compression for LLMs. | |
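
To run the audit command shown above over several models, a small wrapper along these lines may help. It is only a sketch: it uses just the flags that appear above (`--behaviors all` and `-o`), and assumes the `rho-eval` and `rho-surgery` executables are on `PATH` after `pip install rho-eval` and exit with a non-zero status on failure. The helper names (`audit_command`, `surgery_command`, `run_audits`) are hypothetical and not part of the package.

```python
import shutil
import subprocess


def audit_command(model_id):
    """Argument list mirroring `rho-eval <model> --behaviors all`."""
    return ["rho-eval", model_id, "--behaviors", "all"]


def surgery_command(model_id, output_dir):
    """Argument list mirroring `rho-surgery <model> -o <dir>`."""
    return ["rho-surgery", model_id, "-o", output_dir]


def run_audits(model_ids, dry_run=True):
    """Print one audit command per model; execute it only if dry_run is False."""
    commands = [audit_command(model_id) for model_id in model_ids]
    for cmd in commands:
        print(" ".join(cmd))
        if dry_run:
            continue
        # Assumption: the CLI is installed and on PATH.
        if shutil.which(cmd[0]) is None:
            raise FileNotFoundError(f"{cmd[0]} not found; run `pip install rho-eval` first")
        # Assumption: a non-zero exit status signals a failed audit.
        subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    # Dry run: only prints the command that would be executed.
    run_audits(["Qwen/Qwen2.5-7B-Instruct"])
```

With `dry_run=True` (the default) nothing is executed, so the commands can be reviewed before starting a long audit.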