Pinned
-
data-analysis-scheming (Public): A preliminary exploration of an agentic evaluation of scheming on research data analysis
Python
-
in-context-scheming-repl (Public): Replication of the sandbagging portion of "Frontier Models are Capable of In-context Scheming"
Python
-
noise-injection-sandbagging-repl (Public): Narrow replication of "Noise Injection Reveals Hidden Capabilities of Sandbagging Language Models"
Python

