Remote Code Execution in Inspect AI (UK AISI)
Summary
The math() scorer in inspect_ai extracts the content of a LaTeX \boxed{...} from model output and passes it to SymPy's parse_expr. parse_expr tokenizes its input as Python and evaluates it with eval(): expressions like
adithyanak.com, 3 min read
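To make the pattern in the summary concrete, here is a minimal sketch of the same shape of bug. It is not inspect_ai's code and it does not call SymPy: extract_boxed and naive_evaluate are hypothetical names, and Python's built-in eval stands in for the evaluation step that parse_expr performs. The payload is deliberately harmless (it only reads the current working directory).

```python
import os
from typing import Optional

PREFIX = "\\boxed{"  # the LaTeX marker the scorer looks for


def extract_boxed(text: str) -> Optional[str]:
    """Hypothetical helper: return the content of the first boxed group."""
    start = text.find(PREFIX)
    if start == -1:
        return None
    begin = start + len(PREFIX)
    depth = 1  # we are already inside the opening brace
    for i in range(begin, len(text)):
        if text[i] == "{":
            depth += 1
        elif text[i] == "}":
            depth -= 1
            if depth == 0:
                return text[begin:i]
    return None  # unbalanced braces


def naive_evaluate(model_output: str):
    """Stand-in for the vulnerable step: evaluate the boxed answer as Python.

    ASSUMPTION: eval() is used here only to illustrate the pattern; the real
    scorer goes through SymPy's parser rather than calling eval() directly.
    """
    answer = extract_boxed(model_output)
    if answer is None:
        return None
    return eval(answer, {})  # attacker-controlled string reaches eval


# A well-behaved model answer evaluates as arithmetic.
benign = naive_evaluate(r"The answer is \boxed{1+2}.")

# A hostile "answer" is ordinary Python and runs with the scorer's privileges.
hostile = naive_evaluate(r'\boxed{__import__("os").getcwd()}')
```

In this sketch the hostile string returns the process's working directory, which shows that the boxed text is executed rather than parsed as inert math. Anything reachable from Python's builtins could be substituted for the getcwd() call.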