METR: Measuring AI Ability to Complete Long Tasks
lesswrong.com
2 points by surprisetalk 9 days ago