Wired

Technology

This Tool Probes Frontier AI Models for Lapses in Intelligence

Summary
Nutrition label

88% Informative

Scale AI has developed a platform that can automatically test a model across thousands of benchmarks and tasks.

The new tool, called Scale Evaluation, automates some of this work using Scale’s own machine learning algorithms.

Scale rose to prominence providing human labor for training and testing advanced AI models.

The company's new tool may also inform efforts to standardize how AI models are tested for misbehavior.

VR Score

91

Informative language

92

Neutral language

66

Article tone

semi-formal

Language

English

Language complexity

54

Offensive language

not offensive

Hate speech

not hateful

Attention-grabbing headline

not detected

Known propaganda techniques

not detected

Time-value

long-living

Affiliate links

no affiliate links