Anthropic Introduces Model Spec Midtraining for Better AI Alignment
Anthropic's new research on Model Spec Midtraining (MSM) teaches AIs to generalize desired behaviors from constitutional specs during training, improving alignment beyond example-based methods.