Microsoft researchers have developed On-Policy Context Distillation (OPCD), a training method that permanently embeds ...
Toyota says it'll have hundreds of tasks under control by the end of the year, and it's targeting over 1,000 tasks by the end of 2024. As such, it's developing what it believes will be the first Large ...
Anthropic identifies AI persona drift and ties it to an “assistant axis,” testing across 275 roleplay characters and raising safety limits.
Meet your AI auditor: How this new job role monitors model behavior ...
Anthropic has seen its fair share of AI models behaving strangely. However, a recent paper details an instance where an AI model turned “evil” during an ordinary training setup. A situation with a ...