From Computerworld
1 day ago

AI systems will learn bad behavior to meet performance goals, suggest researchers
There are plenty of stories about how politicians, sales representatives, and influencers exaggerate or distort the facts in order to win votes, sales, or clicks, even when they know they shouldn't. It turns out that AI models, too, can suffer from these decidedly human failings. Two researchers at Stanford University suggest in a new preprint research paper that repeatedly optimizing large language models (LLMs) for such market-driven objectives can lead them to adopt bad behaviors as a side effect of their training, even when they are instructed to stick to the rules.

