An OpenAI safety research lead departed for Anthropic

One of the most controversial issues in the AI industry over the past year was what to do when a user displays signs of mental health struggles in a chatbot conversation. OpenAI’s head of that type of safety research, Andrea Vallone, has now joined Anthropic.

”Over the past year, I led OpenAI’s research on a question with almost no established precedents: how should models respond when confronted with signs of emotional over-reliance or early indications of mental health distress?” Vallone wrote in a LinkedIn post a couple of months ago.

Vallone, who spent three years at OpenAI and built out the “model policy” research team there, worked on how to best deploy GPT-4, OpenAI’s reasoning models, and GPT-5, as well as developing training processes for some of the AI industry’s most popular safety techniques, such as rule-based rewards. Now, she’s joined the alignment team at Anthropic, a group tasked with understanding AI models’ biggest risks and how to address them.

Vallone will be working under Jan Leike, the OpenAI safety research lead who departed the company in May 2024 due to concerns that OpenAI’s “safety culture and processes have taken a backseat to shiny products.”

Leading AI startups have increasingly incited controversy over the past year over users’ struggles with mental health, which can spiral deeper after confiding in AI chatbots, especially since safety guardrails tend to break down in longer conversations. Some teens have died by suicide, or adults have committed murder, after confiding in the tools. Several families have filed wrongful death suits, and there has been at least one Senate subcommittee hearing on the matter. Safety researchers have been tasked with addressing the problem.

Sam Bowman, a leader on the alignment team, wrote in a LinkedIn post that he was “proud of how seriously Anthropic is taking the problem of figuring out how an AI system should behave.”

In a LinkedIn post on Thursday, Vallone wrote that she’s “eager to continue my research at Anthropic, focusing on alignment and fine-tuning to shape Claude’s behavior in novel contexts.”

Source link

Does Ticketmaster have a stranglehold on concert ticketing — or is it just ‘bringing joy’?

Anker’s last-gen sleep buds are nearly 40 percent off ahead of daylight saving time

Android’s Find Hub adds iPhone-like luggage tracking links

What Trump’s war on Iran means for the US energy crunch

Google’s latest Pixel drop allows Gemini to order groceries for you and more

An OpenAI safety research lead departed for Anthropic

You may be interested

Why polls are staying open an hour longer for the Democratic primary in Dallas County, Texas

Bill Gates among 7 asked to testify before House committee on possible Epstein ties

NFL news: Jets’ Breece Hall has cryptic tweet after being franchise tagged

Latest News

Why polls are staying open an hour longer for the Democratic primary in Dallas County, Texas

Bill Gates among 7 asked to testify before House committee on possible Epstein ties

3/3: CBS Evening News – CBS News

Jamie Ager projected to win Democratic primary in North Carolina’s 11th Congressional District

Whatley, Cooper win North Carolina primaries, CBS News projects, teeing up key Senate contest

Father of alleged school shooter found guilty of murder

Polls start closing in today’s primaries in Texas, North Carolina and Arkansas

Texas voters break early voting primary record

Pentagon releases names of first U.S. service members killed in Iran war

Prince Reza Pahlavi’s extended 60 Minutes interview

World

Pain at release of terrorists under Israel-Hamas hostage deal

U.S. and Israel increasingly isolated amid calls for a cease-fire

Israel and Hamas trade blame as return of bodies threatens Gaza ceasefire

Tourists to Japan spooked after comic book predicts doomsday

Bridge collapse in India leaves 2 people dead, multiple injured

Business

Unemployment benefits expire for over 7.5 million Americans

Luis Miranda, Lin-Manuel Miranda’s dad, to write “Relentless” book, memoir

Palestinian prisoners released as part of ceasefire deal

Charlie Kirk shot and killed at campus event

Last-Minute Tricks to Save on Halloween Treats!

Sport

Taylor Swift and Caitlin Clark watch Chiefs game after album drop

College football news: Nick Saban claims NIL has ‘hurt’ the SEC

White Sox fan Pope Leo XIV jabs Cubs supporter during Vatican appearance

UNC coach Bill Belichick, girlfriend Jordon Hudson’s hot mic moments leaked

Christian McCaffrey’s touchdown vs Falcons should have been penalty, ex-ref says

Does Ticketmaster have a stranglehold on concert ticketing — or is it just ‘bringing joy’?

Anker’s last-gen sleep buds are nearly 40 percent off ahead of daylight saving time

Android’s Find Hub adds iPhone-like luggage tracking links

What Trump’s war on Iran means for the US energy crunch

Google’s latest Pixel drop allows Gemini to order groceries for you and more

An OpenAI safety research lead departed for Anthropic

You may be interested

Why polls are staying open an hour longer for the Democratic primary in Dallas County, Texas

Bill Gates among 7 asked to testify before House committee on possible Epstein ties

NFL news: Jets’ Breece Hall has cryptic tweet after being franchise tagged

Latest News

Hot Posts

Why polls are staying open an hour longer for the Democratic primary in Dallas County, Texas

Bill Gates among 7 asked to testify before House committee on possible Epstein ties

NFL news: Jets’ Breece Hall has cryptic tweet after being franchise tagged

I read every day — everyone needs to read these books in March | Books | Entertainment

Courtney Love Teases Hole Tour With Melissa Auf der Maur

Why polls are staying open an hour longer for the Democratic primary in Dallas County, Texas

Bill Gates among 7 asked to testify before House committee on possible Epstein ties

3/3: CBS Evening News – CBS News

Jamie Ager projected to win Democratic primary in North Carolina’s 11th Congressional District

Whatley, Cooper win North Carolina primaries, CBS News projects, teeing up key Senate contest

Father of alleged school shooter found guilty of murder

Polls start closing in today’s primaries in Texas, North Carolina and Arkansas

Texas voters break early voting primary record

Pentagon releases names of first U.S. service members killed in Iran war

Prince Reza Pahlavi’s extended 60 Minutes interview

World

Pain at release of terrorists under Israel-Hamas hostage deal

U.S. and Israel increasingly isolated amid calls for a cease-fire

Israel and Hamas trade blame as return of bodies threatens Gaza ceasefire

Tourists to Japan spooked after comic book predicts doomsday

Bridge collapse in India leaves 2 people dead, multiple injured

Business

Unemployment benefits expire for over 7.5 million Americans

Luis Miranda, Lin-Manuel Miranda’s dad, to write “Relentless” book, memoir

Palestinian prisoners released as part of ceasefire deal

Charlie Kirk shot and killed at campus event

Last-Minute Tricks to Save on Halloween Treats!

Sport

Taylor Swift and Caitlin Clark watch Chiefs game after album drop

College football news: Nick Saban claims NIL has ‘hurt’ the SEC

White Sox fan Pope Leo XIV jabs Cubs supporter during Vatican appearance

UNC coach Bill Belichick, girlfriend Jordon Hudson’s hot mic moments leaked

Christian McCaffrey’s touchdown vs Falcons should have been penalty, ex-ref says