OpenAI Reinforcement learning

News

Former DeepSeeker and collaborators release new method for training reliable AI agents: RAGEN

RAGEN stands out not just as a technical contribution but as a conceptual step toward more autonomous, reasoning-capable AI agents.

22h

12 Proven Tips on How To Make Money With ChatGPT

Are you curious about how to earn some extra cash using ChatGPT? ChatGPT isn't just a run-of-the-mill artificial intelligence ...

TechRound1d

Saying “Please” And “Thank You” To ChatGPT Is Actually Costing Millions

ChatGPT welcomes kind words, but those “pleases” and “thank yous” are not free. X user, @tomieinlove had tweeted, “I ...

AI News This Week from Google, OpenAI, Meta and Anthropic

Uncover the week’s top AI developments, from Google’s AGI push to Anthropic’s Claude updates, and their implications for the ...

Google just fired the first shot of the next battle in the AI war

A new research paper proposes that AI models and agents go out into the world and generate their own data. You can read it as ...

OpenAI's most capable models hallucinate more than earlier ones

OpenAI says its latest models, o3 and o4-mini, are its most powerful yet. However, research shows the models also hallucinate more -- at least twice as much as earlier models.

MediaNama2d

New OpenAI Models Hallucinating More Than Their Predecessor

OpenAI's new AI models are hallucinating more than their predecessor, as per an internal testing report released by the ...

Annoyed ChatGPT users complain about bot’s relentlessly positive tone

This creates a feedback loop where AI language models learn that enthusiasm and flattery lead to higher ratings from humans, even when those responses sacrifice factual accuracy or helpfulness. The ...

OpenAI's newest o3 and o4-mini models excel at coding and math – but hallucinate more often

Historically, each new generation of OpenAI's models has delivered incremental improvements in factual accuracy, with ...

Unite.AI2d

Inside OpenAI’s o3 and o4‑mini: Unlocking New Possibilities Through Multimodal Reasoning and Integrated Toolsets

OpenAI released upgraded versions of its advanced reasoning models. These new models, named o3 and o4-mini, offer ...

InsideHook on MSN2d

Do OpenAI's New Models Have a Hallucination Problem?

OpenAI announced the release of a pair of models, o3 and o4-mini. In announcing them, the company referred to them as “the ...

OpenAI’s New AI Models Face Troubling Increase in Hallucinations

According to OpenAI’s internal testing, the new o3 model hallucinated in 33% of cases on the company’s PersonQA benchmark.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results