AI-generated code. OpenAI revealed that in testing, individuals who used CriticGPT to review code generated by ChatGPT outperformed those who did not use the model 60 per cent of the time. The research findings have been documented in a recently published paper.

The RLHF framework involves human evaluators, known as AI trainers, providing feedback on the AI's performance to help adjust and enhance the model's behaviour.
For CriticGPT, trainers inserted intentional errors into code samples that already contained natural mistakes, and then wrote example feedback for those errors. The model's performance was evaluated on its ability to identify both the naturally occurring and the intentionally inserted errors. According to OpenAI, CriticGPT demonstrated a 63 per cent improvement over ChatGPT in catching code errors. However, the model has certain limitations.
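This style of evaluation can be thought of as measuring a "catch rate" over a set of known, deliberately planted bugs. The sketch below is purely illustrative and not OpenAI's actual evaluation code; the function name `catch_rate` and the example bug descriptions are invented for this example.

```python
# Illustrative sketch (not OpenAI's actual evaluation pipeline): scoring a
# critic model by what fraction of known inserted bugs it catches.
def catch_rate(critic_findings, inserted_bugs):
    """Return the fraction of known inserted bugs present in the critic's findings."""
    if not inserted_bugs:
        return 0.0
    caught = sum(1 for bug in inserted_bugs if bug in critic_findings)
    return caught / len(inserted_bugs)

# Hypothetical example: a trainer planted two bugs; the critic flagged one
# of them, plus an unrelated stylistic issue.
inserted = {"off-by-one in loop bound", "unchecked None return"}
findings = {"off-by-one in loop bound", "missing docstring"}
print(catch_rate(findings, inserted))  # → 0.5
```

In practice, matching a critic's free-text comments to specific planted bugs is done by human graders, not exact string comparison; the set intersection here simply stands in for that judgement.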
It has primarily been trained on short code snippets and has not yet been tested on longer, more complex coding tasks. The model also still generates incorrect factual responses at times, a phenomenon known as hallucination, and it has not been tested in scenarios where multiple errors are dispersed throughout the code.

OpenAI suggests that CriticGPT is primarily intended to deepen the company's understanding of training techniques that produce higher-quality AI outputs.
Read more on livemint.com