Ptechhub
  • News
  • Industries
    • Enterprise IT
    • AI & ML
    • Cybersecurity
    • Finance
    • Telco
  • Brand Hub
    • Lifesight
  • Blogs
No Result
View All Result
  • News
  • Industries
    • Enterprise IT
    • AI & ML
    • Cybersecurity
    • Finance
    • Telco
  • Brand Hub
    • Lifesight
  • Blogs
No Result
View All Result
PtechHub
No Result
View All Result

OpenAI’s New GPT 4.1 Models Excel at Coding

By Wired by By Wired
April 14, 2025
Home AI & ML
Share on FacebookShare on Twitter


OpenAI announced today that it is releasing a new family of artificial intelligence models optimized to excel at coding, as it ramps up efforts to fend off increasingly stiff competition from companies like Google and Anthropic. The models are available to developers through OpenAI’s application programming interface (API).

OpenAI is releasing three sizes of models: GPT 4.1, GPT 4.1 Mini, and GPT 4.1 Nano. Kevin Weil, chief product officer at OpenAI, said on a livestream that the new models are better than OpenAI’s most widely used model, GPT-4o, and better than its largest and most powerful model, GPT-4.5, in some ways.

GPT-4.1 scored 55 percent on SWE-Bench, a widely used benchmark for gauging the prowess of coding models. The score is several percentage points above that of other OpenAI models. The new models are “great at coding, they’re great at complex instruction following, they’re fantastic for building agents,” Weil said.

The capacity for AI models to write and edit code has improved significantly in recent months, enabling more automated ways of prototyping software, and improving the abilities of so-called AI agents. In the past few months, rivals like Anthropic and Google have both introduced models that are especially good at writing code.

The arrival of GPT-4.1 has been widely rumored in recent weeks. OpenAI apparently tested the model on some popular leaderboards under the pseudonym Alpha Quasar, sources say. Some users of the “stealth” model reported impressive coding abilities. “Quasar fixed all the open issues I had with other code genarated [sic] via llms’s which was incomplete,” one person wrote on Reddit.

“Developers care a lot about coding and we’ve been improving our model’s ability to write functional code,” Michelle Pokrass, who works on post-training at OpenAI, said during the Monday livestream. “We’ve been working on making it follow different formats and better explore repos, run unit tests and write code that compiles.”

Over the past couple of years, OpenAI has parlayed feverish interest in ChatGPT, a remarkable chatbot first unveiled in late 2022, into a growing business selling access to more advanced chatbots and AI models. In a TED interview last week, Altman said that OpenAI had 500 million weekly active users, and that usage was “growing very rapidly.”

OpenAI now offers a smorgasbord of different flavors of models with different capabilities and different pricing. The company’s largest and most powerful model, called GPT-4.5, was launched in February, though OpenAI called the launch a “research preview” because the product is still experimental.

The company also offers models called o1 and o3 that are capable of performing a simulated kind of reasoning, breaking a problem down into parts in order to solve it. These models also take longer to respond to queries and are more expensive for users.

ChatGPT’s success has inspired an army of imitators, and rival AI players have ramped up their investments in research in an effort to catch up to OpenAI in recent years. A report on the state of AI published by Stanford University this month found that models from Google and DeepSeek now have similar capabilities to models from OpenAI. It also showed a gaggle of other firms including Anthropic, Meta, and the French firm Mistral in close pursuit.



Source link

Tags: anthropicArtificial Intelligencechatbotschatgptdeepseekgoogleopenaisam altman
By Wired

By Wired

Next Post
Cision Announces 0 Million New Money Financing, Refinancing, Extension of Debt Maturities

Cision Announces $250 Million New Money Financing, Refinancing, Extension of Debt Maturities

Recommended.

How Health Insurers Can Turn AI Experiments into Impact: Insights from Info-Tech Research Group on Scaling AI in the Insurance Industry

How Health Insurers Can Turn AI Experiments into Impact: Insights from Info-Tech Research Group on Scaling AI in the Insurance Industry

August 25, 2025
Ultra-wideband (UWB) Market worth .62 billion by 2030 – Exclusive Report by MarketsandMarkets™

Ultra-wideband (UWB) Market worth $17.62 billion by 2030 – Exclusive Report by MarketsandMarkets™

October 2, 2025

Trending.

⚡ Weekly Recap: Oracle 0-Day, BitLocker Bypass, VMScape, WhatsApp Worm & More

⚡ Weekly Recap: Oracle 0-Day, BitLocker Bypass, VMScape, WhatsApp Worm & More

October 6, 2025
Cloud Computing on the Rise: Market Projected to Reach .6 Trillion by 2030

Cloud Computing on the Rise: Market Projected to Reach $1.6 Trillion by 2030

August 1, 2025
Stocks making the biggest moves midday: Autodesk, PayPal, Rivian, Nebius, Waters and more

Stocks making the biggest moves midday: Autodesk, PayPal, Rivian, Nebius, Waters and more

July 14, 2025
The Ultimate MSP Guide to Structuring and Selling vCISO Services

The Ultimate MSP Guide to Structuring and Selling vCISO Services

February 19, 2025
Translators’ Voices: China shares technological achievements with the world for mutual benefit

Translators’ Voices: China shares technological achievements with the world for mutual benefit

June 3, 2025

PTechHub

A tech news platform delivering fresh perspectives, critical insights, and in-depth reporting — beyond the buzz. We cover innovation, policy, and digital culture with clarity, independence, and a sharp editorial edge.

Follow Us

Industries

  • AI & ML
  • Cybersecurity
  • Enterprise IT
  • Finance
  • Telco

Navigation

  • About
  • Advertise
  • Privacy & Policy
  • Contact

Subscribe to Our Newsletter

  • About
  • Advertise
  • Privacy & Policy
  • Contact

Copyright © 2025 | Powered By Porpholio

No Result
View All Result
  • News
  • Industries
    • Enterprise IT
    • AI & ML
    • Cybersecurity
    • Finance
    • Telco
  • Brand Hub
    • Lifesight
  • Blogs

Copyright © 2025 | Powered By Porpholio