Ptechhub
  • News
  • Industries
    • Enterprise IT
    • AI & ML
    • Cybersecurity
    • Finance
    • Telco
  • Brand Hub
    • Lifesight
  • Blogs
No Result
View All Result
  • News
  • Industries
    • Enterprise IT
    • AI & ML
    • Cybersecurity
    • Finance
    • Telco
  • Brand Hub
    • Lifesight
  • Blogs
No Result
View All Result
PtechHub
No Result
View All Result

IBM expands small AI model family

By CIO Dive by By CIO Dive
February 26, 2025
Home Enterprise IT
Share on FacebookShare on Twitter


This audio is auto-generated. Please let us know if you have feedback.

Dive Brief:

  • IBM is expanding its AI menu with the launch of Granite 3.2, a family of small AI models designed for enterprise use, the vendor said Wednesday.
  • Granite 3.2 Instruct is aimed at completing complex reasoning tasks, mathematical problems and general language requests. Enterprise customers can switch off chain-of-thought reasoning capabilities, which are often more expensive and time-consuming, within the Granite 3.2 2B and 8B Instruct models to optimize compute efficiency, the vendor said. 
  • IBM also added an assortment of models with varying context lengths and forecasting horizons, including a smaller option of Granite Guardian for risk assessment. The latest additions come a few months after the third generation of the Granite series launched in October. “Much of our ongoing research aims to take advantage of the inherently longer, more robust thought process of Granite 3.2 for further model optimization,” IBM said in the release.

Dive Insight:

Large language models might have dominated the enterprise generative AI conversation initially, but business leaders are turning to lightweight versions as they look to rein in costs and boost efficiency.

Smaller models typically use less computing power and are often tailored to complete specific tasks. Enterprises have struggled in both areas with LLMs resulting in delayed projects due to computing availability and prioritization of domain-specific capabilities in AI purchases. 

Vendor options are plentiful, from Google’s lightweight Gemma models to Microsoft’s Phi family and OpenAI’s o3-mini.

“We have been very vocal for about a year that smaller models and more reasonable training times are going to be essential for enterprise deployment of large language models,” IBM CEO Arvind Krishna said during IBM’s Q4 2024 earnings call in January. “We see as much as 30 times reduction in inference costs using these approaches.”

The company attributed around $1.5 billion in new bookings during the latest quarter to its generative AI business. Retail and commercial banking company NatWest and defense manufacturer Lockheed Martin are among the enterprise clients already utilizing IBM’s Granite models, according to Krishna. 

All Granite 3.2 models are available under the Apache 2.0 license on Hugging Face. Customers can also access select models on IBM watsonx.ai, LM Studio, Ollama and Replicate. 

“The release of Granite 3.2 marks only the beginning of IBM’s explorations into reasoning capabilities for enterprise models,” IBM said in a release.



Source link

By CIO Dive

By CIO Dive

Next Post
Trump plan to freeze funding stymies Biden-era energy rebates for consumers

Trump plan to freeze funding stymies Biden-era energy rebates for consumers

Recommended.

Analysis: Google Is Getting A Good Deal For Wiz, Actually

Analysis: Google Is Getting A Good Deal For Wiz, Actually

March 20, 2025
Trump’s Cook firing will likely end up in the Supreme Court’s hands

Trump’s Cook firing will likely end up in the Supreme Court’s hands

August 26, 2025

Trending.

Chai AI Announces Upcoming Rollout of Apple and Google Age Verification APIs to Enhance Platform Safety

Chai AI Announces Upcoming Rollout of Apple and Google Age Verification APIs to Enhance Platform Safety

March 10, 2026
Huawei lanceert Next Generation FAN-oplossing

Huawei lanceert Next Generation FAN-oplossing

March 7, 2026
Baidu Announces Fourth Quarter and Fiscal Year 2025 Results

Baidu Announces Fourth Quarter and Fiscal Year 2025 Results

February 26, 2026
Half of Google’s software development now AI-generated | Computer Weekly

Half of Google’s software development now AI-generated | Computer Weekly

February 5, 2026
Ghost Campaign Uses 7 npm Packages to Steal Crypto Wallets and Credentials

Ghost Campaign Uses 7 npm Packages to Steal Crypto Wallets and Credentials

March 24, 2026

PTechHub

A tech news platform delivering fresh perspectives, critical insights, and in-depth reporting — beyond the buzz. We cover innovation, policy, and digital culture with clarity, independence, and a sharp editorial edge.

Follow Us

Industries

  • AI & ML
  • Cybersecurity
  • Enterprise IT
  • Finance
  • Telco

Navigation

  • About
  • Advertise
  • Privacy & Policy
  • Contact

Subscribe to Our Newsletter

  • About
  • Advertise
  • Privacy & Policy
  • Contact

Copyright © 2025 | Powered By Porpholio

No Result
View All Result
  • News
  • Industries
    • Enterprise IT
    • AI & ML
    • Cybersecurity
    • Finance
    • Telco
  • Brand Hub
    • Lifesight
  • Blogs

Copyright © 2025 | Powered By Porpholio