1. Spotlight: New Thinking about Cloud Computing
  2. News
  3. Events
  4. Newsletters
  5. Resources

About

  • About Us
  • Advertise
  • Contact Us
  • Foundry Careers
  • Reprints
  • Newsletters
  • Contribute to Computerworld

Policies

  • Terms of Service
  • Privacy Policy
  • Cookie Policy
  • Copyright Notice
  • Member Preferences
  • About AdChoices
  • E-commerce Affiliate Relationships
  • Your California Privacy Rights

Our Network

  • CIO
  • CSO
  • InfoWorld
  • Network World

More

  • News
  • Features
  • Opinion
  • Blogs
  • BrandPosts
  • Events
  • Podcasts
  • Videos
  • Enterprise Buyer’s Guides
  • Generative AI
  • Office Suites
  • Collaboration Software
  • Productivity Software
  • Windows
  • Android
  • Apple
  • Augmented Reality
  • Emerging Technology
  • Mobile
  • Remote Work
  • Artificial Intelligence
  • Operating Systems
  • Careers
  • IT Leadership
  • IT Management
  • IT Operations
  • Industry
  • Cloud Computing
  • Computers and Peripherals
  • Analytics
  • Data Center
  • Enterprise Applications
  • Networking
  • Security
  • Vendors and Providers
  • Enterprise Buyer’s Guides

Americas

  • United States

Asia

  • India
  • Korea (대한민국)

Europe

  • Netherlands
  • United Kingdom
  • Germany (Deutschland)
  • Poland (Polska)
  • Spain (España)
  • Sweden (Sverige)

Oceania

  • Australia
  • New Zealand
Popular Topics
  • Generative AI
  • Productivity Software
  • Windows
  • Android
  • Apple
  • Remote Work
  • Spotlight: New Thinking about Cloud Computing
  • News
  • Events
  • Newsletters
  • Resources

Topics

  • Generative AI
  • Office Suites
  • Collaboration Software
  • Productivity Software
  • Windows
  • Android
  • Apple
  • Augmented Reality
  • Emerging Technology
  • Mobile
  • Remote Work
  • Artificial Intelligence
  • Operating Systems
  • Careers
  • IT Leadership
  • IT Management
  • IT Operations
  • Industry
  • Cloud Computing
  • Computers and Peripherals
  • Analytics
  • Data Center
  • Enterprise Applications
  • Networking
  • Security
  • Vendors and Providers
  • Enterprise Buyer’s Guides

About

  • About Us
  • Advertise
  • Contact Us
  • Foundry Careers
  • Reprints
  • Newsletters
  • Contribute to Computerworld

Policies

  • Terms of Service
  • Privacy Policy
  • Cookie Policy
  • Copyright Notice
  • Member Preferences
  • About AdChoices
  • E-commerce Affiliate Relationships
  • Your California Privacy Rights

Our Network

  • CIO
  • CSO
  • InfoWorld
  • Network World

More

  • News
  • Features
  • Opinion
  • Blogs
  • BrandPosts
  • Events
  • Podcasts
  • Videos
  • Enterprise Buyer’s Guides
    1. Home
    2. Artificial Intelligence
    3. What's next for generative AI in 2025?
    Lucas Mearian
    by Lucas Mearian
    Senior Reporter

    What’s next for generative AI in 2025?

    feature
    Dec 31, 202410 mins
    CIOChief Data OfficerGenerative AI

    Small language models capable of addressing specific tasks without burning through processing power are likely to surge in popularity.

    Generative AI, KI, GenAI, Buzzwords, 16:9
    Credit: iQoncept - shutterstock.com

    In 2024, the surge in generative AI (genAI) pilot projects sparked concerns over high experimentation costs and uncertain benefits. That prompted companies to then shift their focus to delivering business outcomes, enhancing data quality, and developing talent.

    In 2025, enterprises are expected to prioritize strategy, add business-IT partnerships to assist with genAI projects and move from large language model (LLM) pilots to production instances. And small language models will also likely come into their own, addressing specific tasks without overburdening data center processing and power.

    Organizations will also adopt new technologies and architectures to better govern data and AI, with a return to predictive AI, according to Forrester Research.

    Predictive AI uses historical data and techniques such as machine learning and statistics to forecast future events or behaviors, said Forrester analyst Jayesh Chaurasia. GenAI, on the other hand, creates new content — such as images, text, videos, or synthetic data — leveraging deep learning methods such as generative adversarial networks (GANs). Chaurasia predicts the AI pendulum will swing back to predictive AI for over 50% of use cases.

    LLMs are, of course, central to genAI, helping enterprises tackle complex tasks and improve operations. Forrester reported that 55% of US genAI decision-makers with a strategy use LLMs embedded in applications, while 33% purchase domain-specific genAI apps. Meanwhile, SLMs are quickly gaining attention.

    The rise of small and mid-sized language models should enable customers to better meet the trade-offs on accuracy, speed and costs, said Arun Chandrasekaran, a distinguished vice president analyst with Gartner Research, noting that “Most organizations are still struggling to realize business value from their genAI investment.”

    GenAI challenges

    Gartner

    In the coming year, SLM integration could surge by as much as 60%, according to a Forrester report.

    As nearly eight-in-10 IT decision makers report software costs rising over the past year, they’re looking to SLMs because they’re more cost-effective and offer better accuracy, relevance, and trustworthiness by training on specific domains. They’re also easier to integrate and excel in specialized industries such as finance, healthcare, and legal services.

    By 2025, 750 million apps are expected to use LLMs, underscoring the genAI market’s rapid growth. Forrester predicts the market will grow in value from $1.59 billion in 2023 to $259.8 billion by 2030,. 

    Even with that growth, many AI experts argue that LLMs may be excessive for automating workflows and repetitive tasks, both in terms of performance and environmental impact. A Cornell University study found that training OpenAI’s GPT-3 LLM consumed 500 metric tons of carbon, the equivalent of 1.1 million pounds.

    As enterprises face challenges meeting expectations, gen AI investments in 2025 will likely shift toward proven predictive AI applications like maintenance, personalization, supply chain optimization, and demand forecasting. Forward-thinking organizations will also recognize the synergy between predictive and generative AI, using predictions to enhance generative outputs. That approach is expected to boost the share of combined use cases from 28% today to 35%, according to Forrester.

    What are small language models?

    In the simplest of terms, an SLM is a lightweight genAI model. The “small” in this context refers to the size of the model’s neural network, the number of parameters, and the volume of data on which it is trained, according to Rosemary Thomas, a senior technical researcher in the AI lab at Version 1, a management consulting and software development firm.

    SLMs use fewer computational resources, enabling on-premises or private cloud deployment, which natively enhances privacy and security.

    While some SLM implementations can require substantial compute and memory resources, several models can have more than 5 billion parameters and run on a single GPU, Thomas said.

    Gartner Research defines SLMs differently, as language models with 10 billion parameters or less. Compared to LLMs, they are two to three orders of magnitude (around 100-1,000x) smaller, making them significantly more cost-efficient to use or customize.

    SLMs include Google Gemini Nano, Microsoft’s Orca-2–7b and Orca-2–13b, Meta’s Llama-2–13b, and others, Thomas noted in a recent post, arguing that SLM growth is being driven by the need for more efficient models and the speed at which they can be trained and set up.

    small language models and large language models

    Gartner

    “SLMs have gained popularity due to practical considerations such as computational resources, training time, and specific application requirements,” Thomas said. “Over the past couple of years, SLMs have become increasingly relevant, especially in scenarios where sustainability and efficiency are crucial.”

    SLMs enable most organizations to achieve task specialization, improving the accuracy, robustness, and reliability of genAI solutions, according to Gartner. And because deployment costs, data privacy, and risk mitigation are key challenges when using genAI, SLMs offer a cost-effective and energy-efficient alternative to LLMs for most organizations, Gartner said.

    Three out of four (75%) of IT-decision makers believe SLMs outperform LLMs in speed, cost, accuracy and ROI, according to a Harris Poll of more than 500 users commissioned by the start-up Hyperscience.

    “Data is the lifeblood of any AI initiative, and the success of these projects hinges on the quality of the data that feeds the models,” said Andrew Joiner, CEO of Hyperscience, which develops AI-based office work automation tools. “Alarmingly, three out of five decision makers report their lack of understanding of their own data inhibits their ability to utilize genAI to its maximum potential. The true potential…lies in adopting tailored SLMs, which can transform document processing and enhance operational efficiency.”

    Gartner recommends that organizations customize SLMs to specific needs for better accuracy, robustness, and efficiency. “Task specialization improves alignment, while embedding static organizational knowledge reduces costs. Dynamic information can still be provided as needed, making this hybrid approach both effective and efficient,” the research firm said.

    In highly regulated industries, such as financial services, healthcare and pharmaceuticals, the future of LLMs is definitely small, according to  Emmanuel Walckenaer, CEO of Yseop, a vendor that offers pre-trained genAI models for the BioPharma industry.

    Smaller, more specialized models will reduce wasted time and energy spent on building large models that aren’t needed for current tasks, according to Yseop.

    Agentic AI holds promise, but it’s not yet mature

    In the year ahead, there is likely to be a rise in domain-specific AI agents, “although it is unclear how many of these agents can live up to the lofty expectations,” according to Gartner’s Chandrasekaran.

    While Agentic AI architectures are a top emerging technology, they’re still two years away from reaching the lofty automation expected of them, according to Forrester.

    While companies are eager to push genAI into complex tasks through AI agents, the technology remains challenging to develop because it mostly relies on synergies between multiple models, customization through retrieval augmented generation (RAG), and specialized expertise. “Aligning these components for specific outcomes is an unresolved hurdle, leaving developers frustrated,” Forrester said in its report.

    A recent Capital One survey of 4,000 business leaders and technical practitioners across industries found that while 87% believe their data ecosystem is ready for AI at scale, 70% of technologists spend hours daily fixing data issues.

    Still, Capital One’s survey revealed strong optimism among business leaders about their companies’ AI readiness. Notably, 87% believe they have a modern data ecosystem for scaling AI solutions, 84% report having centralized tools and processes for data management, 82% are confident in their data strategy for AI adoption, and 78% feel prepared to manage the increasing volume and complexity of AI-driven data.

    And yet, 75% of enterprises attempting to build AI agents in-house next year are expected to fail, opting instead for consulting services or pre-integrated agents from existing software vendors. To address the mismatch between AI data preparedness and real-world complexities in 2025, 30% of enterprise CIOs will integrate Chief Data Officers (CDOs) into their IT teams as they lead AI initiatives, according to Forrester Research. CEOs will rely on CIOs to bridge the gap between technical and business expertise, recognizing that successful AI requires both solid data foundations and effective stakeholder collaboration.

    Forrester’s 2024 survey also showed that 39% of senior data leaders report to CIOs, with a similar 37% reporting to CEOs — and that trend is growing. To drive AI success, CIOs and CEOs must elevate CDOs beyond being mere liaisons, positioning them as key leaders in AI strategy, change management, and delivering ROI.

    A growing interest in multi-modality — and upskilling

    Emerging use cases for multi-modality, particularly image and speech as modalities in both genAI inputs and outputs, will also see more adoption in 2025.

    Multimodal learning, a subfield of AI, enhances machine learning by training models on diverse data types, including text, images, videos, and audio. The approach enables models to identify patterns and correlations between text and associated sensory data.

    By integrating multiple data types, multimodal AI expands the capabilities of intelligent systems. These models can process various input types and generate diverse outputs. For example, GPT-4, the foundation of ChatGPT, accepts both text and image inputs to produce text outputs, while OpenAI’s Sora model generates videos from text.

    Other examples include medical imaging, patient history, and lab results that can be integrated to enhance pateitn diagnosis and treatment. In financial services, multimodal AI can analyze customer phone queries to assist contact center employees in resolving issues. And in the automotive industry inputs from cameras, GPS, and LiDAR can be integrated by AI to enhance autonomous driving, emergency response, and navigation for companies, such as Tesla, Waymo and Li Auto.

    AI leaders have also realized they need to prioritize business outcomes, clean their data houses, and start building AI talent. The latter is especially important given the growing gap between enterprise genAI needs and the workers with skills to meet those needs.

    “In the year ahead, you’ll need to put your nose to the grindstone to develop an effective AI strategy and implementation plan,” Forrester said in its report. “In 2025, organizational success will depend on strong leadership, strategic refinement, and recalibration of enterprise data and AI initiatives commensurate with AI aspirations.”

    Related content

    news

    Meta officially ‘acqui-hires’ Scale AI — will it draw regulator scrutiny?

    By Taryn Plumb
    Jun 13, 2025 6 mins
    Artificial Intelligence Generative AI
    news analysis

    Apple WWDC 2025: News and analysis

    By Dan Muse
    Jun 13, 2025 3 mins
    Apple Developer Generative AI
    news analysis

    Agentic AI – Ongoing coverage of its impact on the enterprise

    By Dan Muse
    Jun 12, 2025 5 mins
    Emerging Technology Generative AI IT Leadership
    opinion

    WWDC: For developers, Apple’s tools get a lot better for AI

    By Jonny Evans
    Jun 12, 2025 5 mins
    Apple Developer Generative AI

    Other Sections

    • Podcasts
    • Videos
    • Resources
    • Events
    • Spotlight: New Thinking about Cloud Computing
    SUBSCRIBE TO OUR NEWSLETTER

    From our editors straight to your inbox

    Get started by entering your email address below.
    Please enter a valid email address
    Lucas Mearian
    by Lucas Mearian
    Senior Reporter
    1. Follow Lucas Mearian on X

    With a career spanning more than two decades in journalism and technology research, Lucas Mearian is a seasoned writer, editor, and former IDC analyst with deep expertise in enterprise IT, infrastructure systems, and emerging technologies. Currently a senior writer at Computerworld covering AI, the future of work, healthcare IT and financial services IT, his 23-year tenure has included roles such as Senior Technology Editor and Data Storage Channel Editor, where he covered cutting-edge topics like blockchain, 3D printing, sustainable IT, and autonomous vehicles. He has appeared on several podcasts, including Foundry’s Today In Tech. He also served as a research manager at IDC, where he focused on software-defined infrastructure, compute, and storage within the Infrastructure Systems, Platforms, and Technologies group.

    Before entering tech media, he served as Editor-in-Chief of the Waltham Daily News Tribune and as a senior reporter for the MetroWest Daily News. He’s won first place awards from the New England Press Association, the American Association of Business Publication Editors, and has been a finalist for several Jesse H. Neal Awards for outstanding business journalism. A former U.S. Marine Corps sergeant who served in reconnaissance, he brings a disciplined, analytical mindset to his work, along with outstanding writing, research, and public speaking skills.

    More from this author

    • news analysis

      Surprise! Employers are using AI to interview you

      Jun 12, 2025 8 mins
    • news analysis

      Tech layoffs surge even as US unemployment remains stable

      Jun 6, 2025 6 mins
    • news

      GenAI adoption outpaces governance, Ernst & Young finds

      Jun 5, 2025 4 mins
    • news analysis

      PwC: GenAI boosts worker value, wages, and productivity everywhere

      Jun 3, 2025 4 mins
    • news analysis

      Do you trust AI? Here’s why half of users don’t

      Jun 3, 2025 7 mins
    • news analysis

      AI chatbots deliver minimal productivity gains, study finds

      Jun 2, 2025 8 mins
    • news analysis

      AI and economic pressures reshape tech jobs amid layoffs

      May 29, 2025 8 mins
    • news analysis

      Number of US workers with side hustles reaches all-time high as economic worries rise

      May 27, 2025 5 mins

    Show me more

    feature

    Microsoft's Patch Tuesday updates: Keeping up with the latest fixes

    By Dan Muse
    Jun 13, 20255 mins
    MicrosoftMicrosoft OfficeWindows 10
    Image
    analysis

    For June’s Patch Tuesday, 68 fixes — and two zero-day flaws

    By Greg Lambert
    Jun 13, 20258 mins
    MicrosoftMicrosoft OfficeWindows Security
    Image
    opinion

    Apple’s AI Revolution: Insights from WWDC

    By Jonny Evans
    Jun 13, 20255 mins
    AppleArtificial IntelligenceWWDC
    Image
    podcast

    How to be a leader through constant chaos

    Jun 3, 202549 mins
    Employee ExperienceIT Leadership
    Image
    podcast

    Podcast: Are your employees spying on you?

    May 6, 202525 mins
    CybercrimePhysical SecuritySecurity
    Image
    podcast

    Podcast: Are companies wasting their money on failed AI projects?

    Apr 8, 202535 mins
    Artificial IntelligenceBudgetingGenerative AI
    Image
    video

    AI is turning your online data into a real-world threat

    Jun 10, 202539 mins
    CybercrimeData PrivacyInternet Security
    Image
    video

    Leadership Chaos: How to stay steady when everything is shifting

    Jun 3, 202549 mins
    Employee ExperienceIT Leadership
    Image
    video

    Why do tech titans keep trying to 'kill' the smartphone?

    May 27, 202546 mins
    AppleAugmented RealitySmartphones
    Image

    Sponsored Links

    • Empower your cybersecurity team with expert insights from Palo Alto Networks.
    • Secure AI by Design: Unleash the power of AI and keep applications, usage and data secure.
    About
    • About Us
    • Advertise
    • Contact Us
    • Foundry Careers
    • Reprints
    • Newsletters
    • BrandPosts
    • News
    • Features
    • Opinions
    • How-to
    • Blogs
    Policies
    • Terms of Service
    • Privacy Policy
    • Cookie Policy
    • Copyright Notice
    • Member Preferences
    • About AdChoices
    • E-commerce Affiliate Relationships
    • Your California Privacy Rights
    • Privacy Settings
    Our Network
    • CIO
    • CSO
    • Infoworld
    • Network World
    • Facebook
    • X
    • YouTube
    • Google News
    • LinkedIn
    © 2025 IDG Communications, Inc. All Rights Reserved.
`, cio: `

🚀 The new CIO.com hybrid search: 🔍 Explore CIO content smarter, faster and AI powered. ✨

`, nww: `

🚀 The new NetworkWorld.com hybrid search: 🔍 Explore NetworkWorld content smarter, faster and AI powered. ✨

`, cw: `

🚀 The new Computerworld.com hybrid search: 🔍 Explore Computerworld content smarter, faster and AI powered. ✨

`, cso: `

🚀 The new CSOonline.com hybrid search: 🔍 Explore CSO content smarter, faster and AI powered. ✨

` }; const sharedStyles = ` `; const publisher = foundry_get_publisher(); const htmlContent = contentSwitch[publisher]; if (!htmlContent || !document.body) return; document.body.insertAdjacentHTML("afterbegin", htmlContent + sharedStyles); const bar = document.querySelector(".section-block--announcementbar"); if (bar) { requestAnimationFrame(() => { bar.classList.add("section-block--announcementbar--visible"); }); } const btn = document.querySelector(".section-block--announcementbar .reset-button"); const searchIcon = document.querySelector('.header__icon-button[data-menu-trigger="search"] svg'); const searchTrigger = document.querySelector('[data-menu-trigger="search"]'); if (searchIcon) { searchIcon.innerHTML = ''; } if (btn && searchTrigger) { btn.addEventListener("click", () => searchTrigger.click()); } console.log("[MISO SCRIPT] Conditions met, initializing Miso search announcements."); }; initMisoSearchAnnouncements(); }); document.addEventListener('consentManagerReady', () => { const hasConsentYouTube = consentManager.checkConsentByVendors([ 'YouTube', 'YT' ]); if (hasConsentYouTube.some(vendor => vendor['Has Consent'] === false)) { console.log('[YOUTUBE SCRIPT] Consent not given for YouTube.'); } else { console.log('[YOUTUBE SCRIPT] Consent given for YouTube. Loading script...'); } }); document.addEventListener('consentManagerReady', () => { const hasConsentGAM = consentManager.checkConsentByVendors([ 'Google Ad Manager', 'GAM' ]); if (hasConsentGAM.some(vendor => vendor['Has Consent'] === false)) { console.log('[GAM SCRIPT] Consent not given for GAM.'); } else { console.log('[GAM SCRIPT] Consent given for GAM. Loading script...'); } }); document.addEventListener('consentManagerReady', () => { const hasConsentGoogleFonts = consentManager.checkConsentByVendors([ 'Google Fonts', 'Google Web Fonts' ]); if (hasConsentGoogleFonts.some(vendor => vendor['Has Consent'] === false)) { console.log('[GOOGLE FONTS SCRIPT] Consent not given for Google Fonts.'); } else { console.log('[GOOGLE FONTS SCRIPT] Consent given for Google Fonts. Loading script...'); } }); document.addEventListener('consentManagerReady', () => { const hasConsentAdobeTypekit = consentManager.checkConsentByVendors([ 'Adobe Typekit' ]); if (hasConsentAdobeTypekit.every(vendor => vendor['Has Consent'] === true)) { if (foundry_is_edition('kr')) { const link = document.createElement('link'); link.rel = 'stylesheet'; link.href = 'https://use.typekit.net/ysx4dcu.css'; document.head.appendChild(link); } } }); document.addEventListener('consentManagerReady', () => { const vendors = ['Subscribers']; const hasConsentSubscribers = consentManager.checkConsentByVendors(vendors); if (hasConsentSubscribers.some(vendor => vendor['Has Consent'] === false)) { return; } else { if (foundry_is_language('en')) { console.log('Language is English'); // subscribers english .. } if (foundry_is_edition('kr')) { console.log('Edition is Korean'); // subscribers in korean .. } if (foundry_is_edition('ja')) { console.log('Edition is Japanese'); // subscribers in japanese .. } } });