-
Micro Metrics for LLM System Evaluation at QCon SF 2024
Denys Linkov's QCon San Francisco 2024 talk dissected the complexities of evaluating large language models (LLMs). He advocated for nuanced micro-metrics, robust observability, and alignment with business objectives to enhance model performance. Linkov’s insights highlight the need for multidimensional evaluation and actionable metrics that drive meaningful decisions.
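Linkov's case for micro-metrics is easiest to see in code. Below is a minimal, illustrative sketch (the metric names are ours, not from the talk): each response is scored on several narrow dimensions, and aggregating per dimension shows which one is failing, something a single pass/fail score would hide.

```python
import json

def is_valid_json(text: str) -> float:
    try:
        json.loads(text)
        return 1.0
    except json.JSONDecodeError:
        return 0.0

def micro_metrics(response: str) -> dict[str, float]:
    """Score one LLM response along several narrow dimensions."""
    return {
        "non_empty": float(bool(response.strip())),
        "within_length_budget": float(len(response) <= 1200),
        "no_refusal": float("i can't help" not in response.lower()),
        "valid_json": is_valid_json(response),
    }

# Per-dimension averages over a batch localize the failure mode.
batch = ['{"answer": 42}', "Sorry, I can't help with that."]
scores = [micro_metrics(r) for r in batch]
for name in scores[0]:
    print(name, sum(s[name] for s in scores) / len(scores))
```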
-
Optimizing Amazon ECS with Predictive Scaling
Amazon Web Services (AWS) recently released Predictive Scaling for Amazon ECS, an advanced scaling policy that employs machine learning (ML) algorithms to anticipate demand surges, ensuring applications remain highly available and responsive while minimizing resource overprovisioning.
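For readers who want to try it, here is a hedged sketch of attaching a predictive scaling policy to an ECS service with boto3. The overall shape follows AWS's predictive scaling APIs, but treat the exact field and metric names as assumptions to verify against the Application Auto Scaling documentation; the cluster and service names are placeholders.

```python
import boto3

aas = boto3.client("application-autoscaling")

aas.put_scaling_policy(
    PolicyName="ecs-predictive-cpu",
    ServiceNamespace="ecs",
    ResourceId="service/prod-cluster/web",        # placeholder cluster/service
    ScalableDimension="ecs:service:DesiredCount",
    PolicyType="PredictiveScaling",
    PredictiveScalingPolicyConfiguration={
        "MetricSpecifications": [{
            "TargetValue": 60.0,  # aim to keep average CPU near 60%
            "PredefinedMetricPairSpecification": {
                # Assumed metric-pair name; check the AWS docs.
                "PredefinedMetricType": "ECSServiceCPUUtilization",
            },
        }],
        # Forecast-only mode lets you inspect forecasts before they act.
        "Mode": "ForecastOnly",
    },
)
```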
-
Mistral AI Releases Pixtral Large: a Multimodal Model for Advanced Image and Text Analysis
Mistral AI released Pixtral Large, a 124-billion-parameter multimodal model designed for advanced image and text processing with a 1-billion-parameter vision encoder. Built on Mistral Large 2, it achieves leading performance on benchmarks like MathVista and DocVQA, excelling in tasks that require reasoning across text and visual data.
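Pixtral Large is served through Mistral's chat API, where images are passed inline alongside text. A minimal sketch with the mistralai Python SDK; the model alias and image URL are illustrative:

```python
import os
from mistralai import Mistral  # pip install mistralai

client = Mistral(api_key=os.environ["MISTRAL_API_KEY"])

resp = client.chat.complete(
    model="pixtral-large-latest",  # alias may change; check Mistral's model list
    messages=[{
        "role": "user",
        "content": [
            {"type": "text", "text": "Summarize the chart in this image."},
            {"type": "image_url", "image_url": "https://example.com/chart.png"},
        ],
    }],
)
print(resp.choices[0].message.content)
```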
-
Nexa AI Unveils Omnivision: a Compact Vision-Language Model for Edge AI
Nexa AI unveiled Omnivision, a compact vision-language model tailored for edge devices. By significantly reducing image tokens from 729 to 81, Omnivision lowers latency and computational requirements while maintaining strong performance in tasks like visual question answering and image captioning.
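The 9x reduction comes from regrouping the vision encoder's 729 patch embeddings into 81 wider tokens before projecting them into the language model. A sketch of that reshape-then-project step; the dimensions here are illustrative, not Omnivision's actual widths:

```python
import torch

batch, n_patches, dim = 1, 729, 1152              # 1152 is an assumed encoder width
patches = torch.randn(batch, n_patches, dim)      # stand-in for encoder output

compressed = patches.reshape(batch, 81, dim * 9)  # 729 tokens -> 81 wider tokens
projector = torch.nn.Linear(dim * 9, 896)         # 896: assumed LLM hidden size
llm_tokens = projector(compressed)

print(llm_tokens.shape)  # torch.Size([1, 81, 896]): 9x fewer image tokens
```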
-
QCon SF 2024 - Ten Reasons Your Multi-Agent Workflows Fail
At QCon SF 2024, Victor Dibia from Microsoft Research explored the complexities of multi-agent systems powered by generative AI. Highlighting common pitfalls like inadequate prompts and poor orchestration, he shared strategies for enhancing reliability and scalability. Dibia emphasized the need for meticulous design and oversight to unlock the full potential of these innovative systems.
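Dibia works on Microsoft's AutoGen framework, so as a hedged illustration (not code from the talk), here is a minimal pyautogen pair showing two orchestration controls that address common failure modes: an explicit termination condition and a cap on auto-replies. The model name is a placeholder, and the OpenAI key is read from the environment.

```python
from autogen import AssistantAgent, UserProxyAgent  # pip install pyautogen

assistant = AssistantAgent(
    "assistant",
    llm_config={"model": "gpt-4o-mini"},  # placeholder; uses OPENAI_API_KEY
)
user = UserProxyAgent(
    "user",
    human_input_mode="NEVER",
    code_execution_config=False,
    # Guard against runaway agent loops, a classic multi-agent failure:
    is_termination_msg=lambda m: "TERMINATE" in (m.get("content") or ""),
    max_consecutive_auto_reply=5,
)
user.initiate_chat(
    assistant,
    message="Plan a 3-step data-cleaning workflow, then reply TERMINATE.",
)
```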
-
QCon SF 2024 - Scaling Large Language Model Serving Infrastructure at Meta
At QCon SF 2024, Ye (Charlotte) Qi of Meta tackled the complexities of scaling large language model (LLM) infrastructure, highlighting the "AI Gold Rush" challenge. She emphasized efficient hardware integration, latency optimization, and production readiness, alongside Meta's innovative approaches like hierarchical caching and automation to enhance AI performance and reliability.
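Hierarchical caching here means checking a fast local tier before a slower shared one, and only running the model on a full miss. A minimal, generic sketch of the pattern, not Meta's implementation:

```python
class TwoTierCache:
    """Fast in-process tier backed by a slower shared tier (e.g. Redis)."""

    def __init__(self, shared_get, shared_put):
        self.local = {}
        self.shared_get, self.shared_put = shared_get, shared_put

    def get_or_compute(self, prompt, compute):
        if prompt in self.local:                 # tier 1: local memory
            return self.local[prompt]
        result = self.shared_get(prompt)         # tier 2: shared cache
        if result is None:
            result = compute(prompt)             # full miss: run the model
            self.shared_put(prompt, result)
        self.local[prompt] = result
        return result

# Usage with in-memory stand-ins for the shared tier:
shared = {}
cache = TwoTierCache(shared.get, shared.__setitem__)
print(cache.get_or_compute("hello", lambda p: p.upper()))
```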
-
Meta Releases NotebookLlama: Open-Source PDF to Podcast Toolkit
Meta has released NotebookLlama, an open-source toolkit designed to convert PDF documents into podcasts, providing developers with a structured, accessible PDF-to-audio workflow. As an open-source alternative to Google’s NotebookLM, NotebookLlama guides users through a four-step process that converts PDF text into audio content.
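As an illustration of the workflow's shape (the real toolkit is a series of notebooks built on Llama models plus a text-to-speech model), here is a hedged outline where `llm` is any prompt-in/text-out function:

```python
from typing import Callable
from pypdf import PdfReader  # pip install pypdf

def pdf_to_podcast_script(pdf_path: str, llm: Callable[[str], str]) -> str:
    """Steps 1-3 of a NotebookLlama-style pipeline."""
    # Step 1: extract raw text from the PDF.
    text = "\n".join(p.extract_text() or "" for p in PdfReader(pdf_path).pages)
    # Step 2: draft a two-host transcript from the extracted text.
    draft = llm("Turn this into a two-host podcast transcript:\n" + text)
    # Step 3: rewrite the draft for natural, engaging speech.
    # Step 4 (not shown): feed the final script to a text-to-speech model.
    return llm("Rewrite this transcript to sound natural when spoken:\n" + draft)
```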
-
Native Vector Support in Azure SQL Database in Public Preview
Azure SQL Database now supports native vector storage and processing, streamlining AI development by integrating vector search directly into SQL queries. Because embeddings live alongside relational data, applications avoid shuttling data to a separate vector store, which simplifies management and improves performance. The feature enables context-aware functionality, such as semantic search, for use cases ranging from e-commerce to healthcare.
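A short sketch of the preview syntax from Python via pyodbc; the connection string is a placeholder, and the VECTOR type and VECTOR_DISTANCE function are as documented for the public preview (a 3-dimensional vector keeps the example readable):

```python
import json
import pyodbc  # pip install pyodbc

conn = pyodbc.connect("DSN=azuresql")  # placeholder connection string
cur = conn.cursor()

# Vectors travel as JSON arrays and are cast to the native VECTOR type.
cur.execute("CREATE TABLE docs (id INT PRIMARY KEY, embedding VECTOR(3))")
cur.execute(
    "INSERT INTO docs VALUES (1, CAST(? AS VECTOR(3)))",
    json.dumps([0.1, 0.2, 0.3]),
)

# Nearest-neighbor search without leaving the database:
cur.execute(
    """SELECT TOP 3 id,
              VECTOR_DISTANCE('cosine', embedding, CAST(? AS VECTOR(3))) AS dist
       FROM docs ORDER BY dist""",
    json.dumps([0.12, 0.21, 0.29]),
)
print(cur.fetchall())
conn.commit()
```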
-
OSI Releases New Definition for Open Source AI, Setting Standards for Transparency and Accessibility
The Open Source Initiative (OSI) released Version 1.0 of its Open Source AI Definition (OSAID) after two years of development with contributions from global experts. The OSAID sets criteria defining open-source AI, aiming to bring clarity to the concept and establish benchmarks for transparency and accessibility in AI.
-
Amazon SageMaker JumpStart Expands Portfolio with Bria AI's Text-to-Image Models
Amazon Web Services has integrated Bria AI's latest text-to-image foundation models into Amazon SageMaker JumpStart, marking a significant expansion of its enterprise-grade generative AI capabilities. The addition includes three variants: Bria 2.3, Bria 2.2 HD, and Bria 2.3 Fast, each designed to address specific enterprise needs in visual content generation.
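JumpStart models deploy through the SageMaker Python SDK. A hedged sketch: the model_id below is a guess at JumpStart's catalog naming (look up the real identifier in the JumpStart UI), and Bria's models additionally require accepting the provider's license terms.

```python
from sagemaker.jumpstart.model import JumpStartModel  # pip install sagemaker

# Hypothetical model_id; find the actual one in the JumpStart catalog.
model = JumpStartModel(model_id="bria-2-3-text-to-image")
predictor = model.deploy(
    instance_type="ml.g5.2xlarge",  # illustrative GPU instance
    accept_eula=True,               # Bria models are license-gated
)

# Payload schema is model-specific; a prompt field is the usual shape.
response = predictor.predict({"prompt": "a studio photo of a ceramic mug"})
```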
-
Rhymes AI Unveils Aria: Open-Source Multimodal Model with Development Resources
Rhymes AI has introduced Aria, an open-source, natively multimodal Mixture-of-Experts (MoE) model that processes text, images, video, and code. In benchmarking tests, Aria outperformed other open models and was competitive with proprietary models such as GPT-4o and Gemini 1.5.
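The weights are on Hugging Face under rhymes-ai/Aria and load through transformers with the repo's custom code; a brief sketch (dtype and device placement are typical choices, not requirements):

```python
import torch
from transformers import AutoModelForCausalLM, AutoProcessor

model_id = "rhymes-ai/Aria"
processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    trust_remote_code=True,       # Aria ships its own modeling code
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
# The processor accepts interleaved text and images via its chat template.
```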
-
AI and ML Tracks at QCon San Francisco 2024 – a Deep Dive into GenAI & Practical Applications
At QCon San Francisco 2024, explore two AI/ML-focused tracks highlighting real-world applications and innovations. Learn from industry experts on deploying LLMs, GenAI, and recommendation systems, gaining practical strategies for integrating AI into software development.
-
NVIDIA Unveils NVLM 1.0: Open-Source Multimodal LLM with Improved Text and Vision Capabilities
NVIDIA unveiled NVLM 1.0, an open-source multimodal large language model (LLM) that performs strongly on both vision-language and text-only tasks. NVLM 1.0 shows improvements in text-based tasks after multimodal training, standing out among current models. The model weights are now available on Hugging Face, with the training code set to be released shortly.
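The released checkpoint is nvidia/NVLM-D-72B on Hugging Face and, per the model card, loads through transformers' remote-code path; a sketch (at 72 billion parameters it needs multi-GPU sharding, which device_map handles):

```python
import torch
from transformers import AutoModel, AutoTokenizer

model_id = "nvidia/NVLM-D-72B"
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModel.from_pretrained(
    model_id,
    trust_remote_code=True,       # NVLM ships custom modeling code
    torch_dtype=torch.bfloat16,
    device_map="auto",            # shard the 72B weights across GPUs
)
```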
-
Hugging Face Upgrades Open LLM Leaderboard v2 for Enhanced AI Model Comparison
Hugging Face has recently released Open LLM Leaderboard v2, an upgraded version of their benchmarking platform for large language models. Hugging Face created the Open LLM Leaderboard to provide a standardized evaluation setup for reference models, ensuring reproducible and comparable results.
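The v2 leaderboard runs on EleutherAI's lm-evaluation-harness, which groups its six benchmarks (MMLU-Pro, GPQA, MuSR, MATH, IFEval, and BBH) under a single task name. A sketch of reproducing a score locally; the task-group name and the small demo model are assumptions to verify against the harness docs:

```python
import lm_eval  # pip install lm-eval

results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=HuggingFaceTB/SmolLM-135M",  # any HF model id
    tasks=["leaderboard"],   # assumed v2 task group in the harness
    batch_size="auto",
)
print(results["results"])
```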
-
Meta Releases Llama 3.2 with Vision, Voice, and Open Customizable Models
Meta recently announced Llama 3.2, the latest version of Meta's open-source language model, which includes vision, voice, and open customizable models. It is the first multimodal version of the model, allowing users to interact with visual data, for example by identifying objects in photos or editing images with natural-language commands.
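The vision-instruct checkpoints load through transformers' Mllama classes; a brief sketch following the model card (the repo is license-gated on Hugging Face, and the image path is a placeholder):

```python
import torch
from PIL import Image
from transformers import AutoProcessor, MllamaForConditionalGeneration

model_id = "meta-llama/Llama-3.2-11B-Vision-Instruct"  # gated: accept the license first
processor = AutoProcessor.from_pretrained(model_id)
model = MllamaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

image = Image.open("photo.jpg")  # placeholder image
messages = [{"role": "user", "content": [
    {"type": "image"},
    {"type": "text", "text": "What objects are in this photo?"},
]}]
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(image, prompt, return_tensors="pt").to(model.device)
print(processor.decode(model.generate(**inputs, max_new_tokens=64)[0]))
```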