Jessie A Ellis
Feb 27, 2026 18:05
NVIDIA provides free GPU-accelerated endpoints for Alibaba's 397B-parameter Qwen3.5 vision-language model, enabling developers to build multimodal AI agents.
NVIDIA has rolled out free GPU-accelerated endpoints for Alibaba's Qwen3.5 vision-language model, giving developers immediate access to the 397-billion-parameter system via Blackwell-architecture hardware. The move positions both tech giants to capture the growing market for multimodal AI agents capable of understanding and navigating user interfaces.
The Qwen3.5 model, which Alibaba released on February 16, 2026, represents a significant architectural shift in large language models. Despite its massive 397B total parameters, only 17 billion activate per forward pass, a 4.28% activation rate achieved through a hybrid mixture-of-experts (MoE) design combined with Gated Delta Networks. This efficiency translates into real cost savings: Alibaba claims the system runs 60% cheaper and handles large workloads eight times more efficiently than its predecessor.
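The quoted activation rate follows directly from the parameter counts, as a quick sanity check shows:

```python
# Sanity check of the sparse-activation figure reported for Qwen3.5.
TOTAL_PARAMS_B = 397.0   # total parameters, in billions
ACTIVE_PARAMS_B = 17.0   # parameters activated per forward pass, in billions

activation_rate = ACTIVE_PARAMS_B / TOTAL_PARAMS_B
print(f"{activation_rate:.2%}")  # 4.28%
```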
Technical Specs Worth Noting
The model supports an input context length of 256K tokens, extensible to 1 million, enough to process roughly two hours of video content natively. It handles 200+ languages and runs 512 experts per layer, with 11 experts (10 routed plus 1 shared) activated per token across 60 layers.
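The "10 routed plus 1 shared" split is standard top-k MoE routing: a gating network scores all experts per token, the top 10 are selected, and one shared expert is always active. A minimal sketch of that selection step, with hypothetical router scores (the real gating network and expert weights are not public details of this kind):

```python
import numpy as np

NUM_EXPERTS = 512  # experts per layer, per the article
TOP_K = 10         # routed experts selected per token
# Plus 1 always-on shared expert -> 11 experts active per token.

def route(router_logits: np.ndarray) -> np.ndarray:
    """Return the indices of the top-k routed experts for one token."""
    top = np.argpartition(router_logits, -TOP_K)[-TOP_K:]
    return np.sort(top)

# Hypothetical gating scores for a single token.
rng = np.random.default_rng(0)
logits = rng.normal(size=NUM_EXPERTS)

routed = route(logits)
active = len(routed) + 1  # +1 for the shared expert
print(active)  # 11
```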
Developers can access Qwen3.5 through NVIDIA's build.nvidia.com platform with free registration in the NVIDIA Developer Program. The API follows OpenAI-compatible conventions, making integration straightforward for teams already working with similar tool-calling patterns.
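In practice, "OpenAI-compatible" means the endpoint accepts standard chat-completions payloads, including vision inputs as image-URL content parts. A sketch of such a request follows; the model identifier is a hypothetical placeholder, so check build.nvidia.com for the exact id NVIDIA publishes:

```python
import json

# OpenAI-style chat-completions payload with a vision content part.
# The model id below is an assumption, not confirmed by the article.
request = {
    "model": "qwen/qwen3.5",  # hypothetical model id
    "messages": [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe this screenshot."},
                {
                    "type": "image_url",
                    "image_url": {"url": "data:image/png;base64,..."},
                },
            ],
        }
    ],
    "max_tokens": 512,
}

# With the `openai` Python client, this payload maps onto
# client.chat.completions.create(**request) once the client's
# base_url points at NVIDIA's hosted endpoint.
print(json.dumps(request, indent=2)[:60])
```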
Production Deployment Options
For enterprises moving beyond experimentation, NVIDIA NIM packages the model as containerized inference microservices. These can run on-premises, in cloud environments, or across hybrid deployments. The NeMo framework provides fine-tuning capabilities for domain-specific applications; NVIDIA specifically highlights a medical visual QA tutorial demonstrating training on a radiological dataset.
Alibaba has continued expanding the Qwen3.5 family since the initial launch. On February 24, the company pushed out three additional variants: Qwen3.5-122B-A10B, Qwen3.5-35B-A3B, and Qwen3.5-27B, offering smaller-footprint options for different deployment scenarios.
Alibaba, trading at a market cap of around $372 billion as of February 27, has positioned Qwen3.5 against GPT-5.2, Claude Opus 4.5, and Gemini 3 Pro on benchmark performance. The open-weight models remain available on Hugging Face Hub and ModelScope for developers who prefer self-hosting over NVIDIA's managed endpoints.
Image source: Shutterstock
