Large Language Models Quantization

Alibaba Expands Qwen Lineup with New Mid-Sized AI Models

Alibaba’s Qwen AI team has introduced a new Qwen3.5 Medium model series, adding fresh competition to the large language model ...

11h

Alibaba's new open source Qwen3.5-Medium models offer Sonnet 4.5 performance on local computers

This leap is made possible by near-lossless accuracy under 4-bit weight and KV cache quantization, allowing developers to process massive datasets without server-grade infrastructure.

SiliconANGLE

Meta debuts slimmed-down Llama models for low-powered devices

Meta Platforms Inc. is striving to make its popular open-source large language models more accessible with the release of “quantized” versions of the Llama 3.2 1B and Llama 3B models, designed to run ...

Semiconductor Engineering

The On-Device LLM Revolution

Users running a quantized 7B model on a laptop expect 40+ tokens per second. A 30B MoE model on a high-end mobile device ...

SpaceNews

Boeing demonstrates large language model for space-grade hardware

In recent ground tests, Boeing engineers demonstrated that a large language model running on commercial off-the-shelf hardware could examine telemetry and report in natural language on the health of a ...

VentureBeat

Here are 3 critical LLM compression strategies to supercharge AI performance

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More In today’s fast-paced digital landscape, businesses relying on AI face ...

Devdiscourse

Privacy-preserving AI becomes critical for Industry 4.0

Industry 4.0 depends on continuous data exchange between sensors, machines, production lines, and enterprise systems, but much of this data cannot be centralized due to privacy, security, and ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results