6 results for "small language models"
Tandem: Riding Together with Large and Small Language Models for Efficient Reasoning
Recent advancements in large language models (LLMs) have catalyzed the rise of reasoning-intensive inference paradigms, where models perform explicit step-by-step reasoning before generating final ans…
Granite 4.1: IBM's 8B Model Matching 32B MoE
IBM just released Granite 4.1, a family of open-source language models built specifically for enterprise use. Three sizes, Apache 2.0 licensed, and trained on 15 trillion tokens with a level of pipelin…
Beyond 80/20: High-Entropy Minority Tokens Drive Effective RL for LLM Reasoning
Reinforcement Learning with Verifiable Rewards (RLVR) has emerged as a powerful approach to enhancing the reasoning capabilities of Large Language Models (LLMs), while its mechanisms are not yet well …
SwarmDrive: Semantic V2V Coordination for Latency-Constrained Cooperative Autonomous Driving
Cloud-hosted LLM inference for autonomous driving adds round-trip delay and depends on stable connectivity, while purely local edge models struggle under occlusion. We present SwarmDrive, a semantic V…
STELLAR-E: a Synthetic, Tailored, End-to-end LLM Application Rigorous Evaluator
The increasing reliance on Large Language Models (LLMs) across diverse sectors highlights the need for robust domain-specific and language-specific evaluation datasets; however, the collection of such…
Speculative decoding with Gemma-4-31B + Gemma-4-E2B enables 120–200 tok/s output speed for specific tasks
So for my project, up until now I was using either Gemini 3 / 2.5 Flash or Flash-lite. None of my use cases are agentic, just LLM workflows for atomic tasks like extracting references from the law, c…
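The post doesn't show its setup, but a minimal sketch of this kind of large-target/small-draft speculative decoding is possible with Hugging Face transformers' assisted-generation hook (`assistant_model`). The checkpoint ids below are placeholders following the post's model names, not confirmed repo ids, and the prompt loosely mirrors the post's reference-extraction task:

```python
# Sketch of speculative (assisted) decoding: a small draft model proposes
# tokens, the large target model verifies them in one forward pass.
# Model ids are hypothetical placeholders based on the post's naming.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

TARGET_ID = "google/gemma-4-31b"  # placeholder: large target model
DRAFT_ID = "google/gemma-4-e2b"   # placeholder: small draft model
# Note: assisted generation expects draft and target to share a tokenizer
# (same model family), as the Gemma pairing in the post would.

tokenizer = AutoTokenizer.from_pretrained(TARGET_ID)
target = AutoModelForCausalLM.from_pretrained(
    TARGET_ID, torch_dtype=torch.bfloat16, device_map="auto"
)
draft = AutoModelForCausalLM.from_pretrained(
    DRAFT_ID, torch_dtype=torch.bfloat16, device_map="auto"
)

prompt = "Extract every legal reference cited in the following passage:\n..."
inputs = tokenizer(prompt, return_tensors="pt").to(target.device)

# With do_sample=False the output is identical to greedy decoding with the
# target alone; the speedup comes from the target accepting draft tokens
# in batches whenever the two models agree.
out = target.generate(
    **inputs,
    assistant_model=draft,
    max_new_tokens=256,
    do_sample=False,
)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```

Whether this reaches the 120–200 tok/s the post reports depends on hardware and on how often the draft model's proposals are accepted; repetitive, template-like outputs such as extracted references are exactly where acceptance rates, and hence speedups, tend to be highest.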