7 results for "efficient reasoning"
Tandem: Riding Together with Large and Small Language Models for Efficient Reasoning
Recent advancements in large language models (LLMs) have catalyzed the rise of reasoning-intensive inference paradigms, where models perform explicit step-by-step reasoning before generating final ans…
Constraints That Compute: A Unified Framework for Efficient Intelligence
This paper introduces a domain-agnostic framework that replaces brute-force computation with structural efficiency by translating systems into their intrinsic, dimensionless geometries. Validated acro…
The Power of Power Law: Asymmetry Enables Compositional Reasoning
Natural language data follows a power-law distribution, with most knowledge and skills appearing at very low frequency. While a common intuition suggests that reweighting or curating data towards a un…
Estimating Black-Box LLM Parameter Counts via Factual Capacity
Closed-source frontier labs do not disclose parameter counts, and the standard alternative -- inference economics -- carries 2×+ uncertainty from hardware, batching, and serving-stack assumptio…
How Do AI Agents Spend Your Money? Analyzing and Predicting Token Consumption in Agentic Coding Tasks
The wide adoption of AI agents in complex human workflows is driving rapid growth in LLM token consumption. When agents are deployed on tasks that require a significant amount of tokens, three questio…
Parameter Efficiency Is Not Memory Efficiency: Rethinking Fine-Tuning for On-Device LLM Adaptation
Parameter-Efficient Fine-Tuning (PEFT) has become the standard for adapting large language models (LLMs). In this work we challenge the widespread assumption that parameter efficiency equates to memory …
GBNF grammar tweak for faster Qwen3.6 35B-A3B and Qwen3.6 27B
Hi folks, Enjoy an optimised Qwen3.6 35B-A3B and Qwen3.6 27B for coding and general purpose - it's able to solve puzzles correctly more often too. The initial intent was to optimise the 35B-A3B reason…
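For context on the last result: GBNF is the grammar format llama.cpp uses for constrained decoding, restricting which tokens the model may emit at each step. The snippet above does not show the actual tweak, but a minimal illustrative grammar (the rule names and terminals here are hypothetical, not taken from the post) looks like this:

```gbnf
# Minimal GBNF sketch: force the model to emit a short labeled answer.
# "root" is the mandatory entry rule; quoted strings are literal terminals.
root   ::= "Answer: " line
line   ::= [a-zA-Z0-9 .,]+ "\n"
```

Tightening a grammar like this narrows the set of valid next tokens, which can both speed up sampling and reduce malformed outputs — plausibly the mechanism behind the speedup and accuracy gains the post describes.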