#large-language-models — Tagged Stories

Every story in the WeSearch catalog tagged with #large-language-models, chronological, with view counts. Subscribe to the per-tag RSS feed to follow this topic in your reader of choice.

58 stories tagged with #large-language-models, in publish-time order across the WeSearch catalog. Tag pages update as new stories ingest.

⌘ RSS feed for this tag → or search "Large Language Models"

RELATED TAGS

#ai5 #ml2 #azure2 #open-source1 #reinforcement-learning1 #productivity1 #hacker-news1 #hn1 #gpt-5-51 #opus-4-71 #opus-4-61 #gpt-5-41

ARXIV.ORG

Rethinking Uncertainty Evaluation in Large Language Models

arXiv:2607.19367v1 Announce Type: new Abstract: Calibration is the primary criterion for evaluating LLM confidence, but it is insufficient: it admits trivially incoherent estimator…

22 views · Thu, 23 Jul 2026 04:00:00 GMT

#rethinking #uncertainty #evaluation

ARXIV.ORG

Logic-Guided Data Extraction with Answer Set Programming and Large Language Models

arXiv:2607.19365v1 Announce Type: new Abstract: When Large Language Models (LLMs) are used for semantic data extraction from unstructured text, producing candidate relational facts…

20 views · Thu, 23 Jul 2026 04:00:00 GMT

#logic-guided #data #extraction

ARXIV.ORG

Statistically Grounded Sparse-Feature Interventions for Activation-Space Control in Large Language Models

arXiv:2607.19364v1 Announce Type: new Abstract: Activation steering offers a lightweight alternative to fine-tuning for behavioral control of large language models, but SAE-based s…

22 views · Thu, 23 Jul 2026 04:00:00 GMT

#statistically #grounded #sparse-feature

ARXIV.ORG

Information Discernment in Large Language Models

arXiv:2607.19355v1 Announce Type: new Abstract: LLMs are increasingly used with external knowledge sources like the internet. Do they weigh information appropriately -- updating mo…

18 views · Thu, 23 Jul 2026 04:00:00 GMT

#information #discernment #large

TOWARDS DATA SCIENCE

Loop Engineering with Adaptive Parsing in Action: Parsing Flat Tables with Azure and Figures with a Vision LLM

Enterprise Document Intelligence [Vol.1 #10B] - The LLM as last line of defence, then two real escalations walked end to end: a flat table to Azure, a figure to a vision model The …

30 views · Mon, 20 Jul 2026 15:00:00 GMT

#document intelligence #adaptive parsing

MIT TECHNOLOGY REVIEW

GPT-Red: an LLM super-hacker OpenAI built to make its models safer

Exclusive: The firm says it wants to future-proof its safety procedures and stay ahead of human attackers.…

Large Language Models coverage.

Rethinking Uncertainty Evaluation in Large Language Models

Logic-Guided Data Extraction with Answer Set Programming and Large Language Models

Statistically Grounded Sparse-Feature Interventions for Activation-Space Control in Large Language Models

Information Discernment in Large Language Models

Loop Engineering with Adaptive Parsing in Action: Parsing Flat Tables with Azure and Figures with a Vision LLM

GPT-Red: an LLM super-hacker OpenAI built to make its models safer

Augmenting Fundamental Analysis with Large Language Models: A RAG-Based System for Generating Investor Briefs

Integrating Large Language Models and Graph Convolutional Networks for Semi-Supervised Image Classification

Accelerating GPU Inference of Large Language Models with Moderately Unstructured Sparse Weight Matrices

A Unified Approach to Interpreting Knowledge Distillation for Large Language Models via Interactions

Narration-of-Thought: Inference-Time Scaffolding for Defeasible Ethical Reasoning in Large Language Models

Code-on-Graph: Iterative Programmatic Reasoning via Large Language Models on Knowledge Graphs

From Answers to States: Verifiable Process-Level Evaluation of Chemical Reasoning in Large Language Models

ClinicalMC: A Benchmark for Multi-Course Clinical Decision-Making with Large Language Models

The Shadow Price of Reasoning: Economic Perspective on Optimal Budget Allocation for LLMs

ChatHealthAI: Aligning Electronic Health Record Representations with Large Language Models for Grounded Clinical Reasoning

Visual Graph Scaffolds for Structural Reasoning in Large Language Models

Why Are Large Language Models So Terrible at Video Games?

Heuristic Parasites: A Behavioral Taxonomy of Recurrent Distortion Patterns in Large Language Models (Full System) V2

✨📊 🧠 The Ultimate Visual Guide to Large Language Models (LLMs)

Pretraining Data Exposure in Large Language Models: A Survey of Membership Inference, Data Contamination, and Security Implications

Generating Robust Portfolios of Optimization Models using Large Language Models

MedGuideX: Internalizing Decision Logic from Executable Guidelines into Large Language Models for Clinical Reasoning

Reasoning, Code, or Both? How Large Language Models Handle Variations in Math Questions

Emotional intelligence in large language models is fragmented across perception, cognition, and interaction

Summoning the Oracle to Slay It: Mitigating Look-Ahead Bias in Financial Backtesting with Large Language Models

Jailbreak to Protect: Buffering and Reinforcing via Temporary Jailbreaking for Safe Fine-Tuning in Large Language Models

PALoRA: Projection-Adaptive LoRA for Preserving Reasoning in Large Language Models

Distilling Game Code World Model Generation into Lightweight Large Language Models

HyperGuide: Hyperbolic Guidance for Efficient Multi-Step Reasoning in Large Language Models

Breaking the Chains of Probability: Neutrosophic Logic as a New Framework for Epistemic Uncertainty in Large Language Models

Confidence Calibration in Large Language Models

MadEvolve: Evolutionary Optimization of Trading Systems with Large Language Models

How Far Will They Go? Red-Teaming Online Influence with Large Language Models

Evaluating Large Language Models in a Complex Hidden Role Game

GENSTRAT: Toward a Science of Strategic Reasoning in Large Language Models

Machine-Learning-Enhanced Non-Invasive Testing for MASLD Fibrosis: Shallow-Deep Neural Networks Versus FIB-4, Tabular Foundation Models, and Large Language Models

DEL: Digit Entropy Loss for Numerical Learning of Large Language Models

PlanningBench: Generating Scalable and Verifiable Planning Data for Evaluating and Training Large Language Models

DarkLLM: Learning Language-Driven Adversarial Attacks with Large Language Models

BLINKG: A Benchmark for LLM-Integrated Knowledge Graph Generation

Can Large Language Models Revolutionize Survey Research? Experiments with Disaster Preparedness Responses

TeleCom-Bench: How Far Are Large Language Models from Industrial Telecommunication Applications?

Episodic-Semantic Memory Architecture for Long-Horizon Scientific Agents

CyberCorrect: A Cybernetic Framework for Closed-Loop Self-Correction in Large Language Models

ChemVA: Advancing Large Language Models on Chemical Reaction Diagrams Understanding

PersonaArena: Dynamic Simulation for Evaluating and Enhancing Persona-Level Role-Playing in Large Language Models

Sketch Then Paint: Hierarchical Reinforcement Learning for Diffusion Multi-Modal Large Language Models

How Large Language Models Are Reshaping the Trial Lifecycle

ASRU: Activation Steering Meets Reinforcement Unlearning for Multimodal Large Language Models

Retrieval-Augmented Large Language Models for Schema-Constrained Clinical Information Extraction

Zero-Shot Goal Recognition with Large Language Models

Solvita: Enhancing Large Language Models for Competitive Programming via Agentic Evolution

LLMs as Linguistic Probes: A Graduate Student's Guide to Advanced Syntax, Semantics, and Efficient Fine-Tuning

Ask HN: What LLM models are you using and why?

The LLM Failure Atlas: A Structural Analysis of Failure Modes in Large Language Models (Free PDF)

Δ-Mem: Efficient Online Memory for Large Language Models

InclusionAI/Ring-2.6-1T is now open-sourced

Browse more