60 stories tagged with #model, in publish-time order across the WeSearch catalog. Tag pages update as new stories are ingested.
Caisi Evaluation of DeepSeek V4 Pro
World Cup’s ‘sexiest fan’ captures attention of Formula 1 cameraman
The World Cup's "sexiest fan" is still turning heads.…
Israel Said It's Applying the Gaza Model in Lebanon
Satellite images, photos and videos show the scope of widespread demolitions in southern Lebanon.…
VulkanForge – 14 MB Vulkan LLM engine that runs native FP8 models on AMD (Rust)
Inference in Rust and Vulkan. Contribute to maeddesg/vulkanforge development by creating an account on GitHub.…
CSPNet Paper Walkthrough: Just Better, No Tradeoffs
A review of the Cross-Stage Partial Network paper — and a from-scratch PyTorch implementation The post CSPNet Paper Walkthrough: Just Better, No Tradeoffs appeared first on Towards…
May MacBook Pro deals deliver prices as low as $1,949 on M5 Pro & M5 Max models
Apple retailers have issued steeper discounts on the MacBook Pro for May, resulting in record-low prices on several M5 Pro and M5 Max 14-inch and 16-inch configurations. Grab steep…
Connecticut lawmakers approve bill for cell phone ban in schools — but critics argue that having different rules for adults and students is ‘not good role modeling at all’
An evaluation by NIST's CAISI says DeepSeek V4 Pro lags behind leading US AI models by about eight months and is the most capable Chinese AI model to date (NIST)
NIST : An evaluation by NIST's CAISI says DeepSeek V4 Pro lags behind leading US AI models by about eight months and is the most capable Chinese AI model to date — In April 2026, t…
Show HN: Apple's Sharp Running in the Browser via ONNX Runtime Web
so for coding which model do we use now?
Should I use gpt-5.5 or codex/gpt-5.3 ?? I'm just coding…
Best Local Vision-Language Models?
What are, in your opinion, the best local vision models to get a good description of a picture for a 16 GB GPU? At the moment I use qwen3 vl 8b thinking q8, but I wonder if there is a …
Which model should I try?
In my current workflow (coding in python/c++ and technical reports) I mostly use Qwen3.6 27B and Gemma4 31B. In the past I tried other models like Deepseek with decent results but …
Maybe the useful unit is not a model, but a model-context pair.
I’ve been thinking about two separate observations from recent AI workflows. First: Different models can be useful because they see the same problem differently. For example, one m…
Inference Scaling (Test-Time Compute): Why Reasoning Models Raise Your Compute Bill
Why reasoning models dramatically increase token usage, latency, and infrastructure costs in production systems The post Inference Scaling (Test-Time Compute): Why Reasoning Models…
How does parameter "shift" in model sampling affect images and what are good settings for Image models.
Curious how the shift setting could be used. Also, please share optimal shift settings and sampler combinations.…
Open Weights Models Hall of Fame
I read a lot of "whengguf" type posts. I think we should sometimes stop and be grateful. I want to say big thanks to all of the people and companies who gave us so much fun and pro…
Jerry Seinfeld Drops a Truth Bomb About Electric Cars
What’s the deal with electric cars? Am I right?…
Tesla starts selling Chinese-made Model 3s in Canada at the EV's lowest price ever
The cars made at the Giga Shanghai factory start at $39,490 CAD, or roughly $29,000 in USD for a Model 3 Premium Rear-Wheel Drive variant.…
AMD's GAIA Defaults To Better Model, Continued Improvements For Local AI
AMD software engineers on Friday released a new version of GAIA "Generative AI Is Awesome" as their open-source software for Windows and Linux leveraging the Lemonade SDK and aimin…
Show HN: State of the Art of Coding Models, According to Hacker News Commenters
Hello HN, I was away from my computer for two weeks, and after coming back and reading the latest discussions on HN about coding assistants (models, harnesses), I felt very out of …
What are tarpit ideas in the AI era?
For those unfamiliar with the term, tarpit ideas are ideas that always attract lots of founders but never really work. They usually sound amazing on paper. Some examples from befor…
GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests
New results suggest Mythos' cyber threat isn't "a breakthrough specific to one model."…
Study: AI models that consider user's feeling are more likely to make errors
Overtuning can cause models to "prioritize user satisfaction over truthfulness."…
Refusal in Language Models Is Mediated by a Single Direction
Tesla owner won $10k in court for Tesla's FSD lies. Tesla is still fighting him
Lego is giving away N-1 Starfighter models for free ahead of Star Wars Day — how to claim yours this weekend
How to get Lego for free. Lego is giving away free N-1 Starfighter models on Star Wars Day.…
A Common Proof of the Riemann Hypothesis and the Collatz Conjecture
Result. The Riemann Hypothesis and the Collatz Conjecture are proved as two specialisations of a single structural theorem: in a closed witness system with positive drift, every tr…
AI Self-preferencing in Algorithmic Hiring: Empirical Evidence and Insights
Spirit Airlines built a model the industry copied. Then it collapsed - AP News
Emergent Strategic Reasoning Risks in AI: A Taxonomy-Driven Evaluation Framework
As reasoning capacity and deployment scope grow in tandem, large language models (LLMs) gain the capacity to engage in behaviors that serve their own objectives, a class of risks w…
Open-source diagnostic for AI misalignment. Model agnostic, industry agnostic. Free to Run.
We shipped iFixAi earlier this week. An open-source diagnostic for AI misalignment. 32 tests across fabrication, manipulation, deception, unpredictability, and opacity. Open source…
Musk v. Altman week 1: Elon Musk says he was duped, warns AI could kill us all, and admits that xAI distills OpenAI’s models - MIT Technology Review
Where can I research and find out how to integrate or build an actual tried and tested hawkeye (ball detection and tracking/prediction model) that works?
Hi fellow devs, I've been trying for months to build an actual hawkeye (ball detection and tracking/prediction model) that works on mobile platform. I initially tried with YOLO inf…
Open weight (and closed) Models with character sheet inputs
Now that we have some open weight models available to us that work with character sheet inputs, here's a test across the models I have access to, open and closed to see how they co…
Anthropic Won't Let You Use Their Best Model. Prediction Markets Are Trying Anyway.
Been watching AI prediction markets since they got liquid earlier this year. The thing I didn't see coming is that we now have a real gap between "best model that exists" and "best…
Some Longcat-Image-Edit samples: a limited, yet very useful model.
All the reference faces were made with Flux 1 Dev. The first three samples are just inpainting, while the last three samples were reference + prompt. Inpainting was a little struggl…
Same Prompt on Open Source Models: Z-Image Base & Distilled, Klein 9b & 4b, ERNIE image
Same Prompt for each: Create a funny, polished, wide landscape digital illustration in a colorful comic-meets-3D style. Taylor Swift is sitting at a glowing computer desk on a Frid…
Why is there still no realistic voice model despite huge advancements in AI?
OpenAI teased an extremely realistic model a long time ago, but it has not released it. The current voice chat is great for trivia, but it is too robotic for everyday conversations…
Most accurate AI model for generating videos from images while preserving text?
Hi everyone, I'm looking for the most accurate AI model or tool that can generate videos from images while preserving any text in the images exactly as it is. In my experience, man…
Have Qwen said anything about further Qwen 3.6 models?
Have Qwen hinted at whether other models (9B, 122B, 397B) would be getting the 3.6 treatment? Or have they in any way confirmed or hinted at "this is it"? Genuinely curious if I mi…
What about a website to share our model settings and optimisations ?
Hello folks, I'm thinking about creating a website to share our settings and configurations for our beloved models according to the hardware we have. We could share our setups and …
Benchmark for SageAttention kernels using real attention shapes logged from ComfyUI models (image / video / audio)
What this is — and what it is not This is not a benchmark of how fast a model generates an image or video. No model weights, no inference pipeline. The benchmark runs on randomly g…
[RELEASE] - Finally, my first TTS model is out! 🎙️ Flare-TTS 28M
Hey r/LocalLLaMA ! I am back with a new model, and it's something special today 😃 It's Flare-TTS 28M, my first text to speech (TTS) model trained completely from scratch on a sing…
Sometimes the useful difference is not between models, but between contexts.
I accidentally discovered something useful while comparing GPT sessions. One session knew my project context. The other knew nothing about me. The first helped me build faster. The…
New to local image generation. What model/workflow should I start with?
I’ve been browsing this subreddit and the images here are really impressive.…
Is it over for locally hosted i2v models ?
I started playing with comfyui and mostly wan 2.2 back in October. Am I right in thinking that no new models of that type have been released since ? It seems all the new wan models…
Looking for uncensored image gen model
Hi guys, so I've been looking for a fully uncensored model to generate images with. So that model has to support at least reference images because I'm planning to build a custom in…
OpenAI wants to put its most powerful model at all levels of government to fight hackers - WLFI News 18
What is the best all-round local model?
Not for agentic coding but for help in conversational style write-ups like markdown documentation (not code-related). Constraints are 64GB unified memory, obviously local.…
Beyond Memorization: Do Larger Models Know More, or Just Better?
Just read 2 papers: 1. Incompressible Knowledge Probes 2. Densing Law of LLMs densing laws suggest for every 3 months you will get a new model that does same things in half the par…
Put multiple AI models to work at once with this $80 tool
Upgrade your AI workflow with a lifetime subscription to 1min.AI's Advanced Business Plan.…
Secret Service 'model worked' during WHCA Dinner shooting but 'luck' played a role, experts say
Questions swirl over Secret Service security after an armed gunman allegedly attempted to assassinate Trump at the White House Correspondents' Association Dinner.…
Open source ballistic simulator with NASA SRTM terrain masking (Python/C#)
Contribute to InsaneInfinity/Balistic development by creating an account on GitHub.…
Apple’s Mac mini now has a higher starting price, as it discontinues the entry-level model and slides down to the mid-range
Apple's entry-level $599 Mac mini with 256GB of storage and 16GB of RAM is no longer available.…
Just like the MacBook Neo, Apple might serve another pricing slam with the iPhone 18 Pro
iPhone 18 Pro pricing leak suggests Apple may widen the gap between Pro and standard models, following a strategy similar to its MacBook Neo approach.…
I built a minimal asyncpg wrapper that gives you Pydantic type safety without the ORM overhead. You write raw SQL, you get typed models back.
Claude Sonnet 4.6 model hallucinates
DeepSeek Finally "Opens Its Eyes": Multimodal Image Recognition Goes Live, the Last Missing Piece for Chinese LLMs
On April 29, 2026, DeepSeek officially launched the gray-scale testing of its "Image Recognition...…