WeSearch
Hub / Tags / Pytorch
TAG · #PYTORCH

Pytorch coverage.

Every story in the WeSearch catalog tagged with #pytorch, chronological, with view counts. Subscribe to the per-tag RSS feed to follow this topic in your reader of choice.

21 stories tagged with #pytorch, in publish-time order across the WeSearch catalog. Tag pages update as new stories ingest.

⌘ RSS feed for this tag →   or   search "Pytorch"

RELATED TAGS
#ml4#deep-learning3#ai3#debugging2#cuda2#programming2#machinelearning2#nvidia2#rocm1#llm1#amd1#nan-detection1
R/SPACE

I rewrited WarpFactory into PyTorch so anyone can simulate Warp Drives for free

15 views ·
GITHUB

Holonomy_lib, exact non Euclidean geometry primitives for PyTorch

Research-grade PyTorch math: differential geometry, spectral graph theory, discrete Ricci flow, simplicial topology, persistent homology, cellular sheaves, SO(3) Lie primitives, in…

13 views ·
#mathematics#geometry#machine learning
R/MACHINELEARNING

What I learned building a debugger for PyTorch training loops and how it changed how I think about failure diagnosis [D]

13 views ·
HUGGINGFACE

I made a CPU only spiking neuron network lib that comes pretty close to PyTorch

We’re on a journey to advance and democratize artificial intelligence through open source and open science.…

15 views ·
#technology#artificial intelligence#machine learning
R/MACHINELEARNING

Profiling PyTorch training without accidentally stalling the GPU [D]

20 views ·
DEV.TO (TOP)

Prefix caching in vLLM under multi-tenant agent traffic

TL;DR: We turned on vLLM's prefix cache for our agent workloads at Nexus Labs and watched TTFT drop...…

16 views ·
#mlops#infrastructure
R/LINUXQUESTIONS

Need a Linux distro to do some work with OpenCV + PyTorch but unsure which to pick based off their suggestions on the PyTorch website. I have some Linux experience but certainly not an expert or anything

19 views ·
SPRINGER

Characterization of machine learning compilers for LLM inference on NVIDIA GPUs

AI inference is conflicted between Performance, developer Productivity, and device Portability–the P3 problem. Machine learning compilers (MLCs) aim to address this, but their ecos…

17 views ·
#machine learning#nvidia#artificial intelligence
R/MACHINELEARNING

I built a Mamba1 variant I call SM1 with d_state=1 that runs on Blackwell in pure PyTorch [P]

20 views ·
PYTORCH

PyTorch 2.12 Release

19 views ·
#technology#software
GITHUB

End of a Semester

A story on how I plan to spend my Holiday…

14 views ·
#programming#ai#clojure
DEV.TO (TOP)

Why your diffusion model is slow at batch size 1 (and what actually helps)

TL;DR: Single-image diffusion inference is bottlenecked by kernel launch overhead and attention...…

13 views ·
#machinelearning#computervision
PYTORCH LANDSCAPE

PyTorch Landscape

22 views ·
R/STABLEDIFFUSION

Installing ComfyUI + PyTorch for AMD ROCm 7.2, using official drivers.

14 views ·
DEV.TO (TOP)

Your PyTorch Model File Can Execute Arbitrary Code — Here's How I Built a Scanner to Detect It

Every time you run torch.load("model.pt"), you're executing arbitrary Python code. Not "could...…

16 views ·
#security#machinelearning
PYTORCH

Running PyTorch Models on Apple Silicon GPUs with the ExecuTorch MLX Delegate

17 views ·
#technology#machine learning#apple silicon
GITHUB

ImpactArbiter – A PyTorch autograd trap for LLM memory bugs

Contribute to msunda17/impactarbiter-cli development by creating an account on GitHub.…

12 views ·
#technology#artificial intelligence#software development
HACKER NEWS (NEWEST)

Softmax in front of CrossEntropyLoss: 16 other bugs PyTorch won't catch

A walkthrough of the 17-rule design-time linter inside Neurarch: what each rule catches, why it matters, and where static analysis stops being useful for neural networks.…

16 views ·
#machine learning#neural networks
PROBABLY DANCE

Programmers Spend Their Time – Probably Dance

I submitted a tiny patch to flash attention. The necessary typing for the change takes less ten seconds, but the overall change took more than ten hours So where does the time go? …

15 views ·
#software development#debugging#cuda
TOWARDS DATA SCIENCE

PyTorch NaNs Are Silent Killers — So I Built a 3ms Hook to Catch Them at the Exact Layer

NaNs don’t crash your training — they quietly destroy it. After losing hours to a silent failure in a ResNet training run, I built a lightweight detector that pinpoints the exact l…

14 views ·
#deep learning#debugging
GITHUB

Porting a Scratch-Built 500M LLM Training Pipeline to ROCm on Strix Halo

A lightweight transformer language model built from scratch in PyTorch, trained on a single consumer GPU with a full pipeline for data processing, pretraining, and instruction tuni…

16 views ·
#machine learning#rocm#llm