GeistHaus
log in · sign up

the bug that taught me more about PyTorch than years of using it

elanapearl.github.io

a loss plateau that looked like my mistake turned out to be a PyTorch bug. tracking it down meant peeling back every layer of abstraction, from optimizer internals to GPU kernels.

1 page links to this URL