GeistHaus

Quantization · Hugging Face

huggingface.co

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

10 pages link to this URL
AI Engineering Primer

Principal AI Architect. Creator of open-strix, a harness for building agent teams. Writing about AI architecture, stateful agents, and what happens when you give AI memory.

0 inbound links article en
From Classical ML to DNNs and GNNs for Real-Time Financial Fraud Detection

Financial transaction fraud is a pervasive problem costing institutions and customers billions annually. This survey reviews the current state of the art in real-time transaction fraud detection, spanning both academic research and industry-adopted solutions.

0 inbound links article en financial fraud, fraud detection, machine learning, deep learning, survey, transaction monitoring
Ubuntu Summit 25.10: Personal Highlights

I recently had the privilege of attending the Ubuntu Summit 25.10, an event hosted by Canonical to celebrate the release of Ubuntu 25.10 and to provide a platform for open source projects from around the globe to showcase their work. This post includes some personal highlights and a brief summary of some of the talks.

0 inbound links article en Blog Ubuntu, Blog, Canonical, Linux, Nix, Nvidia, Profiling, CUDA, AI, Snaps, WSL, Design
Quantization, Floating Points and TurboQuant

A lot of effort goes into making LLM inference cheaper and faster. Quantization is the standard way to do this: we reduce a model’s size by representing its parameters with fewer bits, so they take up less memory and move faster through the memory hierarchy. The progression from 32-bit -> mixed precision -> 16-bit -> 8-bit -> 4-bit formats has been one of the most impactful practical developments in LLM inference.

0 inbound links article en llm, quantization, ml, inference
LLM Quantization and NVFP4

With the rise of large language models and the desire to run them more cheaply and efficiently, the concept of quantization has gained a lo...