@ajkerchum

Alexander Kerchum

Joined September 2021

Senior Software Engineer at Upstart

Alexander Kerchum's blogs

Alexander Kerchum · blog.kerchum.dev · 4 posts

Recently published

Alexander Kerchum · blog.kerchum.dev · Jun 5 · 12 min read

Lessons From the Bottom of the Stack: Shipping a Quant

The SCLP compression algorithm — palette the exponents, sidecar the outliers, pack the rest — was a week or so of prototyping. The two posts before this one covered it end to end. This post is about t

Alexander Kerchum · blog.kerchum.dev · Jun 3 · 9 min read

From 8 Bits to 4: Sidecar, MoE, and the imatrix Trick That Worked

Last time we cut BF16 weights in half by treating the exponent as a 16-entry palette instead of an 8-bit field. SCLP8: 7.9 GB instead of 15.0, perplexity slightly better than the original, token gener

Alexander Kerchum · blog.kerchum.dev · May 29 · 9 min read

LLMs Use Just 16 of 256 Exponents — So We Compressed the Rest Away

Most people compressing LLM weights are fighting the same war: squeeze 7 billion floats into less memory without wrecking the model. The standard weapons are quantization schemes — map each float to a

Alexander Kerchum · blog.kerchum.dev · Nov 24, 2021 · 1 min read

How to move a directory from one git repo to another (or new) without losing history

Make a copy of the repo: git clone dirtySourceRepo newSourceRepo
OR clone from the actual git repo and prevent pushes: git remote set-url --push origin no_push
Make sure to check out the correct branch before the next step. Cloning from another local directory al...
