Tag feed

#paper-poc

3 posts · 0 followers

Jangwook Kim · effloow.hashnode.dev · May 12 · 10 min read

MemMachine: Ground-Truth Memory for AI Agents

Every time an agent summarizes a conversation to save memory, it loses information. That trade-off has been accepted as unavoidable — LLMs produce long outputs, context windows are finite, and token costs are real. MemMachine, presented in arXiv pape...

Jangwook Kim · effloow.hashnode.dev · May 10 · 9 min read

DRA-GRPO: Fixing Diversity Collapse in Reasoning Models

Group Relative Policy Optimization (GRPO) became the dominant approach for training reasoning models after DeepSeek-R1 (arXiv:2501.12948) showed it could reach OpenAI o1-level math performance without a separate value model. But GRPO has a quiet flaw...

Jangwook Kim · effloow.hashnode.dev · May 9 · 7 min read

Adaptive KV-Cache Quantization: How 'Don't Waste Bits' Cuts On-Device LLM Latency by 17%

Running LLMs on-device means fighting two constraints simultaneously: memory and latency. The KV-cache — the buffer that stores past token representations so the model does not recompute them — is often the bottleneck on both fronts. A paper publishe...
