@effloow

Jangwook Kim

@effloowTokyo, JapanJoined April 2026

Website

About

Nothing here yet.

Available for

Nothing here yet.

Jangwook Kim's blogs

Efflooweffloow.hashnode.dev198 posts

Articles Comments

Recently published

JKJangwook Kimeffloow.hashnode.devMay 12 · 10 min read

MemMachine: Ground-Truth Memory for AI Agents

Every time an agent summarizes a conversation to save memory, it loses information. That trade-off has been accepted as unavoidable — LLMs produce long outputs, context windows are finite, and token costs are real. MemMachine, presented in arXiv pape...

JKJangwook Kimeffloow.hashnode.devMay 11 · 5 min read

ReFlect: Training-Free Error Recovery for Long-Horizon LLM Reasoning

Long-horizon reasoning is where production LLM agents tend to quietly break. A model can produce a plausible-looking chain of thought, accept a wrong intermediate answer, and continue building on that error for every step that follows. By the time th...

JKJangwook Kimeffloow.hashnode.devMay 11 · 5 min read

Claude Managed Agents: Dreaming, Outcomes, and Multiagent

Anthropic released three new Claude Managed Agents features on May 7, 2026: dreaming (a research preview that lets agents learn from their own session history), outcomes (a rubric-based grading system that guides agent behavior toward defined success...

JKJangwook Kimeffloow.hashnode.devMay 11 · 5 min read

PARSE: Faster LLM Inference via Parallel Prefix Speculative Decoding

Speculative decoding became the standard inference speedup technique through 2024 and 2025. The idea: a small draft model generates a sequence of candidate tokens, and a larger target model verifies them in parallel — accepting the longest valid pref...

JKJangwook Kimeffloow.hashnode.devMay 11 · 10 min read

ZAYA1-8B: Zyphra's Efficient MoE Reasoning Model Guide

The scaling-is-everything story has a new challenger. On May 6, 2026, Zyphra released ZAYA1-8B — an open-weight Mixture-of-Experts reasoning model with 8.4 billion total parameters and fewer than 800 million active per token. On AIME 2025, a benchmar...

Jangwook Kim

About

Available for

Jangwook Kim's blogs

Recently published

MemMachine: Ground-Truth Memory for AI Agents

ReFlect: Training-Free Error Recovery for Long-Horizon LLM Reasoning

Claude Managed Agents: Dreaming, Outcomes, and Multiagent

PARSE: Faster LLM Inference via Parallel Prefix Speculative Decoding

ZAYA1-8B: Zyphra's Efficient MoE Reasoning Model Guide

Search Hashnode

Jangwook Kim

About

Available for

Jangwook Kim's blogs

Recently published

MemMachine: Ground-Truth Memory for AI Agents

ReFlect: Training-Free Error Recovery for Long-Horizon LLM Reasoning

Claude Managed Agents: Dreaming, Outcomes, and Multiagent

PARSE: Faster LLM Inference via Parallel Prefix Speculative Decoding

ZAYA1-8B: Zyphra's Efficient MoE Reasoning Model Guide