These are notes, thoughts, progress logs, and ideas I'm still forming. Some will age well. Some won't — that's the point.

AI Notes — April 29

NVIDIA Nemotron 3 Nano Omni: 30B/A3B multimodal MoE, 256K context, 9x throughput. Mini-SGLang prefix matching with the radix tree. Unsloth LoRA: merged vs non-merged tradeoffs. Mimicking Dream of the Red Chamber style with a 167MB adapter. TRL DPO end-to-end.

AI Notes — April 28

Sakana's 7B Conductor orchestrates frontier models, hits 83.9% on LiveCodeBench. OpenAI's AI-first phone targeting 2028. GUI Agent annotation needs a totally different paradigm. YC Summer 2026 RFS: 14 directions betting AI is now infrastructure, not feature.

AI Notes — April 27

Medical-LLM refactor: 4 findings on overnight runs and multi-format interference. Architectural breakdown of Gemma 4, Qwen 3.6, GLM-5.1, Kimi K2.6 and DeepSeek V4-Pro. Anthropic's Project Deal: Opus agents close better trades than Haiku.

AI Notes — April 26

SkillsBench vs our skillrank: a postmortem on seven mistakes, including LLM-as-judge instead of deterministic verifiers, pairwise instead of pass/fail, no with/without baseline, and too much time on infra.

Where Sages Agree

A book on where four wisdom traditions converge — Zen, Confucianism, Stoicism, and Adlerian psychology — on what it means to live well in an anxious age.

AI Notes — April 25

DeepSeek-V4 vs Flash Attention vs MHA — algorithmic vs architectural innovation. CSA/HCA shrinks KV cache 5-10x via low-rank latent compression. GPT-Image 2 + Seedance 2.0 short-film workflow.

AI Notes — April 24

GPT-5.5 ships: faster, cheaper on net, smarter. swyx on AI-native: skills as the agent unit, app companies outlast infra, Taalas bakes models into silicon. World ID 4.0 hits Tinder, Zoom, DocuSign.

AI Notes — April 23

Shopify at ~100% internal AI use, critique loops over parallel agents, Tangle/Tangent/SimGym. MacAskill on AI character as the most underrated lever. mini-sglang RadixAttention vs nano-vllm: 7311 tok/s on a single 3090.

AI Notes — April 22

Claude Design locks in creativity. GPT-Image-2 tops Image Arena by +242 Elo. ChatGPT Images 2.0 adds reasoning before drawing. RankAI's SEO+GEO stack. Google: 75% of new code is AI-generated.

AI Notes — April 21

RLVR explained via DeepSeek-R1. Hermes agent patterns: stateless units, structured failure traces, directory-scoped AGENTS.md. Alex Imas on the post-commodity economy.

AI Notes — April 20

Generative Agents (Smallville), OASIS large-scale social simulation, and Love First Know Later — three papers mapping the theoretical base for persona products like Halo.

AI Notes — April 19

Claude Code terminal shortcuts (Shift+Tab, Esc, @). Fengtian's workflow: two Max plans + voice input + Agent Team mode = 10x productivity.

AI Notes — April 18

Claude Design pipeline: Pinterest inspiration → AI-generated background and character → Seedance 2.0 animation → motionsites.ai template → Landbook layouts.

AI Notes — April 17

Overseeing agents is the future, not writing code. Deep dive into nano-vllm attention, preempt, prefix caching. McKinsey on the agentic organization.

AI Notes — April 16

Energy-Based Models: not new — Hopfield Networks, Boltzmann Machines, diffusion models all trace back here. Yann LeCun's bet against autoregressive LLMs.

AI Notes — April 15

Local model rankings from Reddit, how to steer AI toward your design style with images, the 2026 AI engineer roadmap, Karpathy on the AI capability gap.

AI Notes — April 14

nano-vLLM deep dive: prefill vs decode, KV cache, PagedAttention, continuous batching. Plus Notion's Model Behavior Engineer role and software factory design.

AI Notes — April 13

GLM-5.1 architecture explained (MoE, MLA, DSA). Using Claude for tax filing: what broke. AI writing is harder than it looks. The folder-as-agent pattern.

AI Notes — April 12

A quiet day. Sometimes letting ideas settle is the work.

AI Notes — April 11

Consultant-style agent coordination: cheap executor + expensive advisor. Haiku + Opus doubles BrowseComp scores vs Haiku alone.

AI Notes — April 10

Meta's Muse Spark: 10x efficiency over Llama 4, 16 hidden tools in meta.ai. Two thoughts: AI tools as games, vibe coding as web fiction.

AI Notes — April 9

Mythos scores 93.9% on SWE-bench — a nuclear weapon. Picotron distributed training: DP naive vs bucket, AFAB vs 1F1B pipeline schedules.

AI Notes — April 8

Moltbook: AI theater or genuine emergence? Nebius $46B in signed contracts. Ryan Lopopolo on harness engineering and zero human-written code.

AI Notes — April 7

Why changing one character in an image is harder than generating a cyberpunk city. Full diffusion model walkthrough with math and code.

AI Notes — April 6

Claude's Cowork feature supports Computer Use across devices — control a remote machine's browser without touching your own.

The Force That Keeps Me Moving

A simple number changed everything. Thirty thousand days. That's roughly how many days a human life has. This realization reshaped how I live, work, and think about time.

Why I'm Building in Public

The decision to document everything publicly wasn't easy. Here's why I chose transparency over polish, and what I hope to gain from it.

What Steplify Taught Me About Product-Market Fit

My startup failed. But the lessons about listening to users, timing, and the gap between conviction and validation are worth more than any success.