AI Notes — April 29
NVIDIA Nemotron 3 Nano Omni: 30B/A3B multimodal MoE, 256K context, 9x throughput. Mini-SGLang prefix matching with the radix tree. Unsloth LoRA: merged vs non-merged tradeoffs. Mimicking Dream of the Red Chamber style with a 167MB adapter. TRL DPO end-to-end.