<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Pengcheng's Blog</title><link>https://pengchengneo.github.io/</link><description>Recent content on Pengcheng's Blog</description><generator>Hugo -- gohugo.io</generator><language>zh-CN</language><copyright>© 2026 Pengcheng</copyright><lastBuildDate>Sat, 04 Apr 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://pengchengneo.github.io/index.xml" rel="self" type="application/rss+xml"/><item><title>GEMM Kernel 性能优化十问</title><link>https://pengchengneo.github.io/posts/gemm-kernel-optimization-10-questions/</link><pubDate>Sat, 04 Apr 2026 00:00:00 +0000</pubDate><guid>https://pengchengneo.github.io/posts/gemm-kernel-optimization-10-questions/</guid><description>基于 Pallas GMM FP8 blockwise 量化内核开发的实战问答，涵盖量化粒度、子通道循环、分阶段 tiling、编译链路、精度对齐方法论等核心话题。</description></item><item><title>SGLang-JAX</title><link>https://pengchengneo.github.io/projects/sglang-jax/</link><pubDate>Fri, 20 Mar 2026 00:00:00 +0000</pubDate><guid>https://pengchengneo.github.io/projects/sglang-jax/</guid><description>SGLang 的 JAX 后端实现，支持在 TPU 上运行高效推理。</description></item><item><title>SGLang-JAX: An Open-Source Solution for Native TPU Inference</title><link>https://pengchengneo.github.io/posts/sglang-jax-tpu-inference/</link><pubDate>Wed, 29 Oct 2025 00:00:00 +0000</pubDate><guid>https://pengchengneo.github.io/posts/sglang-jax-tpu-inference/</guid><description>SGLang-JAX 是基于 JAX 和 XLA 构建的开源推理引擎，支持 continuous batching 和 speculative decoding，在 TPU 上实现高效原生推理。</description></item></channel></rss>