Personal homepage China time

AI product engineer / multimodal mapping

input text

i map(x)

text -> image
output image

Text, image, audio, logs, scenes, and traffic mapped into controllable products, code, state, and operations.

AI products with infrastructure underneath.

From vague intent to shipped control surfaces.

HA Distributed services
Data Structured / unstructured
Infra Models / containers / tools
Lead/SRE Delivery / reliability
Product Scope / boundaries / delivery
LLM systems Agent SDKs / Vercel AI SDK / Cloudflare AI Gateway / OpenAI Agents / Claude Code / Hermes
Generation Stable Diffusion / LoRA / ComfyUI / GPT-SoVITS
Backend FastAPI / Docker / model serving / internal tools
SRE TDSQL / automation / containers / remediation

Lab / private systems.

Small systems, tools, and experiments. Some private, some forked, all reworked into usable surfaces.

Personal AI OS

Devices as interfaces

devices / voice / events / gateways
Knowledge engines

Memory as infrastructure

schemas / queues / graphs / retrieval
Agent interfaces

Messy platforms, clean tools

CLI / MCP / browser / task I/O
Generative canvases

Pixels under direction

image / voice / timeline / model control

Art is a timeline you can scrub.

Motion, text, shade, and progress on one controlled axis.

ScrollVideo study

Scroll becomes time.

Frame, annotation, and exit state share one progress value.

0% / 0.00s

Proof in products, not pitch decks.

PageOn

AI PPT generation: visual representation language, structured LLM output, multimodal agents.

app.pageon.ai

Cyber Space

App Store multimodal AI chat: text, voice, image input, Stable Diffusion, voice conversion.

App Store

Tencent SRE systems

TDSQL, Python ops packages, single-machine deploy, alert remediation, container optimization.

Automation / container optimization / SRE

Public traces

Repos and images: forks, prototypes, containers, and model-service experiments.