VLM Run | 1x Product + 1x ML Staff Engineer | Santa Clara, CA (HQ)

We're building the inference and orchestration layer for production Vision-Language Models. We care deeply about fast and ergonomic visual inference, reliable structured outputs, and the observability to iterate on them.

What We've Shipped

Here are a few things you can check out:

Orion: Our visual agent that reasons and acts over images, video, and documents. Chat at https://chat.vlm.run.
mm-ctx: A Unix-style multimodal CLI (find, cat, grep, wc) that gives coding agents real context over images, video, and PDFs. Rust core, Python devex. See https://pypi.org/project/mm-ctx and https://www.vlm.run/open-source/mm.
vlmbench: A single-file CLI for benchmarking VLM inference (TTFT, TPOT, throughput) across vLLM, Ollama, and SGLang. See https://github.com/vlm-run/vlmbench and https://www.vlm.run/open-source/vlmbench.

Apply

Apply at https://app.dover.com/jobs/vlm-run

Or email hiring "at" vlm.run with your GitHub and a couple of recent projects.
