Job Summary
We are looking for a highly capable engineer/researcher to lead the R&D of Small Language Models (SLMs) and Vision-Language Models (VLMs) for edge/low-latency and cost-efficient production scenarios. You will own the continuous pretraining, supervised instruction tuning (SFT), and compression/distillation pipelines, and work closely with platform teams to deliver reliable, measurable improvements in inference efficiency, tool-use success rate, and overall model quality.
Key Responsibilities
SLM/VLM Training: Continuous Pretraining & Instruction Tuning (SFT)
Conduct continuous pretraining and SFT for SLMs and VLMs to improve task performance and domain adaptation.
Build reproducible training workflows in PyTorch, including data processing, training, evaluation, and model versioning.
Compression, Distillation & Edge/Low-Latency Inference Optimiz...