Design and implement embedded software libraries and a low-level runtime for our target platforms.
Develop and maintain the compiler path (MLIR/LLVM passes, code generation, kernels) that maps AI and DSP primitives, along with related operations, to our hardware.
Develop and refine a benchmarking and profiling framework that incorporates reproducible tests, dashboards, and regression gates.
Strengthen build, test, and CI so releases are predictable and artifacts are easy to consume.
Collaborate with hardware, architecture, and customer-facing teams; write precise specs and documentation; turn feedback into roadmap items.
Outcomes (first 18 months)
A production-ready driver + runtime stack for at least one MCU target and one accelerator-class target.
A working compiler path with visible wins in latency and energy on representative models, documented end-to-end.
A stable benchmark suite with automated reports and perf...