
A technology recruitment firm is seeking an AI Infrastructure Architect in Edinburgh. This role involves designing a unified architecture platform for AI workloads, building execution frameworks across different computing units, and creating a high-performance Runtime. Ideal candidates will have strong knowledge of system architecture, experience with Serverless architectures, and proficiency in relevant programming languages. This is a permanent position requiring on-site work. Interested applicants should get in touch via the email address provided.
Job Title: AI Infrastructure Architect
Location: Edinburgh, Scotland
Type: Permanent
On-Site Working Required, No Sponsorship Provided
Design a unified AI Infra & Serving architecture platform for composite AI workloads such as LLM Training & Inference, RLHF, Agent, and Multimodal processing. This platform will integrate inference, orchestration, and state management, defining the technical evolution path for Serverless AI + Agentic Serving.
Design a heterogeneous execution framework across CPU/GPU/NPU for agent memory, tool invocation, and long-running multi-turn conversations and tasks. Build an efficient memory/KV-cache/vector store/logging and state-management subsystem to support agent retrieval, planning, and persistent memory.
Build a high-performance Runtime/Framework that defines the next-generation Serverless AI foundation. It should provide elastic scaling, cold start optimization, batch processing, function-based inference, request orchestration, and dynamic decoupled deployment, supporting performance-critical scenarios such as multiple models, multi-tenancy, and high concurrency.
If you're interested in applying, please reach out to daniel@microtech-global.com.