前玮科技(上海)有限公司
Full Stack Engineer, AI systems
高科技/人工智能
高科技/人工智能
广州, 上海, 北京, 深圳
经验不限
本科
¥50 - 100K15薪
公司介绍
前玮科技(上海)有限公司成立于2024年03月15日,注册地位于上海市静安区江场三路238号1601室,法定代表人为LOW YU WEI。经营范围包括一般项目:技术服务、技术开发、技术咨询、技术交流、技术转让、技术推广;工程和技术研究和试验发展(除人体干细胞、基因诊断与治疗技术开发和应用,中国稀有和特有的珍贵优良品种);信息技术咨询服务;信息咨询服务(不含许可类信息咨询服务)。(除依法须经批准的项目外,凭营业执照依法自主开展经营活动)
职位描述
A1 is building a proactive AI smart assistant for everyday users to bring intelligence to conversations, errands, organising and workflows.
Our product focuses on achieving high reliability for long-running workflows, persistent context, and real-world task completion. The system must handle multi-step reasoning, interact with external tools, and remain reliable despite non-deterministic model behavior.
Role
We are looking for a Full Stack Engineer - AI Systems to build the product layer that turns these capabilities into usable, production-grade workflows. This includes designing how agents operate, fail, recover, and deliver consistent value to users.
Focus
Build and operate backend systems that serve AI-powered features in production.
Design inference pipelines, orchestration layers, and service boundaries around models.
Own production concerns: monitoring, logging, alerting, and incident response.
Optimize latency and throughput across inference, caching, batching, and streaming.
职位要求
Ideal Experiences
Strong backend engineering fundamentals in production environments.
Experience running high-throughput, low-latency services.
Familiarity with AI inference patterns (LLMs, embeddings, multimodal).
Comfortable debugging distributed systems under load.
Bias toward shipping and learning from production behavior.
Outcomes
Backend systems run reliably at scale, handling production AI traffic with low latency and high throughput.
APIs are stable, clear, and support seamless integration with frontend and ML systems.
Production incidents are quickly detected, diagnosed, and resolved, minimizing user impact.
Iterative improvements based on real usage continuously increase system performance and reliability.
Tech Stack
Python
NodeJs
Pytorch
OpenAI / Anthropic / open-source LLMs
SQl & noSQL
Kubernetes
Docker
分享