About the Job
This role is for one of our client companies, a VC-backed health-tech, AI, and wearables startup that has raised $2M USD in funding.
Salary: Up to 50 LPA
Apply once and, if selected, get access to up to 20 remote and onsite interview opportunities.
What We're Building
CodeRound AI matches the top 5% of tech talent with the fastest-growing, VC-funded AI startups across Silicon Valley and India.
Top-tier product startups across the US, UK, EU, UAE, and India have hired top engineers through CodeRound.
What You'll Build
We're looking for a hands-on AI Engineer who has built production AI systems, not just AI prototypes. You'll own everything from multi-agent architectures and RAG pipelines to real-time inference, model serving, and production reliability.
This is a founding engineering role where you'll work directly with the founders to design, build, deploy, and scale AI systems used by real customers.
What You'll Do
- Design, build, and deploy production-grade LLM applications and conversational AI systems
- Build multi-agent workflows using orchestration frameworks such as LangGraph or CrewAI
- Develop production RAG systems using vector databases and hybrid retrieval architectures
- Optimize inference latency, GPU utilization, memory consumption, and LLM serving costs
- Build reliable model serving infrastructure with monitoring, observability, evaluations, retries, and guardrails
- Work on real-time AI applications involving streaming, speech (STT/TTS), or voice agents
- Design scalable backend services and APIs supporting AI workloads
- Collaborate with product and engineering teams to rapidly ship AI features into production
What We're Looking For
- 2+ years of software engineering or AI engineering experience
- Strong hands-on experience building production LLM applications
- Experience deploying RAG systems into production
- Experience with vector databases such as Pinecone, pgvector, Weaviate, Chroma, or FAISS
- Experience with LangChain, LangGraph, LlamaIndex, or similar AI orchestration frameworks
- Strong understanding of transformers, embeddings, prompt engineering, and evaluation methodologies
- Experience building model serving infrastructure and production ML systems
- Strong backend engineering skills (Python preferred)
- Experience with cloud platforms (AWS/GCP/Azure), Docker, and scalable backend architecture
Bonus Points
- Experience building multi-agent systems
- Experience with voice AI (STT/TTS), streaming, or real-time conversational systems
- Experience with inference optimization, batching, quantization, or GPU optimization
- Experience implementing AI guardrails, structured outputs, and evaluation pipelines
- Experience working in early-stage startups or owning products end-to-end
Why Join?
- Build core AI systems from day one with significant technical ownership
- Work directly with founders on high-impact AI products
- Solve challenging engineering problems across LLM infrastructure, agentic systems, and production AI
- Ship products used by real customers in healthcare
- Join a fast-moving, VC-backed startup with significant growth potential