Role Introduction
We’re looking for an experienced AI/ML Engineer to lead the development of our core machine
learning models and infrastructure. In this role, you will be at the forefront of building real-time
voice AI solutions with millisecond-level inference latencies .
You will optimize our multi-tenant platform’s ML systems for performance and scalability, including
GPU acceleration and data pipeline efficiency, ensuring enterprise-grade reliability for our clients.
Responsibilities
- Design, develop, and refine machine learning models that power our voice AI platform.
- Optimize real-time inference pipelines for low-latency, high-throughput performance in a
production environment.
- Leverage GPU acceleration techniques to improve inference speed and efficiency in a
multi-tenant SaaS environment.
- Clean, preprocess, and manage large datasets; build robust data engineering pipelines to feed
ML models.
- Implement and maintain CI/CD pipelines for continuous training, testing, and deployment of ML
models into production.
- Integrate large language models (LLMs) and other AI algorithms into our product offerings,
tailoring solutions for enterprise use cases.
- Collaborate with cross-functional teams to deploy ML solutions both in the cloud and
on-premises for enterprise clients, ensuring smooth integration and performance.
- Monitor model performance in production and iterate on improvements in accuracy, latency, and
scalability.
- Hire and lead your own AI/ML team.
Qualifications
- 5+ years of experience as an AI/ML Engineer or similar role, with a strong foundation in machine
learning algorithms and principles.
- Proven expertise in developing and deploying ML models in real-time inference settings (e.g.,
streaming, live data scenarios).
- Hands-on experience with GPU optimization and parallel computing (e.g., CUDA, tensor
libraries) to accelerate ML workloads in a SaaS product .
- Proficiency in Python (or similar language) and ML frameworks (TensorFlow, PyTorch, etc.), with
a focus on production-level code quality.
- Experience in data engineering: handling large datasets, writing ETL pipelines, and employing
data cleaning techniques to ensure high-quality input data.
- Knowledge of CI/CD tools and practices for machine learning (Docker, Kubernetes, Jenkins or
similar) to automate model deployment.- Familiarity with enterprise-level LLM applications and frameworks, and the ability to fine-tune or
customize models for specific industry needs.
- Experience setting up and deploying ML solutions in on-premises environments for enterprise
clients, including knowledge of relevant security and compliance considerations.
- Strong problem-solving skills and the ability to work in a fast-paced startup environment, taking
ownership of complex ML projects from concept to production.
Preferred Skills
- Experience with voice AI or speech-related ML (automatic speech recognition, text-to-speech,
NLP for voice).
- Familiarity with distributed computing frameworks and tools (Spark, Dask, Ray) for scaling model
training or inference.
- Knowledge of MLOps platforms and monitoring tools (MLflow, Kubeflow, Prometheus, etc.) for
tracking experiments and model performance.
- Understanding of multi-tenant cloud architecture and microservices as it relates to deploying AI
models at scale.
- Contributions to open-source ML projects or publications in AI conferences/journals.
- Strong communication skills for explaining complex ML concepts to non-technical stakeholders
and working with client teams on customization.
Why CozmoX AI?
- Innovative Domain: Work on cutting-edge voice AI technology in a rapidly growing field. Voice
AI is poised to create enormous new market opportunities , putting you at the forefront of an
industry boom.
- Startup Growth: Join a dynamic startup at an exciting scaling stage – your work will have a
direct, significant impact on the product and company direction.
- Ownership & Impact: Enjoy a high degree of ownership over projects, creative freedom to
experiment, and the chance to see your ideas quickly come to life in production.
- Collaborative Team: Be part of a small, passionate team of experts. You'll work closely with
founders and cross-functional colleagues in a supportive, low-bureaucracy environment.
- Personal Growth: Tackle complex, cutting-edge challenges that accelerate your learning. In
our fast-paced environment, you’ll grow your skill set and career alongside the company.
- Mission-Driven: Help enterprises transform how they use voice and AI in their operations,
delivering technology that can improve customer experiences and business efficiency.