Reports to: Vice President of Software Development
Department: Engineering
FLSA Category: Exempt
Position Type: Remote, Full-Time, Mid-Senior level
Travel Requirement: 0-10%
Office Location: Remote; Austin, TX or Denver, CO preferred
JOB SUMMARY
Clarvos is seeking an experienced Senior AI/ML Ops Engineer to join our team focused on deploying, scaling, and maintaining machine learning systems in production. This role bridges the gap between data science and engineering, transforming experimental models into robust, scalable solutions. If you have a proven record of building, scaling, and deploying LLM and ML solutions that drive innovation, we would like to hear from you.
ESSENTIAL FUNCTIONS AND RESPONSIBILITIES
- Design, deploy, and maintain secure, high-quality ML/LLM systems and data infrastructure (databases, data lakes, warehouses, pipelines) in GCP/AWS and Databricks environments.
- Implement large-scale, end-to-end data pipelines with Apache Spark, along with APIs for efficient data processing and scalable model serving.
- Architect CI/CD pipelines, LLM/MLOps workflows, and AI agent frameworks.
- Transform data science prototypes into production-ready systems with robust monitoring and tracing of performance, accuracy, and data drift.
- Collaborate with cross-functional teams to deliver tailored solutions that meet product requirements.
- Document technical designs and provide guidance to team members as needed.
KNOWLEDGE, SKILLS, ABILITIES, AND QUALIFICATIONS
- 5+ years of applied ML/LLM engineering experience, with a deep focus on unstructured data.
- Strong software, AI, and data engineering skills with proficiency in Python, SQL, and Spark.
- Expertise in the Databricks platform, Apache Spark for distributed data processing, and cloud platforms (GCP preferred; AWS or Azure).
- Demonstrated experience with the full ML lifecycle: data engineering, training, deployment, monitoring, and governance.
- Expertise with unstructured data, unsupervised algorithms (LSH, KMeans), NLP, embedding models (open source and third-party), ML/LLMOps tools (MLflow, LangSmith, Weights & Biases), containerization (Docker), vector databases (Databricks, pgvector), and AI agent frameworks (LangChain, LlamaIndex).
- Experience in RAG and fine-tuned LLM deployments is a plus.
- Demonstrated technical leadership experience.
- Strong communication skills.
- Excellent collaboration skills (primarily between Engineering and Data Science).
PHYSICAL REQUIREMENTS/WORKING CONDITIONS
Standing/Walking/Mobility: Must have mobility to attend meetings remotely and in person.
Climbing/Stooping/Kneeling: 0% - 10% of the time.
Lifting/Pulling/Pushing: 0% - 10% of the time.
Fingering/Grasping/Feeling: Must be able to write, type, and use a telephone system 100% of the time.
Sitting: Must be able to sit for prolonged periods of time.
This job description reflects management’s assignment of essential functions; it does not prescribe or restrict the tasks that may be assigned. Management may revise duties as necessary without updating this job description.
For more information about the company, please visit our website: https://clarvos.com/
Clarvos is an Equal Opportunity Employer and does not discriminate against any employee or applicant for employment because of race, color, sex, age, national origin, religion, sexual orientation, gender identity, veteran status, disability, or membership in any other federal, state, or local protected class.
Clarvos complies with federal and state disability laws and makes reasonable accommodation for applicants and employees with disabilities.
If you require a reasonable accommodation in completing the application, interviewing, completing any pre-employment testing, or otherwise participating in the employee selection process, please direct your inquiries to hrsupport@Clarvos.com.