About Nexusflow.ai
Modern enterprise copilots & agents call for last-mile quality, enterprise-grade robustness and scalable operation costs, beyond simplified programming interfaces for generative AI. Nexusflow tackles this challenge, enabling enterprises to own their workflow copilots & agents stacked on top of powerful yet cost-effective, compact LLMs. We train large language models and build last-mile quality dev tooling for copilots & agents on your enterprise workflows. Our team has built the open-source LLM, NexusRaven-V2, rivaling GPT-4 in function calling with a 100X smaller model size. Our team members are also behind the scenes of Starling, the #1 ranked compact 7B chat model based on human evaluation in Chatbot Arena.
Position: Backend Engineer
Nexusflow is currently adding Backend Engineers to our team. Our Backend Engineers package up our technology in models and last-mile quality tooling. They will be the driving force behind building our products and solutions, working in close collaboration with our ML Engineers and Front-end Engineers.
Responsibilities
- API system development for copilot & agent quality tooling
- API system development for copilot serving and integration, with a focus on enterprise-grade requirements in the following areas:
  - Integration with on-prem & cloud compute vendors
  - Integration with software tools required in customer-oriented solutions
- Distributed system and optionally GPU performance optimization
- Wear many hats and collaborate with the whole team for product development, deployment and customer success
Qualifications
Required
- Experience in ML model or ML data pipeline deployment (on-prem or in the cloud)
- Experience building backends for application or platform API systems
Preferred
- Experience working in a fast-paced team environment
- Experience in using or contributing to modern compute frameworks for LLMs (e.g. DeepSpeed, Hugging Face TGI, FSDP)
- Experience in projects involving LLMs