Your mission
Join us at Rhesis AI – Open-source test generation and management for Gen AI applications that deliver value, not surprises!
At Rhesis AI, we empower organizations to develop and deploy Gen AI applications that meet high standards for reliability, robustness, and compliance. As the creators of an open-source solution for test generation & management, we enable AI teams to build context-specific tests, and collaborate directly with domain experts.
We're currently part of the K.I.E.Z. Accelerator at the Merantix AI Campus in Berlin, where we’re building the testing infrastructure Gen AI needs to earn trust at scale.
If you’re passionate about advancing trustworthy AI through practical tools and collaborative infrastructure, we invite you to join our mission.
Your profile
What you will do:
- Lead the design, development, fine-tuning, and deployment of large language models (LLMs) and generative AI systems for the Rhesis AI platform.
- Define and enforce best practices for LLMOps, ensuring scalable, maintainable, and secure model lifecycles from data ingestion to deployment and continuous monitoring.
- Guide architectural decisions and technical strategy across AI, ML, and NLP initiatives—translating cutting-edge research into practical, production-grade solutions.
- Build and scale LLM-powered data pipelines and workflows, integrating orchestration tools and collaborating closely with data engineering and product teams.
- Mentor engineers and establish engineering excellence within the AI/ML team, while remaining hands-on with development, testing, and performance optimization.
- Collaborate with leadership on roadmap planning and AI-driven product innovation, staying ahead of trends in foundational models, Gen AI, and applied ML.
- Contribute across the stack where needed—including MLOps, data engineering, and backend services supporting AI workflows.
You are great for this role, if you have:
- A Phd in Machine Learning or a related field.
- Deep hands-on experience with large language models (e.g., fine-tuning, evaluation, deployment) and generative AI architectures.
- Advanced proficiency in Python and ML frameworks such as PyTorch, JAX, TensorFlow, and HuggingFace Transformers.
- Strong grasp of MLOps tooling, including job orchestration, CI/CD for ML, and model monitoring.
- Solid foundation in ML theory, NLP techniques, and software engineering best practices (testing, documentation, version control).
Preferred Experience:
- Prior leadership in AI or ML-focused teams, including mentoring and technical direction.
- Experience with training and deploying state-of-the-art deep learning models in production environments.
- Familiarity with cloud-native ML infrastructure (AWS, GCP, Azure).
- Research or replication of cutting-edge ML papers, ideally in NLP or LLM optimization.
Overall Skills:
- Strong systems thinking with the ability to architect AI solutions that are both robust and extensible.
- Ability to manage ambiguity, break down complex problems, and drive them to resolution through collaboration.
- Excellent communication and documentation skills for cross-functional work and technical leadership.
- A mindset for building for scale, reliability, and continuous improvement—matching the fast pace of Gen AI evolution.
Why us?
We’re excited to offer a
fixed one-year contract (with very likely extension),
starting 15 June or 1 July, along with a range of benefits to support our team members, including:
- Work at the forefront of Gen AI: Collaborate with some of the most innovative companies building LLM applications. Contribute to the trustworthiness of AI by shaping open-source tools that define how Gen AI is tested and validated.
- Flexible work arrangements: We understand the importance of work-life balance and offer flexible working options to accommodate personal needs and preferences. We have offices in Berlin (AI Campus) and Potsdam (Griebnitzsee).
- Compensation: We offer salaries and benefits tailored to your experience and qualifications, along with the opportunity to gain ownership in the company.
- A supportive and collaborative work environment: We foster a culture of teamwork, collaboration, and mutual respect, where every team member is valued and supported in their professional and personal growth.
At Rhesis AI, we value diversity and inclusion, believing that diverse perspectives enrich our team and drive innovation. We encourage applications from individuals of all backgrounds, regardless of gender, nationality, religion, or other personal characteristics. Even if you don’t meet every requirement listed, we encourage you to apply—your unique skills and experiences could be exactly what we need to succeed.
Ready to join us?If you’re passionate about leading AI engineering, excited by the prospect of working with cutting-edge AI technology, and committed to making AI responsible, we’d love to hear from you!
Apply now and help us build solutions that shape the future of AI!
About us
At Rhesis AI, we’re driven by the goal of making AI evaluation and testing seamless, thorough, and accessible. We’re not just another tech company – we’re building solutions to ensure that Gen AI applications are reliable, resilient, and ready to meet the demands of real-world use. Our focus is on providing a comprehensive, automated testing platform that validates AI applications across diverse scenarios and industries, helping businesses confidently deploy Gen AI.