Senior Machine Learning Engineer, Infrastructure

Rad AI

Software Engineering, Other Engineering
United States
Posted on Wednesday, May 22, 2024

About Rad AI

We have raised more than $80 million to date from venture funds and just closed our Series B financing with investors Khosla Ventures, Gradient (Google’s AI fund), and ARTIS. We’ve also formed a partnership with Google to collaborate on the future of generative AI and redefine healthcare. More than one-third of radiology groups and healthcare systems, including Kaiser Permanente, HCA Healthcare, and Geisinger, now use the latest generative AI advancements from Rad AI. We're recognized as one of the most promising healthcare AI companies by both CB Insights and AuntMinnie. Come join us in transforming healthcare with AI!

Founded by the youngest US radiologist in history, Rad AI empowers physicians with AI to save time, reduce burnout, and improve the quality of patient care. By combining our deep expertise in healthcare and AI and using one of the largest proprietary radiology report datasets in the world, our AI has uncovered hundreds of new cancer diagnoses for patients and reduced the error rate in tens of millions of radiology reports by nearly 50%.

Why Join Us:

Given our strong client growth and the momentum we project for the year ahead, we are seeking an experienced Senior Machine Learning Engineer to join our team. The ideal candidate has expertise in maturing, scaling, and optimizing the infrastructure of a quickly growing product, and a passion for building, teaching, learning, and collaborating in a high-performing cross-functional team working to make a difference in the lives of millions of patients and physicians.

What You’ll Be Doing:

  • Design, implement, and maintain the infrastructure that supports our machine learning applications, services, and workflows

  • Build, maintain, and improve our ML platform that supports continuous integration, continuous delivery, and continuous training for our machine learning models

  • Use low-level programming languages, cloud-native services, and serverless architectures to build scalable and resilient systems

  • Plan, design, and develop components in the data pipeline to enable various machine learning models in production

  • Lead the design and implementation of infrastructure projects, including the development of technical designs, plans, and specifications, along with their evolutions and updates

  • Design, deploy, and maintain the full ML platform stack, including monitoring, data observability, and support for the full model lifecycle

  • Investigate the existing pipeline, identify bottlenecks, and optimize the throughput and latency of ML components

  • Balance the level of detail in metrics and alerting against cost efficiency

  • Develop and implement automation tools for model training and deployment

Who We’re Looking For:

  • 4+ years of experience in ML Systems Engineering

  • 4+ years of industry experience writing Python (preferred) or other common languages in the ML domain

  • Strong experience with infrastructure and DevOps tools such as Kubernetes, Docker, and Ansible

  • Experience in distributed systems, storage systems, and databases

  • Strong knowledge of cloud computing platforms such as AWS (preferred), GCP, and Azure

  • Experience with infrastructure-as-code tools such as Terraform (preferred), Pulumi, CloudFormation, etc.

  • Experience with monitoring, tracing, and logging tools such as CloudWatch, New Relic, Prometheus, etc.

  • Excellent communication skills, with a strong sense of ownership and a systematic approach to problem-solving

  • Proven ability to lead active incidents, address their root causes, and use blameless postmortems to establish systems that prevent them from recurring

Nice to haves:

  • Experience working at an early-stage startup

  • Experience in a HIPAA-compliant environment

  • Experience working with machine learning frameworks such as PyTorch

  • Experience with inference optimization of NLP models

Come join our world-class team as we build and deploy AI solutions that will make a difference in millions of people’s lives. We are mission-driven and focused on transparency, inclusion, close collaboration, and building an incredible team.

If you're passionate about driving innovation and delivering impactful reporting solutions, we'd love to hear from you!

Rad AI offers a variety of benefits, including:

  • Comprehensive Medical, Dental, Vision & Life insurance

  • HSA (with employer match), FSA, & DCFSA

  • 401(k)

  • 11 paid company holidays

  • Location flexibility (remote-first company!)

  • Flexible PTO policy

  • Annual company-wide offsite

  • Periodic team offsites

  • Annual equipment stipend

At Rad AI, we value diversity and provide equal employment opportunities (EEO) to all employees and applicants without regard to race, color, religion, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. We will consider for employment qualified applicants with criminal histories in a manner consistent with the requirements of the San Francisco Fair Chance Ordinance.