Senior Machine Learning Engineer

Genentech
United States, California, South San Francisco
Jan 30, 2025
The Position

We advance science so that we all have more time with the people we love.

At Genentech Computational Sciences (gCS), Computational Catalysts, we are on a mission to partner across the organization to realize the potential of data, technology, and computational approaches that will revolutionize how targets and therapeutics are discovered and developed, ultimately enabling novel treatments for patients across the world. We stand at the beginning of an exciting journey.

The Computational Catalysts group within gCS is a diverse, curious, and action-driven team at the intersection of computation, engineering, and science, with the ambition to advance our technical excellence. The team focuses on partnering with the informatics and scientific communities to create a computational and data ecosystem that powers scientific discovery and accelerates decision-making. We aim to modernize our ability to acquire, store, link, share, find, and analyze data across the organization through scalable and integrated solutions that truly make every data point count.

At Genentech Computational Sciences (gCS), we're looking for a motivated Senior Machine Learning Engineer to join us in developing innovative solutions for protocol generation, review, and AI-based search platforms. In this role, you'll work closely with key stakeholders to deliver impactful machine learning solutions that benefit our broader R&D community.

As a Senior Machine Learning Engineer within the Data & Solutions Engineering function, you will play a crucial role in crafting and implementing machine learning models that drive two of our internal R&D solutions: a search and insights platform tailored to our data ecosystem, and a protocol generation and review platform that streamlines key research workflows. Working closely with researchers, scientists, and engineers, you will bring a harmonious approach and technical rigor to projects that fulfill our scientific teams' needs.
This role is ideal for someone passionate about applying machine learning to improve internal solutions that directly enhance research and development.

The Opportunity:

Design, develop, and deploy cloud-first, API-driven machine learning applications for data search, insights, and protocol generation and review platforms.
Leverage large language models (LLMs) to improve contextual search, data retrieval, and scientific research efficiency through advanced prompt engineering, retrieval-augmented generation, and fine-tuning techniques.
Develop and refine LLMs tailored for protocol generation and review workflows, driving innovation in GenAI applications to streamline R&D processes.
Collaborate with data engineers, software engineers, and architects to integrate ML models effectively within the internal data ecosystem.
Monitor, validate, and optimize ML applications to ensure high-quality outputs, performance scalability, and a seamless user experience.
Partner with research teams to identify needs, exchange insights, and deliver solutions that address evolving R&D requirements.

Who You Are:

Hold a Bachelor's or Master's degree in Computer Science, Data Science, Applied Mathematics, Bioinformatics, or a related quantitative field, with 3-5 years of experience deploying machine learning applications at scale, preferably in R&D or data-intensive environments.
Are proficient in Python, with hands-on experience using modern frameworks for deep learning and GenAI, such as PyTorch, Hugging Face Transformers, LangChain, or LlamaIndex.
Have a good understanding of machine learning algorithms, model evaluation techniques, and performance optimization, along with knowledge of deploying LLMs in data-intensive settings.
Are skilled in cloud platforms (AWS, GCP, Azure), version control systems (Git, DVC, MLflow), CI/CD pipelines, and SQL for relational database management.
Are a collaborative problem-solver with a strong sense of ownership, capable of partnering with interdisciplinary teams to deliver impactful solutions.
Stay current on advancements in LLMs and GenAI, with a passion for applying these technologies to drive efficiencies in R&D workflows.

Preferred:

A public portfolio of projects available on GitHub/GitLab.
A record of scientific excellence, as evidenced by at least one publication in a scientific journal or conference.

Not sure if you meet all qualifications? Let us decide! Research shows that women and members of other under-represented groups tend not to apply to jobs when they think they may not meet every qualification, when, in fact, they often do! We pledge to create a diverse and inclusive environment and strongly encourage you to apply.

Onsite presence on our South San Francisco campus is expected at least 3 days a week.

Relocation benefits are available for this job posting.

The expected salary range for this position, based on the primary location of California, is $147,500 - $273,900. Actual pay will be determined based on experience, qualifications, geographic location, and other job-related factors permitted by law. A discretionary annual bonus may be available based on individual and Company performance. This position also qualifies for the benefits detailed at the link provided below.

Benefits

#LI-JD1 #gCSCatalysts #gCS

Genentech is an equal opportunity employer, and we embrace the increasingly diverse world around us. Genentech prohibits unlawful discrimination based on race, color, religion, gender, sexual orientation, gender identity or expression, national origin or ancestry, age, disability, marital status, and veteran status.