The OCI Generative AI team is responsible for building and scaling Oracle's Generative AI and Agent services. Our mission is to empower customers to apply cutting-edge AI to their unique business challenges—leveraging Oracle’s infrastructure, enterprise reach, and deep expertise in AI. We focus on delivering high-performing, secure, and enterprise-grade generative AI models and services, including state-of-the-art solutions for code generation (e.g., NL2Code), content generation, and AI agents.
Role Overview:
We are looking for a Senior Applied Data Scientist (IC5) with deep expertise in LLMs and Generative AI to join our growing team. In this role, you will drive innovation at the intersection of AI and software development, with a focus on natural language to code (NL2Code), LLM fine-tuning, and evaluation frameworks. You’ll collaborate with top scientists and engineers to build and deploy models that are not only powerful but also safe, secure, and reliable for enterprise use.
Key Responsibilities:
Lead development and evaluation of cutting-edge LLMs for NL2Code and related tasks (e.g., code synthesis, debugging, translation, and doc generation). Design and execute robust experimentation pipelines, benchmarking frameworks, and adversarial testing for model quality and safety. Collaborate with research and engineering to define and drive scientific direction in areas like instruction tuning, reinforcement learning, retrieval-augmented generation (RAG), and guardrails. Publish impactful research at top-tier conferences (e.g., NeurIPS, ACL, ICML, ICLR) and contribute to open science where applicable. Translate business problems into AI solutions while ensuring safety, fairness, and compliance. Serve as a technical thought leader, mentor junior scientists, and engage cross-functionally with product and engineering teams.
Required Qualifications:
PhD in Computer Science, Machine Learning, AI, or a related field. Strong publication record in top AI/ML conferences or journals. 5+ years of hands-on experience building and evaluating deep learning models, particularly transformer-based LLMs. Expertise in natural language processing, code generation (NL2Code), and multi-turn interaction modeling. Deep understanding of model evaluation, including human-in-the-loop methods and automated metrics. Strong software engineering skills, including proficiency in Python and in PyTorch or TensorFlow, and comfort working with large-scale distributed training systems. Experience with prompt engineering, alignment techniques (e.g., RLHF), and safety/guardrail systems is a plus. Ability to work in a fast-paced, collaborative environment and drive projects from ideation to deployment.
Why Join Us?
Work on cutting-edge generative AI technologies with real-world enterprise impact. Collaborate with world-class talent in a dynamic, high-growth environment. Enjoy the resources and scale of Oracle while pushing the boundaries of what GenAI can achieve.