Original listing text, shown exactly as published by the company.
What You’ll Do
- Drive the end-to-end development of proprietary LLMs optimized for corporate finance, procurement, and supply chain domains. This includes pre-training, Parameter-Efficient Fine-Tuning (PEFT/LoRA), and Alignment techniques (RLHF/DPO).
- Partner with data engineering to curate, clean, and tokenize massive volumes of unstructured enterprise data, including supplier contracts, invoices, and complex global compliance regulations.
- Design robust Retrieval-Augmented Generation (RAG) frameworks to ground LLM outputs in verified, internal corporate knowledge bases, minimizing hallucinations.
What You Will Bring to Coupa
- Bachelor's/Master's degree in Computer Science, Data Science, Statistics, or a related quantitative field (or equivalent deep industry experience)
- 5+ years of experience as a Data Scientist, with a strong, proven track record of training and deploying transformer-based models and LLMs in production environments.
- Experience with Python (and PySpark) and query languages such as SQL
- Experience working with large datasets: cleaning and interpreting data, and automating processes
- A strong understanding of the trade-offs between model size, inference speed, cost, and accuracy, with the ability to choose the right tool for the specific business problem.…