Karina Nguyen
I work on alignment capabilities and honesty research at Anthropic, reducing hallucinations in large language models, training and evaluating models with novel capabilities, and conducting AI safety research. Most recently I led Claude Instant 1.2 training and productionized the model in the API. Previously, as a design engineer, I collaborated on R&D prototypes, journalism tools, and product features with teams at Primer.ai, Dropbox, Square, and The New York Times.
Main Publications
Towards Measuring the Representation of Subjective Global Opinions in Language Models
We develop a method to measure the global opinions represented in language models.
In submission 2023
Discovering Language Model Behaviors with Model-Written Evaluations
We test LMs using >150 LM-written evaluations, finding cases of inverse scaling in which models exhibit sycophantic behaviors.
ACL'23 (Findings)
FAIR-Ensemble: When Fairness Naturally Emerges From Deep Ensembling
We find substantial gains in worst-k and minority-group performance, i.e., fairness naturally emerges from deep ensembling.
In submission 2023
Towards Semantically-Aware UI Design Tools: Design, Implementation and Evaluation of Semantic Grouping Guidelines
We develop a computational metric to measure violations of semantic grouping guidelines in UI designs.
To appear at ICML Workshop'23
Investigations
My work in visual investigative journalism and human rights involved extensive data collection, evidence verification, satellite imagery analysis, 3D reconstructions, legal submissions, investigative tools, and applied remote sensing, in collaboration with:
- Bloomberg CityLab
- Wired
- New York Times
- Washington Post
- CNN
- Associated Press
- Bellingcat
- SITU Research
- The Atlantic Council
- Amnesty International
Say Hi!