Meta has released a new dataset designed to help train AI systems that can assist with scientific research. The Research Plan Generation (RPG) dataset contains approximately 22,500 research tasks across three domains: machine learning, general arXiv papers, and PubMed biomedical literature.
Each task in the dataset includes a research goal, a grading rubric, and a reference solution. The rubrics and solutions were generated by Llama 4 Maverick, Meta's mixture-of-experts model, which means that under the Llama license terms any derivative models must include "Llama" in their name.
The dataset is divided into three subsets: ML (7.56K rows), arXiv (8.07K rows), and PubMed (6.89K rows), each with train and test splits. The structure is straightforward: a goal describes the research task, the rubric provides evaluation criteria, and the reference solution summarizes how authors addressed similar problems. The arXiv subset additionally carries subdomain and category labels for more granular filtering.
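To make the record structure concrete, here is a minimal sketch of what a task and a rubric-based grader might look like. The field names and the keyword-matching scorer are illustrative assumptions, not the dataset's actual schema or Meta's grading method (which, per the cited paper title, uses rubric rewards, presumably with an LLM judge rather than string matching):

```python
from dataclasses import dataclass


@dataclass
class ResearchTask:
    """Illustrative model of one RPG record: goal, rubric, reference solution.

    Field names are assumptions for this sketch; consult the dataset card
    for the real column names.
    """
    goal: str
    rubric: list[str]          # evaluation criteria a research plan should satisfy
    reference_solution: str    # summary of how authors addressed similar problems


def score_plan(plan: str, task: ResearchTask) -> float:
    """Toy grader: fraction of rubric criteria mentioned in the plan.

    A real rubric-reward setup would score each criterion with a model
    judge; this keyword check only illustrates the overall shape.
    """
    if not task.rubric:
        return 0.0
    hits = sum(1 for criterion in task.rubric if criterion.lower() in plan.lower())
    return hits / len(task.rubric)


task = ResearchTask(
    goal="Improve sample efficiency of RL fine-tuning",
    rubric=["baseline", "ablation", "evaluation metric"],
    reference_solution="Compared against a standard baseline with ablations.",
)
print(score_plan("Run an ablation against the baseline; report an evaluation metric", task))
# → 1.0
```

The point of the sketch is the data shape: each record pairs an open-ended goal with machine-checkable grading criteria, which is what lets the dataset serve as a benchmark rather than plain training text.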
Meta positions this as benchmarking data rather than training data in the traditional sense: according to the dataset card, the data is released under CC BY-NC and intended for benchmarking purposes only. The company cites a forthcoming paper, "Training AI Co-Scientists using Rubric Rewards," as the source.
The Bottom Line: A benchmark dataset for measuring how well AI systems can plan scientific research, with Llama 4-generated rubrics as the grading standard.
QUICK FACTS
- ~22,500 total research tasks across three subsets
- Domains: ML, arXiv, PubMed
- License: CC BY-NC (non-commercial)
- Reference solutions generated by Llama 4 Maverick
- Available at huggingface.co/datasets/facebook/research-plan-gen
