Securing AI Model Weights: Preventing Theft and Misuse of Frontier Models

Abstract

The goal of this report is to improve the security of frontier artificial intelligence (AI) or machine learning (ML) models. (Frontier models are those that match or exceed the capabilities of the most advanced AI models at the time of their development.) Our analysis focuses on foundation models, and specifically large language models and similar multimodal models. We focus on the critical leverage point that is the core of a models intelligence and capabilities: its weights, a term used here to refer to all learnable parameters derived by training the model on massive datasets. These parameters stem from large investments in data, algorithms, compute (i.e., the processing power and resources used to process data and run calculations), and other resources; compromising the weights would give an attacker direct access to the crown jewels of an AI developers work and the ability to exploit them for their own use.

Open PDF

Document Details

Document Type: Technical Report
Publication Date: May 01, 2024
Accession Number: AD1229050

Entities

People

Ajay Karpur
Dan Lahav
Henry A. Bradley
Jeff Alstott
Sella Nevo
Yogev Bar-on

Organizations

RAND Corporation

Securing AI Model Weights: Preventing Theft and Misuse of Frontier Models

Abstract

Document Details

Entities

People

Organizations

Tags

Fields of Study

Readers

Technology Areas