As OpenAI has rapidly expanded access to its groundbreaking AI products, questions have emerged around how user data is handled. After making important changes to its data privacy practices earlier this year, OpenAI is now providing further clarity into how user data is handled, particularly for developers utilizing its API.
Previously, data submitted through the API before March 1, 2023 could have been incorporated into model training. This is no longer the case since OpenAI implemented stricter data privacy policies.
Inputs and outputs to OpenAI's API (directly via API call or via Playground) for model inference do not become part of the training data unless you explicitly opt in.
The company emphasized that models that are accessible via the API are versioned and have completed training—they are not retrained or updated in real-time with API requests.
Additionally, data upload by users for fine-tuning, is solely used to customize that user's model. It does not flow into OpenAI's general training data. OpenAI does retains ownership of fine-tuned models, however access is exclusive to the user who provided the training data. These users can delete their training data or fine-tuned models at any time.
The company noted that it may access API data solely for potential abuse monitoring and investigation. To ensure safe and responsible use of their models, API input and output data may be run through safety classifiers. This data is deleted within 30 days unless required for legal reasons, and is only accessible by authorized OpenAI employees, as well as specialized third-party contractors (that are subject to confidentiality and security obligations).
Models are NOT trained with data that is run through safety classifiers.
If OpenAI discovers an API user has violated their policies, they may require changes, or for repeating or serious issues, they may suspend or terminate accounts.
Finally, the company detailed the enterprise-grade security and compliance measures that they implement and maintain across all their products and services:
- SOC 2 Type 2
- CCPA and other state privacy laws
OpenAI has clearly evolved its data policies with greater privacy safeguards and transparency, particularly around access to API user data. Businesses cautious about potential unconsented data use may now have less apprehension.
However, preserving this trust will necessitate continued diligence from OpenAI in upholding rigorous security standards and maximum transparency about any first-party data leveraged for model training. As the company continues to evolve, its commitment to privacy, safety, and the responsible use of AI, will be instrumental in driving adoption in the enterprise.