Getty Images Model Card

Last updated: April, 2026

Model Details

Model name: Generative AI by Getty Images
Model release date: November, 2025
Model version: Bria Fibo Lite
Model summary: Generative AI by Getty Images is a commercially safe service built on a responsibly trained and clean foundational model. Key elements of this commercial safety are:
- The model was trained on high resolution photography, illustrations, and still images from Getty Images and Bria’s other creative partners, paired with detailed descriptions for each asset. Bria’s other creative partners include Wavebreak Media, CGI Background, Danita Delimont, Archivision, Randy Olson, World Illustration, Pocstock, Freepik, Alamy, Envato, Depositphotos, SuperStock, StockPhotos, Getty Image Korea, Pichastock, Bridgeman Images, Airpano, Roger Viollet.
  - All training data is owned or licensed.
  - Getty Images (i) maintains model and property releases for images depicting persons and certain places (as necessary) included in the training set; or (ii) has contractual guarantees of the same for images from other content libraries.
  - The model was not trained off any data/images scraped from the internet, generated synthetically, or from outputs from other generator.
- The generator has been trained to not produce visuals that violate intellectual property or artist rights, including images of identifiable people, protected locations, trademarks or brands.
- Getty Images blocks both prompts and generations in an effort to avoid visuals being generated that would create legal risks or be considered offensive.
- This model is safe for commercial use. Safe for commercial use means that because the model was only trained with permissioned content, you may use the outputs for commercial purposes. Accordingly, Getty Images represents and warrants that necessary model and property releases have been obtained to avoid infringement of third-party intellectual property rights.
- Legal indemnification is included for all generations, without requiring that assets are reviewed and cleared by Getty Images. Different monetary levels of indemnification are offered based on the package purchased.
- Additionally, the model strives to promote people diversity and representation through the diversity inherent in the training dataset, as well as custom model design.
- The model is a custom architecture. It supports images up to a 4K resolution using super-resolution techniques.

Terms of Use

The intended use of the model is for commercially safe, photorealistic image generation for creation and ideation. Users of the model are expected to act responsibly and are subject to the terms and conditions expressed in the Getty Images Site Terms of Use, Getty Images Content License Agreement and the applicable AI Image Generation Subscription Agreement which prohibit illegal and certain other uses.

Third-Party Community Consideration

This model has been developed and built to Getty Images requirements for this application and use case; more information: https://www.gettyimages.com/ai/generation/about.

Fibo reference:
- Gutflaish, E., Kachlon, E., Zisman, H., Hacham, T., Sarid, N., Visheratin, A., … & Mokady, R. (2025). Generating an Image From 1,000 Words: Enhancing Text-to-Image With Structured Captions. arXiv preprint arXiv:2511.06876.

Training Details

Training objective: Flow matching
- Architecture Type: DiT (Diffusion transformer) , Variational Autoencoder
- Network Architecture: SmolLM3-3B Dimfusion (fusing intermediate LLM layers along the embedding dimension) + WAN 2.2 VAE

Inputs

Input Type(s): Text, Image
Input Format(s): Text: Raw Text, Image: JPG
Input Parameter(s): One Dimensional (1D)
Other Properties Related to Input: Max 250 words

Outputs

Output Type(s): Image
Output Format: Red, Green, Blue (RGB)
Output Parameter(s): Two-Dimensional (2D)
Other Properties Related to Output: Output Sizes (Configurable)- 1MP square: 1024x1024, 1MP 16:9: 1365x768, 1MP 9:16: 768x1365, 1MP 4:5: 916x1145, 1MP 5:4: 1145x916, 1MP 3:2: 1254x836, 1MP 2:3: 836x1254, 1MP 4:3: 1182x887, 1MP 3:4: 887x1182, 4MP square: 2048x2048, 4MP 16:9: 2731x1536, 4MP 9:16: 1536x2731, 4MP 4:5: 1832x2290, 4MP 5:4: 2290x1832, 4MP 3:2: 2508x1672, 4MP 2:3: 1672x2508, 4MP 4:3: 2365x1774, 4MP 3:4: 1774x2365

Software Integration

Supported Hardware Microarchitecture Compatibility: NVIDIA Ada Lovelace
Preferred/Supported Operating System(s): Linux

Training and Evaluation Datasets and Performance

Dataset: Licensed or owned high resolution photography, illustrations, and still images from Getty Images and Bria creative partners were used, paired with detailed descriptions for each asset. Descriptions and metadata attributes curated and crafted by Getty Images photographers and professional content editors are utilized.
Creator Compensation: Getty Images compensates contributors in an ongoing basis. This includes where contributors’ content is used as training data for AI. On an annual recurring basis, we will share in the revenues generated from the Generative AI by Getty Images with contributors whose content was used to train the AI Generator, allocating both a pro rata share in respect of every file and allocating a share based on traditional licensing revenue.
Fibo dataset: Image assets. Captions are sourced from each asset using a 3rd party vision-language model with specific instructions on which image attributes to highlight (See here for full description)
Quality: It especially excels at content that is commercially viable, photorealistic people, and compelling creative concepts.
Performance: The model achieves an average of 20-25 seconds to generate 4 images.

Inference

Engine: Tensor (RT)
Test Hardware: NVIDIA L40S

Limitations

People and object deformations: While the model addresses common issues in generative models, such as malformed limbs, hands, and disproportionate object sizes through careful design choices and custom loss functions, it can still occasionally produce images with malformed or disfigured human parts or objects.
Offensive: The model might create unrealistic and potentially offensive representations of humans by merging independent features learned during training. We attempt to block many of these instances through prompt blocking and output blocking.
Bias: While the model implements measures to generate more diverse representations of humans, the training dataset has some imbalances in the distribution of human attributes like gender and ethnicity in relation to occupational roles that can be biased towards such attributes. Our custom prompting and custom model design aims to combat these biases, but they may still occasionally arise.
Not safe for work: The model is supplemented by a language model which analyzes and filters text prompts, and an image filter that screens for inappropriate outputs. However, both these models can mistakenly filter “safe” prompts and images and may fail to filter unsafe prompts or images. This can arise from expertly designed adversarial input prompts or inherent limitations within the models.
Contemporary: The training data covers up to October 2023 and only includes descriptions in English.
Text: The model does not perform well at generating text in outputs.

Contact

Please send model questions and comments to api@gettyimages.com.

Getty Images Summary of Training Content Generative AI Download Images