Pinterest Unveils 'Canvas' AI for Enhanced Product Shot Backgrounds

July 15, 2024 at 6:12:53 AM

TL;DR Pinterest is developing an AI text-to-image generation process called Canvas to enhance product shot backgrounds without altering the main product. The system isolates background and foreground using a segmentation model to generate product masks. Canvas is trained on curated images to align with specific visual styles, allowing brands to create varied product visuals. It uses a latent diffusion model trained in-house.

Pinterest is developing an AI text-to-image generation process called "Canvas" to enhance product shot backgrounds without altering the main product. This system isolates the background and foreground using a segmentation model to generate product masks. The AI is trained on a curated set of images to align with specific visual styles, allowing brands to create varied and appealing product visuals.

Building Pinterest Canvas

Pinterest Canvas is a text-to-image model that supports arbitrary conditioning information in the form of product masks and conditioning images for stylistic guidance. The model is built as a latent diffusion model trained exclusively in-house at Pinterest. It operates in the latent space learned by a variational autoencoder (VAE). Text captions are encoded using both CLIP-ViT/L and OpenCLIP-ViT/G and are fed to a convolutional UNet via cross-attention to incorporate text conditioning information during the generation process.

During training, random caption-image pairs are sampled from the dataset. Each image is encoded into a latent representation by the VAE, and each caption is embedded using CLIP. Noise is added to each image latent, and the UNet is tasked with denoising the latent given the text embedding and timestep index. The training data is filtered for quality, trust, and safety, leaving over 1.5 billion high-quality text-image pairs.
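The noising step described above can be sketched in a few lines. This is a minimal illustration, not Pinterest's implementation: the linear beta schedule, the noise-prediction loss, and every function name here are assumptions borrowed from the standard DDPM formulation, and the four-element list merely stands in for a flattened VAE latent.

```python
import math
import random


def alpha_bar_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Cumulative product of (1 - beta_t) for an assumed linear beta schedule."""
    alpha_bars, running = [], 1.0
    for t in range(num_steps):
        beta = beta_start + (beta_end - beta_start) * t / (num_steps - 1)
        running *= 1.0 - beta
        alpha_bars.append(running)
    return alpha_bars


def add_noise(latent, t, alpha_bars, rng):
    """Return (noisy_latent, noise) for timestep index t."""
    signal = math.sqrt(alpha_bars[t])
    sigma = math.sqrt(1.0 - alpha_bars[t])
    noise = [rng.gauss(0.0, 1.0) for _ in latent]
    noisy = [signal * x + sigma * e for x, e in zip(latent, noise)]
    return noisy, noise


def denoising_loss(predicted_noise, noise):
    """Mean squared error between the predicted and the true noise."""
    return sum((p - e) ** 2 for p, e in zip(predicted_noise, noise)) / len(noise)


rng = random.Random(0)
alpha_bars = alpha_bar_schedule(1000)
latent = [0.5, -1.0, 0.25, 2.0]  # stand-in for a flattened VAE latent
t = rng.randrange(1000)          # random timestep index
noisy, noise = add_noise(latent, t, alpha_bars, rng)
perfect = denoising_loss(noise, noise)  # loss of a perfect noise prediction
```

In a real training loop the UNet would produce `predicted_noise` from the noisy latent, the text embedding, and `t`; here the true noise is passed in only to show that a perfect prediction gives zero loss.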

Fine-tuning for Background Generation

Pinterest Canvas is fine-tuned to perform specific visualization tasks such as inpainting. Fine-tuning proceeds in two stages:

  1. First Stage: Uses the same dataset as the base model and generates random masks for inpainting during training.
  2. Second Stage: Focuses on product images, using a segmentation model to generate product masks and incorporating more complete and detailed captions from a visual LLM. This stage involves training a LoRA on all UNet layers for rapid, parameter-efficient fine-tuning.
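To illustrate why LoRA counts as parameter-efficient, here is a rough sketch of the general technique: the pretrained weight W stays frozen and only two small matrices A and B are trained, with the layer computing W x + (alpha / r) * B (A x). The layer sizes, rank, and helper names below are illustrative assumptions, not details from Pinterest.

```python
def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def lora_forward(x, W, A, B, alpha):
    """y = W x + (alpha / r) * B (A x); W is frozen, only A and B are trained."""
    rank = len(A)  # A has shape (r, d_in), B has shape (d_out, r)
    base = matvec(W, x)
    update = matvec(B, matvec(A, x))
    return [b + (alpha / rank) * u for b, u in zip(base, update)]


def trainable_params(d_out, d_in, rank):
    """Number of trained LoRA parameters for one d_out x d_in layer."""
    return rank * (d_in + d_out)


W = [[1.0, 0.0], [0.0, 1.0]]  # frozen pretrained weight (identity for clarity)
x = [1.0, 2.0]
A = [[1.0, 1.0]]              # rank-1 down-projection
B_zero = [[0.0], [0.0]]       # common LoRA init: B = 0, so the layer starts unchanged
B_trained = [[1.0], [0.0]]    # pretend values after some training

y_start = lora_forward(x, W, A, B_zero, alpha=2.0)
y_tuned = lora_forward(x, W, A, B_trained, alpha=2.0)

# Illustrative sizes only: a 1280 x 1280 projection with rank 8.
full_count = 1280 * 1280
lora_count = trainable_params(1280, 1280, rank=8)
```

With these illustrative sizes the adapter trains 20,480 values instead of 1,638,400, which is the kind of saving that makes fine-tuning every UNet layer fast.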

The model can take a product Pin and generate a background according to a text prompt. The VAE is retrained to accept additional conditioning inputs to seamlessly blend the original and generated image content, ensuring pixel-perfect reconstructions of products. Multiple variations are generated, and the top k are selected using a reward model trained on human judgments.
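The selection step can be pictured with a small sketch. Two caveats: the article says the blending is done by a retrained VAE, so the hard mask compositing below is a simpler stand-in rather than the described method, and the reward scores are made-up numbers standing in for a learned reward model. All names are hypothetical.

```python
def composite(product, generated, mask):
    """Keep the product pixel where mask is 1, the generated pixel elsewhere."""
    return [
        [p if m == 1 else g for p, g, m in zip(p_row, g_row, m_row)]
        for p_row, g_row, m_row in zip(product, generated, mask)
    ]


def select_top_k(candidates, reward_fn, k):
    """Return the k candidates with the highest reward, best first."""
    return sorted(candidates, key=reward_fn, reverse=True)[:k]


# Toy 2x2 grayscale "images"; mask value 1 marks product pixels.
product = [[9, 9], [0, 0]]
mask = [[1, 1], [0, 0]]
backgrounds = {
    "beach": [[1, 1], [1, 1]],
    "studio": [[2, 2], [2, 2]],
    "forest": [[3, 3], [3, 3]],
}
# Hypothetical scores standing in for a learned reward model's output.
toy_scores = {"beach": 0.2, "studio": 0.9, "forest": 0.5}

best = select_top_k(list(backgrounds), toy_scores.get, k=2)
results = {name: composite(product, backgrounds[name], mask) for name in best}
```

The product pixels pass through untouched in every result, which is the property the article calls pixel-perfect reconstruction; only the background varies between candidates.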

Personalizing Results

To further enhance personalization, the model is augmented to condition on other images, using their style to guide the generation process. This is achieved by building on IP-Adapter, which processes additional image prompts within the diffusion UNet. These prompts are encoded into embeddings and passed alongside the text embeddings to new image-specific cross-attention layers.
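The idea of image-specific cross-attention layers can be sketched as follows. In the IP-Adapter approach, the same query attends separately to text tokens and to image tokens, and the two results are summed, with a scale factor controlling how strongly the image prompt steers the output. This is a single-query toy version that assumes keys and values are already projected; `image_scale` and the function names are illustrative.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, keys, values):
    """Scaled dot-product attention for a single query vector."""
    scale = 1.0 / math.sqrt(len(query))
    scores = [scale * sum(q * k for q, k in zip(query, key)) for key in keys]
    weights = softmax(scores)
    dim = len(values[0])
    return [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]


def decoupled_cross_attention(query, text_kv, image_kv, image_scale=1.0):
    """Sum of text attention and scaled image attention for a shared query."""
    text_out = attend(query, *text_kv)
    image_out = attend(query, *image_kv)
    return [t + image_scale * i for t, i in zip(text_out, image_out)]


query = [1.0, 0.0]
text_kv = ([[1.0, 0.0]], [[2.0, 3.0]])     # (keys, values) for one text token
image_kv = ([[0.0, 1.0]], [[10.0, 20.0]])  # (keys, values) for one image token

text_only = decoupled_cross_attention(query, text_kv, image_kv, image_scale=0.0)
blended = decoupled_cross_attention(query, text_kv, image_kv, image_scale=0.5)
```

Setting `image_scale` to zero recovers plain text-conditioned attention, so the style image acts as an adjustable addition rather than a replacement for the prompt.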

For personalization, stylistic context is supplied as concatenated embeddings from CLIP and Pinterest's internally developed Unified Visual Embedding (UVE). Several ways of collecting conditioning images have been explored, including using boards with strong styles and automatically mining style clusters. Using UVE generally leads to a stronger effect on the resulting generations.
