Tencent AI Lab has made a new breakthrough in pose-guided person image synthesis


From: SLTechnology News&Howtos (Shulou.com). Updated 2026-02-15; reported 11/24.

CTOnews.com, Nov. 4: Tencent AI Lab recently introduced Progressive Conditional Diffusion Models (PCDMs), a method that marks a significant advance in pose-guided person image synthesis.

PCDMs consist of three key stages: a prior conditional diffusion model, an inpainting conditional diffusion model, and a refining conditional diffusion model. Together they address the pose inconsistency between the source image and the target image, as well as the difficulty of generating high-quality, realistic images.

On the DeepFashion and Market-1501 datasets, PCDMs score significantly better than other state-of-the-art (SOTA) methods. On the small-scale Market-1501 dataset (128 × 64 resolution), PCDMs reach the highest SSIM, 0.3169, which is 3.8% higher than the second-best method, PIDM.
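
For readers unfamiliar with the metric, SSIM (structural similarity) compares two images on luminance, contrast and structure, with 1.0 meaning identical. The sketch below is a simplified, single-window version computed over the whole image; standard SSIM implementations average the same formula over local sliding windows, so this is not the exact evaluation protocol behind the numbers above. The function name `global_ssim` and the random test images are illustrative assumptions.

```python
import numpy as np

def global_ssim(x, y, data_range=255.0):
    """Simplified SSIM using one window that covers the whole image.

    Assumption: standard SSIM averages this formula over local windows;
    this whole-image variant only illustrates what the metric measures.
    """
    x = np.asarray(x, dtype=np.float64)
    y = np.asarray(y, dtype=np.float64)
    # Stabilising constants from the usual SSIM definition.
    c1 = (0.01 * data_range) ** 2
    c2 = (0.03 * data_range) ** 2
    mu_x, mu_y = x.mean(), y.mean()
    var_x, var_y = x.var(), y.var()
    cov_xy = ((x - mu_x) * (y - mu_y)).mean()
    luminance = (2 * mu_x * mu_y + c1) / (mu_x ** 2 + mu_y ** 2 + c1)
    contrast_structure = (2 * cov_xy + c2) / (var_x + var_y + c2)
    return float(luminance * contrast_structure)

# Hypothetical 128 x 64 grayscale images (height x width).
rng = np.random.default_rng(0)
target = rng.integers(0, 256, size=(128, 64)).astype(np.float64)
noisy = np.clip(target + rng.normal(0, 25, size=target.shape), 0, 255)

score_same = global_ssim(target, target)   # identical images
score_noisy = global_ssim(target, noisy)   # degraded copy scores lower
```

A generated image that matches the ground-truth target more closely in structure yields a score nearer to 1.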

In the first stage, the prior conditional diffusion model takes the source image and the pose coordinates as conditions and uses a transformer network to predict the global features of the target image.
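
The article does not give training details for this stage, so the following is only a rough sketch of how a diffusion model over feature vectors is typically set up. It uses the standard DDPM forward process with a linear noise schedule, which is an assumption rather than the confirmed PCDMs configuration. The embedding size, the 18-keypoint pose and the helper names (`linear_alphas_cumprod`, `add_noise`, `build_condition`) are all hypothetical.

```python
import numpy as np

def linear_alphas_cumprod(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Cumulative product of (1 - beta_t) for a linear DDPM schedule (assumed)."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    return np.cumprod(1.0 - betas)

def add_noise(x0, t, alphas_cumprod, rng):
    """DDPM forward step: x_t = sqrt(a_t) * x0 + sqrt(1 - a_t) * noise."""
    noise = rng.standard_normal(x0.shape)
    a = alphas_cumprod[t]
    return np.sqrt(a) * x0 + np.sqrt(1.0 - a) * noise, noise

def build_condition(source_embedding, pose_keypoints):
    """Concatenate the source-image embedding with flattened (x, y) keypoints."""
    return np.concatenate([source_embedding, pose_keypoints.reshape(-1)])

rng = np.random.default_rng(0)
embed_dim, num_keypoints = 8, 18          # hypothetical sizes
source_embedding = rng.standard_normal(embed_dim)
target_embedding = rng.standard_normal(embed_dim)   # global feature to predict
pose = rng.uniform(0.0, 1.0, size=(num_keypoints, 2))

alphas_cumprod = linear_alphas_cumprod()
t = 500
noisy_target, noise = add_noise(target_embedding, t, alphas_cumprod, rng)
condition = build_condition(source_embedding, pose)
# A trained denoiser (not shown) would receive (noisy_target, t, condition)
# and be optimised to recover the clean target embedding.
```

At inference time the process runs in reverse: starting from pure noise, the denoiser is applied step by step under the same condition until a clean global feature remains.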

In the second stage, the inpainting conditional diffusion model builds on the global features from the first stage and establishes a dense correspondence between the source image and the target image. This ensures alignment across multiple dimensions (image, pose and feature), which is essential for realistic results.
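
One common way to cast pose transfer as an inpainting problem is to place the source image next to a blank region and ask the model to fill in that region, with the two poses laid out the same way. The sketch below only builds such inputs; the side-by-side layout, the array shapes and the function name `build_inpainting_inputs` are assumptions for illustration and are not confirmed by this article.

```python
import numpy as np

def build_inpainting_inputs(source_img, source_pose, target_pose):
    """Lay out image, mask and pose canvases side by side (assumed layout).

    source_img:  (H, W, C) source person image
    source_pose: (H, W, P) pose map rendered for the source image
    target_pose: (H, W, P) pose map rendered for the desired target pose
    """
    h, w, _ = source_img.shape
    blank = np.zeros_like(source_img)
    # Left half: known source image. Right half: region to be generated.
    image_canvas = np.concatenate([source_img, blank], axis=1)
    # Mask is 1 where the model must generate pixels, 0 where pixels are known.
    mask = np.concatenate([np.zeros((h, w, 1)), np.ones((h, w, 1))], axis=1)
    # Poses share the same layout, so each image half lines up with its pose.
    pose_canvas = np.concatenate([source_pose, target_pose], axis=1)
    return image_canvas, mask, pose_canvas

rng = np.random.default_rng(0)
H, W = 128, 64                                   # hypothetical resolution
source_img = rng.uniform(0.0, 1.0, size=(H, W, 3))
source_pose = rng.uniform(0.0, 1.0, size=(H, W, 1))
target_pose = rng.uniform(0.0, 1.0, size=(H, W, 1))

image_canvas, mask, pose_canvas = build_inpainting_inputs(
    source_img, source_pose, target_pose
)
```

Because the source pixels and the region to be generated share one canvas, the denoiser can attend across both halves, which is one way to obtain the image-level and pose-level alignment described above.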

In the third stage, once the previous stage has generated an initial coarse-grained target image, the refining conditional diffusion model improves its image quality and texture details.

This stage conditions on the coarse-grained image generated earlier to further improve image fidelity and keep textures consistent. Doing so involves modifying the first convolution layer and using an image encoder to extract features from the source image. A cross-attention mechanism then injects these texture features into the network to support texture repair and detail enhancement.
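
The two mechanisms mentioned here can be sketched in a few lines. Concatenating the coarse image with the noisy input along the channel axis is one plausible reason the first convolution layer needs to change, since it must then accept more input channels. Cross-attention lets each position in the network query the source-image texture features. The single-head formulation, the dimensions and the random projection weights below are illustrative assumptions, not the PCDMs implementation.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(queries, context, w_q, w_k, w_v):
    """Single-head cross-attention: network features attend to texture tokens."""
    q = queries @ w_q                      # (N, d)
    k = context @ w_k                      # (M, d)
    v = context @ w_v                      # (M, d_out)
    scores = q @ k.T / np.sqrt(q.shape[-1])
    weights = softmax(scores, axis=-1)     # each row sums to 1
    return weights @ v, weights

rng = np.random.default_rng(0)

# Channel concatenation: noisy input plus coarse image gives 6 channels,
# so a first conv layer built for 3 channels would have to be widened.
noisy_input = rng.standard_normal((128, 64, 3))
coarse_image = rng.standard_normal((128, 64, 3))
stacked = np.concatenate([noisy_input, coarse_image], axis=-1)

# Hypothetical sizes: 16 spatial positions, 10 texture tokens from an encoder.
features = rng.standard_normal((16, 32))
texture_tokens = rng.standard_normal((10, 24))
w_q = rng.standard_normal((32, 8))
w_k = rng.standard_normal((24, 8))
w_v = rng.standard_normal((24, 32))

attended, weights = cross_attention(features, texture_tokens, w_q, w_k, w_v)
refined = features + attended              # residual injection of texture detail
```

Each row of `weights` shows how strongly one spatial position draws on each texture token, which is how detail from the source image can reach the matching region of the output.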

Paper: https://arxiv.org/pdf/2310.06313.pdf

GitHub: https://github.com/muzishen/PCDMs
