HQ-I2IT: Redesign the optimization scheme to improve image quality in CycleGAN-based image translation systems | |
Zhang, Yipeng1,2,3 | |
Author department | Spectral Imaging Technology Laboratory |
Journal | IET Image Processing |
ISSN | 1751-9659; 1751-9667 |
Ownership ranking | 1 |
Abstract | The image-to-image translation (I2IT) task aims to transform images from the source domain into the specified target domain. State-of-the-art CycleGAN-based translation algorithms typically use cycle consistency loss and latent regression loss to constrain translation. In this work, it is demonstrated that the model parameters constrained by the cycle consistency loss and the latent regression loss are equivalent to optimizing the medians of the data distribution and the generative distribution. In addition, there is a style bias in the translation. This bias interacts between the generator and the style encoder and visually exhibits translation errors, e.g. the style of the generated image is not equal to the style of the reference image. To address these issues, a new I2IT model termed high-quality-I2IT (HQ-I2IT) is proposed. The optimization scheme is redesigned to prevent the model from optimizing the median of the data distribution. In addition, by separating the optimization of the generator and the latent code estimator, the redesigned model avoids error interactions and gradually corrects errors during training, thereby avoiding learning the median of the generated distribution. The experimental results demonstrate that the visual quality of the images produced by HQ-I2IT is significantly improved without changing the generator structure, especially when guided by the reference images. Specifically, the Fréchet inception distance on the AFHQ and CelebA-HQ datasets is reduced from 19.8 to 10.2 and from 23.8 to 17.0, respectively. © 2023 The Authors. IET Image Processing published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology. |
DOI | 10.1049/ipr2.12965 |
Indexed in | SCI ; EI |
Language | English |
WOS accession number | WOS:001087975200001 |
Publisher | John Wiley and Sons Inc |
EI accession number | 20234314951511 |
Document type | Journal article |
Item identifier | http://ir.opt.ac.cn/handle/181661/96863 |
Collection | Spectral Imaging Technology Laboratory |
Corresponding author | Wang, Quang |
Author affiliations | 1.Key Laboratory of Spectral Imaging Technology, Xi'an Institute of Optics and Precision Mechanics of the Chinese Academy of Sciences, Shaanxi, Xi'an, China; 2.The Key Laboratory of Biomedical Spectroscopy of Xi'an, Shaanxi, Xi'an, China; 3.School of Optoelectronics, University of Chinese Academy of Sciences, Beijing, China |
Recommended citation (GB/T 7714) | Zhang, Yipeng, Hu, Bingliang, Huang, Yingying, et al. HQ-I2IT: Redesign the optimization scheme to improve image quality in CycleGAN-based image translation systems[J]. IET Image Processing. |
APA | Zhang, Yipeng, Hu, Bingliang, Huang, Yingying, Gao, Chi, Yin, Jianfu, & Wang, Quang. |
MLA | Zhang, Yipeng, et al. "HQ-I2IT: Redesign the optimization scheme to improve image quality in CycleGAN-based image translation systems". IET Image Processing |
Files in this item | ||||||
File name/size | Document type | Version type | Access type | License | ||
HQ-I2IT Redesign th(5165KB) | Journal article | Published version | Restricted access | CC BY-NC-SA | |