OPT OpenIR  > 光谱成像技术研究室
HQ-I2IT: Redesign the optimization scheme to improve image quality in CycleGAN-based image translation systems
Zhang, Yipeng1,2,3; Hu, Bingliang1,2; Huang, Yingying1,2,3; Gao, Chi1,2,3; Yin, Jianfu1,2,3; Wang, Quang1,2
作者部门光谱成像技术研究室
发表期刊IET Image Processing
ISSN17519659;17519667
产权排序1
摘要

The image-to-image translation (I2IT) task aims to transform images from the source domain into the specified target domain. State-of-the-art CycleGAN-based translation algorithms typically use cycle consistency loss and latent regression loss to constrain translation. In this work, it is demonstrated that the model parameters constrained by the cycle consistency loss and the latent regression loss are equivalent to optimizing the medians of the data distribution and the generative distribution. In addition, there is a style bias in the translation. This bias interacts between the generator and the style encoder and visually exhibits translation errors, e.g. the style of the generated image is not equal to the style of the reference image. To address these issues, a new I2IT model termed high-quality-I2IT (HQ-I2IT) is proposed. The optimization scheme is redesigned to prevent the model from optimizing the median of the data distribution. In addition, by separating the optimization of the generator and the latent code estimator, the redesigned model avoids error interactions and gradually corrects errors during training, thereby avoiding learning the median of the generated distribution. The experimental results demonstrate that the visual quality of the images produced by HQ-I2IT is significantly improved without changing the generator structure, especially when guided by the reference images. Specifically, the Fréchet inception distance on the AFHQ and CelebA-HQ datasets are reduced from 19.8 to 10.2 and from 23.8 to 17.0, respectively. © 2023 The Authors. IET Image Processing published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology.

DOI10.1049/ipr2.12965
收录类别SCI ; EI
语种英语
WOS记录号WOS:001087975200001
出版者John Wiley and Sons Inc
EI入藏号20234314951511
引用统计
被引频次:1[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符http://ir.opt.ac.cn/handle/181661/96863
专题光谱成像技术研究室
通讯作者Wang, Quang
作者单位1.Key Laboratory of Spectral Imaging Technology, Xi'an Institute of Optics and Precision Mechanics of the Chinese Academy of Sciences, Shaanxi, Xi'an, China;
2.The Key Laboratory of Biomedical Spectroscopy of Xi'an, Shaanxi, Xi'an, China;
3.School of Optoelectronics, University of Chinese Academy of Sciences, Beijing, China
推荐引用方式
GB/T 7714
Zhang, Yipeng,Hu, Bingliang,Huang, Yingying,et al. HQ-I2IT: Redesign the optimization scheme to improve image quality in CycleGAN-based image translation systems[J]. IET Image Processing.
APA Zhang, Yipeng,Hu, Bingliang,Huang, Yingying,Gao, Chi,Yin, Jianfu,&Wang, Quang.
MLA Zhang, Yipeng,et al."HQ-I2IT: Redesign the optimization scheme to improve image quality in CycleGAN-based image translation systems".IET Image Processing
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
HQ-I2IT Redesign th(5165KB)期刊论文出版稿限制开放CC BY-NC-SA请求全文
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Zhang, Yipeng]的文章
[Hu, Bingliang]的文章
[Huang, Yingying]的文章
百度学术
百度学术中相似的文章
[Zhang, Yipeng]的文章
[Hu, Bingliang]的文章
[Huang, Yingying]的文章
必应学术
必应学术中相似的文章
[Zhang, Yipeng]的文章
[Hu, Bingliang]的文章
[Huang, Yingying]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。