Category: Study

5 Posts

[Reading Notes] Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization
Source Paper: [ICCV'2017] Authors: Xun Huang, Serge Belongie Code: Contributions In this paper, the authors present a simple yet effective approach that for the first time enables arbitrary style transfer in real-time. Arbitrary style transfer: takes a content image $C$ and an arbitrary style image $S$ as inputs, and synthesizes an output image with the same content as $C$ and the same syle as $S$. Background Batch Normalization Given a input batch $x \in \mathbb{R}^{N \times C \times H \times W}$, batch normalization (BN) normalizes the mean and standard deviation for each individual feature channel: $$ \mathrm{BN}(x)=\gamma\left(\frac{x-\mu(x)}{\sigma(x)}\right)+\beta $$ where $\gamma , \beta \in \mathbb{R}^{C}$ are affine parameters learned from data. $\mu(x) , \sigma(x) \in \mathbb{R}^{C}$ are mean and standard deviation computed across batch size and spatial dimensions, independently. $$ \mu_{c}(x)=\frac{1}{N H W} \sum_{n=1}^{N} \sum_{h=1}^{H} \sum_{w=1}^{W} x_{n c h w} $$ $$ \sigma_{c}(x)=\sqrt{\frac{1}{N H W} \sum_{n=1}^{N} \sum_{h=1}^{H} \sum_{w=1}^{W}\left(x_{n c h w}-\mu_{c}(x)\right)^{2}+\epsilon} $$ Instance Normalization Original feed-forward stylization method [51] utilizes BN layers after the convolutional layer. Ulyanov et al. [52] found using Instance Normalization…
[Reading Notes] Collaborative Distillation for Ultra-Resolution Universal Style Transfer
Source Authors: Huan Wang, Yijun Li, Yuehai Wang, Haoji Hu, Ming-Hsuan YangPaper: [CVPR2020] Contributions It proposes a new knowledge distillation method "Collobrative Distillation" based on the exclusive collaborative relation between the encoder and its decoder. It proposes to restrict the students to learn linear embedding of the teacher's outputs, which boosts its learning. Experimetenal works are done with different stylization frameworks, like WCT and AdaIN. Related Works Style Transfer WCT: Li, Y., Fang, C., Yang, J., Wang, Z., Lu, X., & Yang, M. H. (2017). Universal style transfer via feature transforms. arXiv preprint arXiv:1705.08086.AdaIN: Huang, X., & Belongie, S. (2017). Arbitrary style transfer in real-time with adaptive instance normalization. In Proceedings of the IEEE International Conference on Computer Vision (pp. 1501-1510). Model Compression low-rank decomposition pruning quantization knowledge distillationKnowledge distillation is a promising model compression method by transferring the knowledge of large networks (called teacher) to small networks (called student), where the knowledge can be softened probability (which can reflect the inherent class similarity structure known as dark knowledge) or sample relations (which…
YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design
Paper Information Paper: YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design Authors: Yuxuan Cai, Hongjia Li, Geng Yuan, Wei Niu, Yanyu Li, Xulong Tang, Bin Ren, Yanzhi Wang Paper: Github: Objective: Real-time object detection for mobile devices. Study notes and presentation: Download: This is an embedded Microsoft Office presentation, powered by Office.
A Image Quality Evaluation Toolbox For MATLIB
MeTriX MuX Visual Quality Assessment Package The name of the package is MeTriX MuX Visual Quality Assessment Package. The official website does not work (attempt on 08-Sep-2019), and the download link is invaild. I find a copy that is from a github repository Installation It can be easily followed at the index.html file. It is very simple, just run the command ">> configure_metrix_mux" in matlab. For "VIF" method there is a bug that need to change a function name from "_m"to "_M". Content It is a powerful toolbox that contains implementation of many evaluation algorithms: 'MSE': mean-squared-error 'PSNR': peak signal-to-noise-ratio 'SSIM': structural similarity index 'MSSIM': multi-scale SSIM index 'VSNR': visual signal-to-noise ratio 'VIF': visual information fidelity 'VIFP': pixel-based VIF 'UQI': universal quality index 'IFC': information fidelity criterion 'NQM': noise quality measure 'WSNR': weighted signal-to-noise ratio 'SNR': signal-to-noise ratio Usage distorted_ssim_index = metrix_mux( reference_image, distorted_image, 'SSIM')