Skip to content

Commit

Permalink
Update all FYE-SR
Browse files Browse the repository at this point in the history
  • Loading branch information
hosiet committed Dec 24, 2024
1 parent 3b199e6 commit b75933f
Show file tree
Hide file tree
Showing 10 changed files with 104 additions and 6 deletions.
Binary file added assets/media/2024-fye-sr/fye-sr-fig1.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added assets/media/2024-fye-sr/fye-sr-fig15.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added assets/media/2024-fye-sr/fye-sr-fig16ab.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added assets/media/2024-fye-sr/fye-sr-fig16cd.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added assets/media/2024-fye-sr/fye-sr-fig2.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added assets/media/2024-fye-sr/fye-sr-fig8.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
4 changes: 4 additions & 0 deletions config/_default/menus.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -24,6 +24,10 @@ main:
weight: 52
- name: Projects
weight: 53
- name: Inference-time Computation in Generative AI
url: /projects/infer-time-generative-ai/
parent: Projects
weight: 75
- name: On-device AI
url: /projects/on-device-ai/
parent: Projects
Expand Down
16 changes: 16 additions & 0 deletions content/publication/2024-fye-sr/cite.bib
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@
@inproceedings{10.1145/3636534.3690698,
author = {Huang, Kai and Yin, Xiangyu and Gu, Tao and Gao, Wei},
title = {Perceptual-Centric Image Super-Resolution using Heterogeneous Processors on Mobile Devices},
year = {2024},
isbn = {9798400704895},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3636534.3690698},
doi = {10.1145/3636534.3690698},
booktitle = {Proceedings of the 30th Annual International Conference on Mobile Computing and Networking},
pages = {1361–1376},
numpages = {16},
keywords = {image super-resolution, perceptual quality, neural networks, heterogeneous computing, mobile devices},
location = {Washington D.C., DC, USA},
series = {ACM MobiCom '24}
}
6 changes: 0 additions & 6 deletions content/publication/2024-fye-sr/cite.bib.bk

This file was deleted.

84 changes: 84 additions & 0 deletions content/publication/2024-fye-sr/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -61,3 +61,87 @@ image:
slides:
---

## Background

Recent SOTA Image Super-Resolution (SR) techniques are mainly based on Neural
networks (NNs) that can better capture such non-linearity and
hence improve the image quality. However, NN-based SR models are
computationally expensive for mobile devices with limited computing power.
A better alternative is to involve specialized hardware AI
accelerators that have been readily available in mobile SoCs,
such as Neural Processing Units (NPUs), in addition to traditional
processors (e.g., CPU and GPU) for faster inference.
However, their use of fixed-point
arithmetic could result in low quality in upscaled images
when being applied to regression-based SR task.

To mitigate such image quality drop, existing schemes
split input images into small patches and dispatch these
patches to traditional processors and AI accelerators.
However, when upscaled patches
are re-stitched to form a complete image, such image-based
split of SR computations often leads to color mismatch and
visual inconsistency across image patches, as shown in the
figure below. This inconsistency may not impact the structural
image quality with a small portion of mismatching patches
, but can largely affect the human perception of images.

![Quality drop and visual inconsistency](2024-fye-sr/fye-sr-fig1.png)

## Overview

### Our Idea

Our work addresses the visual inconsistency
in upscaled images by introducing a new procedure-based
approach to splitting SR computations among heterogeneous
processors, as opposed to the traditional image-based split-
ting. As shown below, We split the SR model and adaptively
dispatch different NN layers of the SR model to heterogeneous\
processors, according to the computing complexity
of these NN layers and how SR computations in these layers
are affected by the reduced arithmetic precision. Our goal
is to maximize the utilization of AI accelerators within the
given time constraints on SR computations, while minimizing
their impact on perceptual image quality.

![FYE-SR basic idea](2024-fye-sr/fye-sr-fig2.png)

### System Design

![FYE-SR system overview](2024-fye-sr/fye-sr-fig8.png)

As shown in the figure above,
our design of FYE-SR consists
of three main modules. During the
offline phase, we first use a SR Timing Profiler to measure the
computing latencies of SR model’s different NN layers on
traditional processors (e.g., GPU) and AI accelerators (e.g.,
NPU), respectively. Then, knowledge about such latencies
will be used to train a Model Split Learner to solve Eq. (2) for
the optimal split of SR model.

During the online phase, FYE-SR enforces such model
split, and uses a Data Format Converter to convert the intermediate
feature maps into the right data formats (e.g.,
INT8 and FP32) for properly switching SR computations be-
tween heterogeneous processors.

## Results

As shown in the figures below,
compared to other SOTA image SR approaches,
our method could reach the overall optimal result
considering both the structual image quality and
perceptual quality, while meeting the preset deadline
requirement.

![FYE-SR comparison results](2024-fye-sr/fye-sr-fig15.png)

Looking into the output images, FYE-SR can effectively
suppress the distortions and visual inconsistency at
detailed objects (windows on buildings).

![FYE-SR comparison: GPU-only, NPU-only](2024-fye-sr/fye-sr-fig16ab.png)

![FYE-SR comparison: MobiSR, FYE-SR](2024-fye-sr/fye-sr-fig16cd.png)

0 comments on commit b75933f

Please sign in to comment.