MSc Thesis: An attention-based image denoising network leveraging information of both spatial and frequency domain

Description

Image denoising task, in which a clean image is recovered from a noise observation, is a classical inverse problem and still active topic in low-level vision since it is an indispensable step in many practical applications. In past decades, a large variety of image denoising methods employing neural networks were proposed to solve this inverse problem. These methods are major in the spatial domain which denoise the image straightforwardly by extracting the spatial information using the sliding CNN window. On the other hand, image denoising from the frequency domain also has a long history in which the low-pass filter techniques are applied and the noise (major in high-frequency space) can be filtered out. The denoising optimization in the frequency domain is frequently utilized in medical image reconstruction. Recently a great number of frequency networks with CNNs are used in this field to improve the reconstruction quality. However, this approach is rarely studied in the nature denoising field with RGB images. More recently, the advent of Transformers evolutes the computer vision field. Its inherent attributes like larger receptive field and lower inductive bias facilitate the image denoising tasks. In this work, we attempt to introduce a hybrid image denoising network that optimizes the noise image from both spatial and frequency domain. A comparison with SOTA methods on public datasets like urban100 or SIDD will be conducted in the end.

To accomplish this work successfully, we expect you have:

  • Most important: solid coding skills and familiarity with Pytorch and Numpy
  • Knowledge of the state-of-the-art image denoising methods
  • Knowledge of the state-of-the-art CNN and Transformer approaches
  • Knowledge of Fourier transformation and frequency optimization methods/network
  • Independent work spirit of finding, research reading and solving a research problem

What we offer

  • An exciting research project with many possibilities to bring in your own ideas.
  • Close supervision and access to state-of-the-art computer hardware.
  • The chance to work in a team of experts in image processing, deep learning, biomedical engineering and medicine.

How to apply:

Just send an email to jiazhen.pan@tum.de, with a short CV and your grade report. We promise to get back to you within days.

Jiazhen Pan
Jiazhen Pan
PhD Student

My main research interests lie in medical imaging computing, semantic segmentation and flow estimation