Skip to content

Latest commit

 

History

History
52 lines (52 loc) · 1.45 KB

README.md

File metadata and controls

52 lines (52 loc) · 1.45 KB

Introuduction

Impelmentation Phase-aware Speech Enhnacement Deep Complex UNet
This is convolution neural networks model for Speech Enhancement
Papers URL

  1. Phase-aware Speech Enhancement Deep Complex UNet - openreview
  2. Phase-aware Speech Enhancement Deep Complex UNet - arxiv

Architecture

archi

Warning

Don't use DCUnet16 and DCUnet20 via ComplexBatchNormalization
You can use only Naive_DCUnet series
(In the latest version of TensorFlow, the functionality for the above layer has been removed)

Requirements

Python >= 3.6.9 (3.7.6)  
numpy  
scipy  
librosa == 0.7.2  (with numba 0.48.0)  
Tensorflow == 2.1.0

Folder

Directory
./Deep_Complex_UNet
    /datasets
        /train_noisy
        /train_clean
        /test_noisy
        /test_clean
    /model_save
    /model_pred
    /complex_layers
        __init__.py
        stft.py
        layer.py
        activation.py
        normaliztion.py
        dcunet.py
    model_module.py
    model_loss.py
    model_data.py
    model_test.py
    model_train.py

Usage

--model naive_dcunet16 naive_dcunet20 dcunet16 dcunet20
--loss SDR or wSDR  
--save "save file name"  
python model_train.py --model dcunet20 --loss wSDR --save dcunet20_default --optim adam --lr 0.002
python model_train.py --model dcunet16 --loss wSDR --save dcunet16_default --optim adam --lr 0.002