Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to reconstruct audio signal from the spectrogram (output of assemble_coeffs) #20

Open
binarygo opened this issue May 15, 2018 · 3 comments

Comments

@binarygo
Copy link

Hello

Thanks a lot for the nice package! I am looking for a "lossless way" to convert audio signal to spectrogram (in mel scale) and back to audio signal again. This seems to be a great solution.

However, after calling the assemble_coeffs() to construct spectrogram from CQT slices. How do I transform the spectrogram back to CQT slices (and thus backward() to audio signal again)? The overlap-add operation in the assemble_coeffs() seems to be irreversible.

Sorry for the dumb question, but it will be great if you could help on this. Thanks!

@jeremydeanw
Copy link

Hey Binarygo, did you ever figure out how to do this?

@grrrr
Copy link
Owner

grrrr commented Feb 21, 2019

Hi all, not sure whether i understand it right. Have you had a look at examples/transform_stream.py ?
It seems it does what you want to achieve - only the frequency scale needs to be modified.

@jeremydeanw
Copy link

Hi, dont mean to jack the thread. But to convert an audio file to a spectrogram, am I doing "python spectrogram.py myaudio.wav --output image.npz" ? And for the reconstruction I am to use transform_stream.py? Also how do I get an image out of the npz once it's created?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants