Hi everyone! I'm Yougen Yuan. Glad to have you here.
- Speech Processing
  - Speech Keyword Retrieval / Spotting / Search
  - Zero-shot Text-to-Speech Generation / Voice Conversion / End-to-End Speech Interaction
  - Speech Recognition
  - Audio Scene Classification
  - Spoken Language Identification
- Multimodal Analysis
  - Language-Image Fusion
  - Contrastive Language-Image Pre-Training
  - Multimodal with LLMs
- Clustering
- Audio / Visual / Text Similarity
- Audio / Visual / Text Deep Hashing
- SinglePass / HDBSCAN Clustering
If you are interested in my work, feel free to reach out to me at [email protected].