-
codecsimu Public
This tool helps augment PCM WAV files for the 8 kHz telephone scenario.
Python Updated Dec 20, 2024 -
StreamPlayer Public
Stream player, written in TypeScript, for PCM data received from a server
TypeScript Updated Dec 20, 2024 -
FunASR Public
Forked from modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit
-
espnet Public
Forked from espnet/espnet
End-to-End Speech Processing Toolkit
Python Apache License 2.0 Updated Nov 6, 2023 -
sherpa-onnx Public
Forked from k2-fsa/sherpa-onnx
Real-time speech recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++,…
C++ Apache License 2.0 Updated Oct 26, 2023 -
wenet Public
Forked from wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
C++ Apache License 2.0 Updated Aug 18, 2023 -
soundstorm-pytorch Public
Forked from lucidrains/soundstorm-pytorch
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
Python MIT License Updated Aug 18, 2023 -
SoundStorm Public
Forked from yangdongchao/SoundStorm
The reproduced code for Google's SoundStorm
Python Updated Aug 15, 2023 -
Speech-Resources Public
Forked from ddlBoJack/Speech-Resources
Speech-related labs, companies, resources, internships, and more; recommendations and self-nominations are welcome
Updated Aug 7, 2023 -
unimrcp Public
Forked from unispeech/unimrcp
Open source cross-platform implementation of MRCP protocol
C Apache License 2.0 Updated Aug 1, 2023 -
AcademiCodec Public
Forked from yangdongchao/AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
Python Updated Aug 1, 2023 -
Recorder Public
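Forked from xiangyuecn/Recorder
HTML5 JS audio recording in mp3, wav, ogg, webm, and amr formats; supports PC and some Android and iOS browsers, Hybrid Apps (Android and iOS app source code provided), and WeChat; provides ASR speech-to-text, an H5 voice call/chat example, and DTMF encoding/decoding
JavaScript MIT License Updated May 7, 2023 -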
piper Public
Forked from rhasspy/piper
A fast, local neural text to speech system
C++ MIT License Updated May 5, 2023 -
websocketpp Public
Forked from zaphoyd/websocketpp
C++ websocket client/server library
C++ Other Updated Apr 17, 2023 -
rasa Public
Forked from RasaHQ/rasa
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
Python Apache License 2.0 Updated Apr 13, 2023 -
vall-e Public
Forked from lifeiteng/vall-e
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html
-
LPCNet Public
Forked from xiph/LPCNet
Efficient neural speech synthesis
C BSD 3-Clause "New" or "Revised" License Updated Feb 21, 2023 -
PyTorch_Speaker_Verification Public
Forked from HarryVolek/PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
Python BSD 3-Clause "New" or "Revised" License Updated Feb 5, 2020