Skip to content
View zhaomingwork's full-sized avatar

Block or report zhaomingwork

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • codecsimu Public

    This tool help to augment PCM wavs for telephone 8k senario.

    Python Updated Dec 20, 2024
  • stream player for pcm data received from server in typescript

    TypeScript Updated Dec 20, 2024
  • FunASR Public

    Forked from modelscope/FunASR

    A Fundamental End-to-End Speech Recognition Toolkit

    Python 1 Other Updated Oct 24, 2024
  • zvad Public

    VAD wrapper in C for most popular vad models, such as Silero

    C++ Updated Sep 22, 2024
  • espnet Public

    Forked from espnet/espnet

    End-to-End Speech Processing Toolkit

    Python Apache License 2.0 Updated Nov 6, 2023
  • sherpa-onnx Public

    Forked from k2-fsa/sherpa-onnx

    Real-time speech recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++,…

    C++ Apache License 2.0 Updated Oct 26, 2023
  • linux Public

    Forked from torvalds/linux

    Linux kernel source tree

    C Other Updated Sep 26, 2023
  • wenet Public

    Forked from wenet-e2e/wenet

    Production First and Production Ready End-to-End Speech Recognition Toolkit

    C++ Apache License 2.0 Updated Aug 18, 2023
  • Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

    Python MIT License Updated Aug 18, 2023
  • The reproduced code for Google's SoundStorm

    Python Updated Aug 15, 2023
  • 语音方向实验室/公司/资源/实习等,欢迎推荐或自荐

    Updated Aug 7, 2023
  • unimrcp Public

    Forked from unispeech/unimrcp

    Open source cross-platform implementation of MRCP protocol

    C Apache License 2.0 Updated Aug 1, 2023
  • AcademiCodec: An Open Source Audio Codec Model for Academic Research

    Python Updated Aug 1, 2023
  • Updated May 10, 2023
  • Recorder Public

    Forked from xiangyuecn/Recorder

    html5 js 录音 mp3 wav ogg webm amr 格式,支持pc和Android、iOS部分浏览器、Hybrid App(提供Android iOS App源码)、微信,提供ASR语音识别转文字 H5版语音通话聊天示例 DTMF编码解码

    JavaScript MIT License Updated May 7, 2023
  • piper Public

    Forked from rhasspy/piper

    A fast, local neural text to speech system

    C++ MIT License Updated May 5, 2023
  • websocketpp Public

    Forked from zaphoyd/websocketpp

    C++ websocket client/server library

    C++ Other Updated Apr 17, 2023
  • rasa Public

    Forked from RasaHQ/rasa

    💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

    Python Apache License 2.0 Updated Apr 13, 2023
  • vall-e Public

    Forked from lifeiteng/vall-e

    PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

    Python 2 Apache License 2.0 Updated Mar 31, 2023
  • LPCNet Public

    Forked from xiph/LPCNet

    Efficient neural speech synthesis

    C BSD 3-Clause "New" or "Revised" License Updated Feb 21, 2023
  • PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

    Python BSD 3-Clause "New" or "Revised" License Updated Feb 5, 2020