Skip to content
Change the repository type filter

All

    Repositories list

    • csm

      Public
      A Conversational Speech Generation Model
      Apache License 2.0
      1354.5k180Updated Feb 26, 2025Feb 26, 2025
    • sglang

      Public
      SGLang is a fast serving framework for large language models and vision language models.
      Python
      Apache License 2.0
      1.2k500Updated Feb 18, 2025Feb 18, 2025
    • wavtools

      Public
      Record and stream WAV audio data in the browser across all platforms
      JavaScript
      MIT License
      12800Updated Jan 28, 2025Jan 28, 2025
    • moshi

      Public
      Python
      Apache License 2.0
      615300Updated Jan 8, 2025Jan 8, 2025
    • whisperX

      Public
      WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
      Python
      BSD 2-Clause "Simplified" License
      1.5k1800Updated Oct 25, 2024Oct 25, 2024
    • Faster Whisper with additional features
      Python
      MIT License
      1.2k901Updated Oct 25, 2024Oct 25, 2024
    • Silero VAD: pre-trained enterprise-grade Voice Activity Detector
      Python
      MIT License
      506200Updated Jun 27, 2024Jun 27, 2024
    • gpt-fast

      Public
      Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
      Python
      BSD 3-Clause "New" or "Revised" License
      536500Updated Apr 30, 2024Apr 30, 2024