    Repositories list

    • devika

      Public
      Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.
      Python
      MIT License
      2.5k000 Updated Mar 8, 2025
    • wan

      Public
      JavaScript
      0000 Updated Feb 26, 2025
    • Wan2.1

      Public
      Wan: Open and Advanced Large-Scale Video Generative Models
      Python
      Apache License 2.0
      805000 Updated Feb 25, 2025
    • macOS-use

      Public
      We make Mac apps accessible for AI agents
      JavaScript
      MIT License
      1000 Updated Feb 20, 2025
    • Zonos

      Public
      Python
      Apache License 2.0
      616100 Updated Feb 11, 2025
    • Easily convert audio in Speech to Speech with built-in Text to Speech with RVC.
      JavaScript
      1100 Updated Feb 4, 2025
    • YuEGP

      Public
      YuE: Open Full-song Generation Foundation Model, something similar to Suno.ai but open
      Python
      474000 Updated Jan 29, 2025
    • Janus

      Public
      Janus-Series: Unified Multimodal Understanding and Generation Models
      Python
      MIT License
      2.2k000 Updated Jan 28, 2025
    • JavaScript
      6100 Updated Jan 27, 2025
    • pippin

      Public
      The Digital Being Framework for Autonomous Agents
      Python
      MIT License
      182000 Updated Jan 24, 2025
    • bu

      Public
      JavaScript
      0000 Updated Jan 6, 2025
    • Run AI Agent in your browser.
      Python
      1.5k000 Updated Jan 5, 2025
    • Text2midi

      Public
      Python
      MIT License
      5000 Updated Dec 28, 2024
    • Build your own StyleTTS 2 Voice!
      JavaScript
      2100 Updated Dec 26, 2024
    • MMAudio

      Public
      [arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
      Python
      MIT License
      143000 Updated Dec 21, 2024
    • theme

      Public
      EJS
      0000 Updated Dec 17, 2024
    • HunyuanVideo GP: Large Video Generation Model - GPU Poor version
      Python
      Other
      753000 Updated Dec 11, 2024
    • mcpc

      Public
      JavaScript
      0000 Updated Dec 1, 2024
    • JavaScript
      0000 Updated Dec 1, 2024
    • mcp

      Public
      JavaScript
      0000 Updated Nov 28, 2024
    • A minimal and universal controller for FLUX.1.
      Python
      88000 Updated Nov 27, 2024
    • Python
      MIT License
      30100 Updated Nov 26, 2024
    • EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
      Python
      Apache License 2.0
      370000 Updated Nov 24, 2024
    • local

      Public
      HTML
      0000 Updated Nov 24, 2024
    • JoyVASA

      Public
      Python
      MIT License
      64000 Updated Nov 20, 2024
    • Enhanced background remove and replace app built around BRIA-RMBG-2.0. Low VRAM/RAM | 6GB Install
      Python
      9000 Updated Nov 17, 2024
    • Creative Image Enhancer/Upscaler. Powered By Refiners. 8GB VRAM | 10GB Install
      Python
      Other
      4100 Updated Nov 16, 2024
    • Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
      Python
      MIT License
      278000 Updated Nov 16, 2024
    • F5-TTS

      Public
      Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
      Python
      MIT License
      1.4k000 Updated Nov 10, 2024
    • hertz-dev

      Public
      first base model for full-duplex conversational audio
      Python
      Apache License 2.0
      114000 Updated Nov 6, 2024