peanutcocktail

93 repositories

devika
Public
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.
Python
•
MIT License
•2.5k•0•0•0•Updated Mar 8, 2025
wan
Public
JavaScript
•0•0•0•0•Updated Feb 26, 2025
Wan2.1
Public
Wan: Open and Advanced Large-Scale Video Generative Models
Python
•
Apache License 2.0
•805•0•0•0•Updated Feb 25, 2025
macOS-use
Public
We make Mac apps accessible for AI agents
JavaScript
•
MIT License
•1•0•0•0•Updated Feb 20, 2025
Zonos
Public
Python
•
Apache License 2.0
•616•1•0•0•Updated Feb 11, 2025
Ilaria-RVC
Public
Easily convert audio in Speech to Speech with built-in Text to Speech with RVC.
JavaScript
•1•1•0•0•Updated Feb 4, 2025
YuEGP
Public
YuE: Open Full-song Generation Foundation Model, something similar to Suno.ai but open
Python
•474•0•0•0•Updated Jan 29, 2025
Janus
Public
Janus-Series: Unified Multimodal Understanding and Generation Models
Python
•
MIT License
•2.2k•0•0•0•Updated Jan 28, 2025
Hunyuan3d-2-lowvram
Public
JavaScript
•6•1•0•0•Updated Jan 27, 2025
pippin
Public
The Digital Being Framework for Autonomous Agents
Python
•
MIT License
•182•0•0•0•Updated Jan 24, 2025
bu
Public
JavaScript
•0•0•0•0•Updated Jan 6, 2025
browser-use-webui
Public
Run AI Agent in your browser.
Python
•1.5k•0•0•0•Updated Jan 5, 2025
Text2midi
Public
Python
•
MIT License
•5•0•0•0•Updated Dec 28, 2024
StyleTTS2_Studio
Public
Build your own StyleTTS 2 Voice!
JavaScript
•2•1•0•0•Updated Dec 26, 2024
MMAudio
Public
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Python
•
MIT License
•143•0•0•0•Updated Dec 21, 2024
theme
Public
EJS
•0•0•0•0•Updated Dec 17, 2024
HunyuanVideoGP
Public
HunyuanVideo GP: Large Video Generation Model - GPU Poor version
Python
•
Other
•753•0•0•0•Updated Dec 11, 2024
mcpc
Public
mcpfoundation
JavaScript
•0•0•0•0•Updated Dec 1, 2024
mcpfoundation
Public
JavaScript
•0•0•0•0•Updated Dec 1, 2024
mcp
Public
JavaScript
•0•0•0•0•Updated Nov 28, 2024
OminiControl
Public
A minimal and universal controller for FLUX.1.
Python
•88•0•0•0•Updated Nov 27, 2024
qwen2vl-flux
Public
Python
•
MIT License
•30•1•0•0•Updated Nov 26, 2024
echomimic_v2
Public
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Python
•
Apache License 2.0
•370•0•0•0•Updated Nov 24, 2024
local
Public
HTML
•0•0•0•0•Updated Nov 24, 2024
JoyVASA
Public
Python
•
MIT License
•64•0•0•0•Updated Nov 20, 2024
RMBG-2-Studio
Public
Enhanced background remove and replace app built around BRIA-RMBG-2.0. Low VRAM/RAM | 6GB Install
Python
•9•0•0•0•Updated Nov 17, 2024
Finegrain-Image-Enhancer
Public
Creative Image Enhancer/Upscaler. Powered By Refiners. 8GB VRAM | 10GB Install
Python
•
Other
•4•1•0•0•Updated Nov 16, 2024
Pyramid-Flow
Public
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
Python
•
MIT License
•278•0•0•0•Updated Nov 16, 2024
F5-TTS
Public
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Python
•
MIT License
•1.4k•0•0•0•Updated Nov 10, 2024
hertz-dev
Public
first base model for full-duplex conversational audio
Python
•
Apache License 2.0
•114•0•0•0•Updated Nov 6, 2024