Stars
Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)
Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, sparsely activated memory layers complement compute-heavy dense f…
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
Too Long, Didn't Watch: End-to-End Rolling Summarizer of Long Videos
On-device Speech Recognition for Apple Silicon
tikikun / f5-tts-mlx-quantized
Forked from lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
wallabag is a self-hostable application for saving web pages: Save and classify articles. Read them later. Freely.
FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.
On-device AI across mobile, embedded and edge for PyTorch
A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF into Markdown and JSON formats.
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.