We're thrilled to announce the release of whisper.net 1.7.4, featuring a host of improvements and new functionalities.
Key Features and Changes
- Custom Alignment Heads for DTW: Now, you can define custom alignment heads for Dynamic Time Warping (DTW), enhancing the accuracy of speech-to-text alignment in various scenarios.
- Upgrade to
Whisper.cpp 1.7.3
which included multiple updates and fixes:WithNoSpeechThreshold
Support: This allows for better handling of silent segments in audio, improving transcription accuracy.- Performance Enhancements: Significant improvements, especially on the Metal runtime for macOS, ensuring faster processing times.
- Library Split: The native libraries are now split into multiple files (
libggml-whisper
,libggml-base-whisper
, etc.), allowing for more modular use and easier updates.
- MAUI App Testing for iOS: Added tests for MAUI applications on iOS, ensuring better support for cross-platform development. 🚀
- Whisper Factory Options: Introduced new options at the WhisperFactory level for more granular control over configurations, previously managed through a singleton at the library level (RuntimeOptions). This improves flexibility and maintainability. Note: RuntimeOptions are still available but only to configure the way how the native runtime is being loaded.
- The continuous recognition example has been deprecated in favor of EchoSharp, which provides a more robust and up-to-date demonstration of continuous speech recognition capabilities.
- FIxed an issue where the native library couldn't be auto-loaded on systems that are not providing any command line arguments (like godot)
What's Changed
- Bumped version 1.7.3 by @sandrohanea in #292
- Bump FluentAssertions from 6.12.2 to 7.0.0 by @dependabot in #291
- Bump Microsoft.DotNet.XHarness.TestRunners.Xunit from 10.0.0-prerelease.24575.1 to 10.0.0-prerelease.24604.1 by @dependabot in #294
- Bump Microsoft.DotNet.XHarness.TestRunners.Xunit from 10.0.0-prerelease.24604.1 to 10.0.0-prerelease.24610.1 by @dependabot in #298
- Support setting custom alignment heads for dtw by @jettoblack in #301
- Added Whisper Factory Options by @sandrohanea in #302
- Added test reporter by @sandrohanea in #303
- Removed unnecessary instances and simplified LogProviders + RuntimeOptions by @sandrohanea in #304
- Bump xunit.runner.visualstudio from 2.8.2 to 3.0.0 by @dependabot in #306
- Bumped Whispercpp 1.7.3 that adds WithNoSpeechThreshold by @sandrohanea in #305
- Removed continuous recognition example and replaced it with a link the echosharp by @sandrohanea in #309
Full Changelog: 1.7.3...1.7.4