Skip to content

v1.7.4

Latest
Compare
Choose a tag to compare
@sandrohanea sandrohanea released this 26 Dec 20:57
· 6 commits to main since this release
a3ec2f8

We're thrilled to announce the release of whisper.net 1.7.4, featuring a host of improvements and new functionalities.

Key Features and Changes

  • Custom Alignment Heads for DTW: Now, you can define custom alignment heads for Dynamic Time Warping (DTW), enhancing the accuracy of speech-to-text alignment in various scenarios.
  • Upgrade to Whisper.cpp 1.7.3 which included multiple updates and fixes:
    • WithNoSpeechThreshold Support: This allows for better handling of silent segments in audio, improving transcription accuracy.
    • Performance Enhancements: Significant improvements, especially on the Metal runtime for macOS, ensuring faster processing times.
    • Library Split: The native libraries are now split into multiple files (libggml-whisper, libggml-base-whisper, etc.), allowing for more modular use and easier updates.
  • MAUI App Testing for iOS: Added tests for MAUI applications on iOS, ensuring better support for cross-platform development. 🚀
  • Whisper Factory Options: Introduced new options at the WhisperFactory level for more granular control over configurations, previously managed through a singleton at the library level (RuntimeOptions). This improves flexibility and maintainability. Note: RuntimeOptions are still available but only to configure the way how the native runtime is being loaded.
  • The continuous recognition example has been deprecated in favor of EchoSharp, which provides a more robust and up-to-date demonstration of continuous speech recognition capabilities.
  • FIxed an issue where the native library couldn't be auto-loaded on systems that are not providing any command line arguments (like godot)

What's Changed

  • Bumped version 1.7.3 by @sandrohanea in #292
  • Bump FluentAssertions from 6.12.2 to 7.0.0 by @dependabot in #291
  • Bump Microsoft.DotNet.XHarness.TestRunners.Xunit from 10.0.0-prerelease.24575.1 to 10.0.0-prerelease.24604.1 by @dependabot in #294
  • Bump Microsoft.DotNet.XHarness.TestRunners.Xunit from 10.0.0-prerelease.24604.1 to 10.0.0-prerelease.24610.1 by @dependabot in #298
  • Support setting custom alignment heads for dtw by @jettoblack in #301
  • Added Whisper Factory Options by @sandrohanea in #302
  • Added test reporter by @sandrohanea in #303
  • Removed unnecessary instances and simplified LogProviders + RuntimeOptions by @sandrohanea in #304
  • Bump xunit.runner.visualstudio from 2.8.2 to 3.0.0 by @dependabot in #306
  • Bumped Whispercpp 1.7.3 that adds WithNoSpeechThreshold by @sandrohanea in #305
  • Removed continuous recognition example and replaced it with a link the echosharp by @sandrohanea in #309

Full Changelog: 1.7.3...1.7.4