Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HTML-to-text conversion performance of inscriptis compare to trafilatura? #89

Open
LeMoussel opened this issue Jan 6, 2025 · 1 comment

Comments

@LeMoussel
Copy link

Your blog post on various methods for converting HTML to text doesn't mention trafilatura.
How does the HTML-to-text conversion performance of inscriptis compare to that of trafilatura?

@AlbertWeichselbraun
Copy link
Contributor

AlbertWeichselbraun commented Jan 7, 2025

i haven't tested this when the post had been written, but I will add trafilatura to the inscriptis benchmarking suite and add an update to this bug, once i have any results.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants