Add option to optionally skip downloading the contents of a file #79

benjamb · 2024-03-01T14:30:59Z

This PR does a couple of things:

1 . The first commit prevents the downloading of entire files into memory by streaming the response and reading small chunks as a time.
2. The second commit adds the configuration option to optionally skip downloading of a remote URLs content.

Closes #76.

benjamb · 2024-03-01T14:43:40Z

htmlproofer/plugin.py

+            # Download the entire contents as to not break previous behaviour.
+            for _ in response.iter_content(chunk_size=1024):
+                pass


If we don't actually care about downloading the contents (we seemingly don't do any verification of contents), then we could always just drop these lines, with stream=True set the body of the response isn't downloaded automatically.

We don't as long as the url status is correct.

Should I just simplify this MR to just skip the download entirely and drop the read_max_bytes configuration? I'm hesitant to break existing behaviour, so I'll list out a few possible routes this MR could go:

Keep this MR as it is, with read_max_bytes limiting the size of downloaded content.

Opt for a different configuration value, such as skip_downloads, which triggers this new behaviour of skipping the download of the response's body.

Don't bother adding any configurable behaviour and just establish a connection via stream=True and skip any downloads.
@manuzhang Which would be your preference?

I'd prefer option 2.

@manuzhang I've implemented option 2 and updated this MR, as well as the original issue title.

Closes #76.

benjamb commented Mar 1, 2024

View reviewed changes

benjamb changed the title ~~Add option to read max N bytes of a file~~ Add option to optionally skip downloading the contents of a file Mar 4, 2024

Ben Brown added 2 commits March 4, 2024 12:26

Stream response as to not read the entire contents into memory

8618272

Add config option to skip downloading the body of a response

08f0152

Closes #76.

manuzhang approved these changes Mar 4, 2024

View reviewed changes

manuzhang merged commit fce60c9 into manuzhang:main Mar 4, 2024
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add option to optionally skip downloading the contents of a file #79

Add option to optionally skip downloading the contents of a file #79

benjamb commented Mar 1, 2024 •

edited

Loading

benjamb Mar 1, 2024

manuzhang Mar 3, 2024

benjamb Mar 3, 2024 •

edited

Loading

manuzhang Mar 4, 2024

benjamb Mar 4, 2024

Add option to optionally skip downloading the contents of a file #79

Add option to optionally skip downloading the contents of a file #79

Conversation

benjamb commented Mar 1, 2024 • edited Loading

benjamb Mar 1, 2024

Choose a reason for hiding this comment

manuzhang Mar 3, 2024

Choose a reason for hiding this comment

benjamb Mar 3, 2024 • edited Loading

Choose a reason for hiding this comment

manuzhang Mar 4, 2024

Choose a reason for hiding this comment

benjamb Mar 4, 2024

Choose a reason for hiding this comment

benjamb commented Mar 1, 2024 •

edited

Loading

benjamb Mar 3, 2024 •

edited

Loading