gh-50333: Deprecate support of non-tuple sequences in PyArg_ParseTuple() #128374

serhiy-storchaka · 2024-12-31T13:40:54Z

Non-tuple sequences are deprecated as argument for the "(items)" format unit in PyArg_ParseTuple() and other argument parsing functions if items contains format units which store borrowed buffer or reference (e.g. "s" and "O").

str and bytearray are no longer accepted as valid sequences.

Issue: Reference counting bug in PyArg_ParseTuple and PyArg_ParseTupleAndKeywords #50333

📚 Documentation preview 📚: https://cpython-previews--128374.org.readthedocs.build/

…seTuple() Non-tuple sequences are deprecated as argument for the "(items)" format unit in PyArg_ParseTuple() and other argument parsing functions if items contains format units which store borrowed buffer or reference (e.g. "s" and "O"). str and bytearray are no longer accepted as valid sequences.

Lib/test/test_capi/test_getargs.py

picnixz

While I understand that "borrowed buffer or reference" reads as "borrowed buffer or borrowed reference", I would advise repeating "borrowed reference" as well.

I haven't looked at te implementation though.

Doc/c-api/arg.rst

Doc/whatsnew/3.14.rst

Misc/NEWS.d/next/C_API/2024-12-31-15-28-14.gh-issue-50333.KxQUXa.rst

Co-authored-by: Bénédikt Tran <[email protected]> Co-authored-by: Stan Ulbrych <[email protected]>

erlend-aasland · 2025-01-02T13:20:41Z

While I understand that "borrowed buffer or reference" reads as "borrowed buffer or borrowed reference", I would advise repeating "borrowed reference" as well.

In this case, I think we should consider being explicit, rather than worrying about the repeated word.

Python/getargs.c

erlend-aasland · 2025-01-02T13:38:53Z

Python/getargs.c

        levels[0] = 0;
        PyOS_snprintf(msgbuf, bufsize,
-                      "must be %d-item sequence, not %.50s",
+                      "must be %d-item tuple, not %.50s",


snprintf will keep the string within bufsize; we should not need to use the sized specifier.

Suggested change

"must be %d-item tuple, not %.50s",

"must be %d-item tuple, not %s",

This is for consistency with other formats. Also, if we will add more text after the type name, it is easier to not forget to truncate.

No worry; snprintf will truncate.

Python/getargs.c

erlend-aasland · 2025-01-02T14:01:58Z

Python/getargs.c

+    if (PyTuple_Check(arg)) {
+        Py_INCREF(arg);
+    }


We can easily avoid the unneeded incref/decref if we refactor out the convertitem loop. I'll make a suggestion to your fork for your consideration.

Suggestion opened:

Suggestion: refactor to avoid incref/decref serhiy-storchaka/cpython#23

I thought about this, but I am not sure that there is a significant benefit in this. It would complicate the code, for sure. So I left the simpler code.

So I left the simpler code.

However, the "simpler" code is more complex with regards to reference counting. IMO, keeping the ref counting simple is worth it; ref count bugs are hard to catch. But it is your call. BTW, I addressed your remarks on your fork.

Co-authored-by: Erlend E. Aasland <[email protected]>

serhiy-storchaka added the topic-C-API label Dec 31, 2024

bedevere-app bot mentioned this pull request Dec 31, 2024

Reference counting bug in PyArg_ParseTuple and PyArg_ParseTupleAndKeywords #50333

Open

bedevere-app bot added the awaiting core review label Dec 31, 2024

serhiy-storchaka mentioned this pull request Dec 31, 2024

Deprecating support for nested non-tuple sequences in PyArg_ParseTuple() capi-workgroup/decisions#52

Open

6 tasks

StanFromIreland reviewed Dec 31, 2024

View reviewed changes

Lib/test/test_capi/test_getargs.py Outdated Show resolved Hide resolved

picnixz reviewed Dec 31, 2024

View reviewed changes

Apply suggestions from code review

9878b37

Co-authored-by: Bénédikt Tran <[email protected]> Co-authored-by: Stan Ulbrych <[email protected]>

serhiy-storchaka added 2 commits January 2, 2025 15:27

Merge branch 'main' into pyarg-deprecate-sequences

f8bc0e0

Duplicate "a borrowed".

e7198bc

erlend-aasland reviewed Jan 2, 2025

View reviewed changes

serhiy-storchaka and others added 3 commits January 2, 2025 16:05

Re-order format units.

fac1989

Update Python/getargs.c

ab1ea18

Co-authored-by: Erlend E. Aasland <[email protected]>

Update docs.

44fd700

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gh-50333: Deprecate support of non-tuple sequences in PyArg_ParseTuple() #128374

gh-50333: Deprecate support of non-tuple sequences in PyArg_ParseTuple() #128374

serhiy-storchaka commented Dec 31, 2024 •

edited by github-actions bot

Loading

picnixz left a comment •

edited

Loading

erlend-aasland commented Jan 2, 2025

erlend-aasland Jan 2, 2025

serhiy-storchaka Jan 2, 2025

erlend-aasland Jan 2, 2025

erlend-aasland Jan 2, 2025

erlend-aasland Jan 2, 2025

serhiy-storchaka Jan 2, 2025

erlend-aasland Jan 2, 2025

	"must be %d-item tuple, not %.50s",
	"must be %d-item tuple, not %s",

gh-50333: Deprecate support of non-tuple sequences in PyArg_ParseTuple() #128374

Are you sure you want to change the base?

gh-50333: Deprecate support of non-tuple sequences in PyArg_ParseTuple() #128374

Conversation

serhiy-storchaka commented Dec 31, 2024 • edited by github-actions bot Loading

picnixz left a comment • edited Loading

Choose a reason for hiding this comment

erlend-aasland commented Jan 2, 2025

erlend-aasland Jan 2, 2025

Choose a reason for hiding this comment

serhiy-storchaka Jan 2, 2025

Choose a reason for hiding this comment

erlend-aasland Jan 2, 2025

Choose a reason for hiding this comment

erlend-aasland Jan 2, 2025

Choose a reason for hiding this comment

erlend-aasland Jan 2, 2025

Choose a reason for hiding this comment

serhiy-storchaka Jan 2, 2025

Choose a reason for hiding this comment

erlend-aasland Jan 2, 2025

Choose a reason for hiding this comment

serhiy-storchaka commented Dec 31, 2024 •

edited by github-actions bot

Loading

picnixz left a comment •

edited

Loading