MemoryStream / TryGetBuffer and writing analyzers/fix providers #38093
Labels
api-suggestion
Early API idea and discussion, it is NOT ready for implementation
area-System.IO
backlog-cleanup-candidate
An inactive issue that has been marked for automated closure.
no-recent-activity
Milestone
Background and Motivation
Mentioning @stephentoub as he seems to be on many of the
MemoryStream
topics.I see that
MemoryStream
has a lot of discussion traffic (e.g. here), so add mine to the pile.We often encounter egregious use of memory and track it down to use of
MemoryStream.ToArray
. It often piles things up unnecessarily on our LOH. In fact, double because of the copying.We have written a diagnostic to flag all occurrences of it and try to review the usage to reduce the double allocations. Recently I learned about the
TryGetBuffer
api, but using it as a replacement is troublesome.Our rule has two main criticisms/concerns:
MemoryStream
, fail, and will be less likely to make performance improvements from our analyzers in the future.My API proposal is we need (at the minimum):
which always succeeds. If the
MemoryStream.TryGetBuffer
would return false, then it should domemoryStream.ToArray()
and return it as an ArraySegment.This is then a drop in replacement for
memoryStream.ToArray()
The complexity of
MemoryStream
/ API makes it hard to write analyzers and fix providers for it.The logic to determine if
MemoryStream.ToArray
can useTryGetBuffer
requires tracing theMemoryStream
instance back through function calls to the source to find if it was created with one of the constructors that will work. This is quite complicated.TryGetBuffer
is not a drop in replacement forToArray
because it can fail, and when it does, it always will. Autogenerating error condition paths and changing variable scopes is complicated. See this "simple" example:This requires promoting the
jsonString
declaration above thememoryStream.ToArray
and generating anelse
condition on error.The other possibility is generating our own extension method:
That has this logic in it and force insert a new code file into every solution / project where it doesn't seem to be defined already.
Proposed API
Usage Examples
It would nice to be able to the following:
Side note: I see
System.Text.Encoding
takes aReadOnlySpan
as an argument toGetString
and notArraySegment
. There is also a new class likeMemory<T>
as well.So we need to do conversions now if we have an
ArraySegment
?Alternative Designs
It's unclear to me the need for the 'publicly' visible complication of the
MemoryStream
class.i.e. If you supply a buffer and
publiclyVisible
is false, how do you get the used part of the buffer you passed in?But to make this simple perhaps separate out the
publiclyVisible: false
functionality into a subclass or different stream implementation so analyzers can easily find MemoryStreams that can be "fixed" and ones that cannot.Then analyzers can quickly tell if the the
stream.ToArray()
can be fixed or not:Risks
Splitting class functionality will of course break some portion of the API audience.
The text was updated successfully, but these errors were encountered: