Reading garbage in types with both `size` and `terminator` #613

KOLANICH · 2019-08-11T21:14:15Z

In some formats one can see that null-terminated strings of some fixed size are used. Often the rest of such strings is not zero-filled, but contain traces of data once been in from process address space by that addr. Though it is possible to write a spec in the way allowing extraction of this data (quite inefficient though), it may make sense to allow them out of the box.

GreyCat · 2019-09-16T18:10:28Z

Could you clarify what exactly do you mean? Current implementation of something like that:

seq:
  - id: foo
    size: 40
    terminator: 0

... is specifically engineered to have max length of 40 bytes and trim all the garbage after "0" byte. If someone wants to parse that garbage instead, then one's totally free to do something like:

seq:
  - id: foo
    size: 40
    type: terminated_string_and_garbage
types:
  terminated_string_and_garbage:
    seq:
      - id: the_string
        terminator: 0
      - id: garbage
        size-eos: true

Am I missing your point?

KOLANICH · 2019-09-16T19:46:27Z

Yes, but it has overhead of storing the same bytes twice. Also I am not sure it correctly processes the cases when the terminator is not within these 40 bytes (for example the native parser reserves 41 byte, sets the 41st to 0 and then strncpyes 40 bytes in front of them, so then the 40 bytes can be without terminator at all). So I feel like it should be the built-in feature activated by a compiler flag or maybe a separate built-in type.

GreyCat · 2019-09-17T10:59:31Z

The overhead depends on the implementation. With cleaner implementation of substreams #44, it won't be an issue.

Also I am not sure it correctly processes the cases when the terminator is not within these 40 bytes

Sounds exactly like a use case for eos-error: false.

So, in other words, it's already implemented and I don't see why we should change these designs.

GreyCat added the enhancement label Sep 16, 2019

GreyCat closed this as completed Sep 17, 2019

KOLANICH mentioned this issue Jan 18, 2020

inlining types #88

Open

KOLANICH mentioned this issue Dec 25, 2021

Serialization #27

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reading garbage in types with both `size` and `terminator` #613

Reading garbage in types with both `size` and `terminator` #613

KOLANICH commented Aug 11, 2019 •

edited

Loading

GreyCat commented Sep 16, 2019 •

edited

Loading

KOLANICH commented Sep 16, 2019 •

edited

Loading

GreyCat commented Sep 17, 2019

Reading garbage in types with both size and terminator #613

Reading garbage in types with both size and terminator #613

Comments

KOLANICH commented Aug 11, 2019 • edited Loading

GreyCat commented Sep 16, 2019 • edited Loading

KOLANICH commented Sep 16, 2019 • edited Loading

GreyCat commented Sep 17, 2019

Reading garbage in types with both `size` and `terminator` #613

Reading garbage in types with both `size` and `terminator` #613

KOLANICH commented Aug 11, 2019 •

edited

Loading

GreyCat commented Sep 16, 2019 •

edited

Loading

KOLANICH commented Sep 16, 2019 •

edited

Loading