Add defstruct macro #14

rutenkolk · 2024-10-13T21:02:00Z

~~[Note: I would consider this a draft PR, as I have not yet added tests (which could unearth some bugs and necessitate appropriate fixes).]~~

Hi, this PR contains the addition of a defstruct macro. It does the following:

It adds serialize and deserialize code to a serde "registry" for the new type (details below)
It generates a new type that has the specified members (details below)
It adds an implmentation for c-layout
It adds inline implementations for both deserialize-from and serialize-into
It adds an implementation for clojure.pprint/simple-dispatch

serde registry

The "registry" is implemented via the multimethods generate-deserialize and generate-serialize which produce code to de/serialize the respective types. This removes indirection in the de/serialize code for types that use other types. i think in the original discussion we were on the same page, but thought the other meant something different. The defstruct macro adds implementations to the multimethods for the newly generated type.

the generated type

The new type is generated via deftype in the private function generate-struct-record. This is an attempt to strike a middle ground between the two positions of the original discussion, although the result might be a bit odd:

The type implements both IPersistentVector and IPersistentMap.
- The basic idea is: if it is treated like a vector, it behaves like a vector. if it is treated like a map, it behaves like a map.
It therefore implements both vector-like methods like nth as well as map-like methods like without (for e.g. dissoc).
If there is a an overlap in map/vector interface such as with assoc, it supports both paradigms of indices-as-keys and membernames-as-keys. Practically speaking, if you use something like assoc with a number as a key, it behaves like a vector (and will return a vector), otherwise like a map (and will return a map).
one notable exception here is foreach which can't support both paradigms, and it is therefore implemented as if it's a vector. The rationale here is that the value of the type is composed of the actual values of the members, not the associated names of the places of the values. If you map or reduce over an object of this type, you will do so over the values of the members.

with-c-layout

There was one implementation problem. Since padding was needed to be taken into account to allow for inline serdes, the new code for the macro needed to rely on with-c-layout. The problem here is that with-c-layout is in the layout namespace which already depends on mem. As a stopgap solution i simply copied the function over as a private function. I would be in favor of actually deprecating the layout namespace. for backwards compatibility the with-c-layout function in layout could depend on the one in mem. Not only is the layout namespace at this point somewhat anemic, it has also caused me trouble. I'm not sure if it's a bug, but due to with-c-layout being in layout, i ran into the problem that there are now two different :padding keywords, which i found confusing.

tests & benchmarks

No tests or benchmarks exist right now. I don't expect the custom type to be slower than defrecord, but I want to test it.
Similarly, i do want to add a first set of tests for the de/serialization.

IGJoshua · 2024-10-14T17:44:41Z

I absolutely love this. You've done a fantastic job! I look forward to seeing the tests that you add for this, and I'm also thinking ahead to reorganizing some of the existing serde code to use the new generate-serialize and generate-deserialize to add some inline arities to serialize and deserialize when the type argument is a constant. Don't worry about any of that in this PR, it's just something I want to use this for in the future.

IGJoshua · 2024-10-14T18:04:45Z

So I like the way you've chosen to introduce the type registry. It integrates well with the existing tools, and provides a way to do inline arities for serialize and deserialize in the future. One hesitation I have at the moment looking over the code is the use of ::mem/array to refer to actual java arrays.

Arrays

Currently in coffi the ::mem/array type serializes anything seqable, which does include arrays, and it deserializes to a vec.
I think that to support the direction you're going here, we should add optional kwargs as options in the array type, with a :raw? true option meaning that it will deserialize to a JVM array and will assume that the argument is an array, and then add some conditionals in to ensure that we have the fast path for array serialization.

I also think that your compromise around using a record-like type with both map and array style accesses is appropriate and well-done. I might personally want to go the other direction with the foreach implementation though, making it act as if it's reducing over a sequence of map entries. Doing it this way allows adding a quick map val into the stack without too much performance overhead, and it avoids the need to figure out a zipmap with the keys and values separately. I don't have too strong an opinion on this one though as long as keys returns the keys in the same order as foreach yields the values.

serde registry

The serde registry as it stands with generate-serialize and generate-deserialize both look pretty good in terms of usage and follow about what I want them to do, but I want to note two things about them that I'm not sure how I feel about right now.

The first is just an observation and not a problem, that being that these functions all generate the equivalent of a serialize-into or deserialize-from call, which I think is appropriate, I'm just thinking about what this might mean in terms of naming though if the generate-x functions are going to become a part of the public api of coffi.

The second is that these macros as they stand are unhegenic macro helpers. I think it would be appropriate for the multimethod to take in the symbol which will be used to refer to the segment.

with-c-layout

For the with-c-layout problem, I think there's a couple things to be done. To start with, we can make the private version in coffi.mem use :coffi.layout/padding explicitly which doesn't require the namespace be loaded, which reduces it to just one padding key. Then for the rest, I'm a little undecided about it.

All the structs being passed over the C abi will most certainly use the with-c-layout layout, however the intention behind having the namespace in the first place was to allow easily serializing clojure maps into e.g. std140 or std430 from the GLSL spec, I just haven't gotten around to implementing those yet as I was wanting to get a defstruct macro and some codegen for an opengl bindgen library first.

Specifically though, if we remove the coffi.layout namespace and just assume everything is the c-layout, that will then mean there's no way for a user of the library to reach lower in the abstractions to implement a different layout for their usecase except to re-implement defstruct for their own layout.

rutenkolk · 2024-12-29T17:23:18Z

Hi, I've opened this PR for review, since I tested and benchmarked everything.

The macro itself doesn't have a raw? option anymore, but it supports arrays with a :raw? true optional keyword like this: [::mem/array ::whatever-type 3 :raw? true].

There were a few interesting things that happened along the way. One issue were the inline versions of the write-x functions like write-int. Sometimes reflection would occur but even *warn-on-reflection* true wouldn't catch it. Since writing this macro version with proper typehinting is finicky i added a new with-typehints macro to make it more robust and applied it to all of the functions where it was causing issues. I can write this out by hand if needed, but it's very uncomfortable. In essence you either couldn't pass in a literal or a form and have it be typehinted correctly in both cases as a primitive local before. Now this works, given that the local is actually of the right primitive type.

the internal with-c-layout function still exists, but it's using the keywords just like the original version in coffi.layout.

the macros are now hygenic, as in: they take in the expression that represents e.g. a segment and don't just use a random symbol name you have to match on the call-site.

i have spent some time optimizing serializing and deserializing and benchmarked everything. especially interesting is the story around arrays. some key takeaways:

if one has to decide between inlining and unrolling a loop, inlining is faster for raw arrays, but not for vectors
with big enough arrays of primitives, it's advantageous for serializing to first create one big array and then copy that array with one call. this is apparently not true for deserializing, which is kind of surprising! A transient vector wins this case.
there are tradeoffs when which method to read / write becomes the better option and the code auto-chooses the best, based on my benchmarks. this may need to be tested on multiple platforms though and adjusted accordingly.

In any case, the "auto" version wich chooses the method performs as good or better than the best alternative for arrays and vectors respectively. That being said, raw arrays remain incredibly fast and small vectors (< 16 elements) are still pretty fast.

One thing I noticed while profiling calls to serialize-into and deserialize-from is that with small sizes, a big cost factor of these functions becomes looking up the multimethod:

IGJoshua

I've got a few specific things in the review I'd like resolved or to discuss, and attached here I've got a few patches that I'd like if they were applied to the PR.

0004-Don-t-use-underscore-on-used-args.patch.txt
0003-Remove-duplicate-c-layout-implementation.patch.txt
0002-Fix-warning-about-defstruct-redefinition.patch.txt
0001-Use-a-once-only-impl-rather-than-with-typehints.patch.txt

src/clj/coffi/mem.clj

Signed-off-by: Kristin Rutenkolk <[email protected]>

…or strings Co-authored-by: Joshua Suskalo <[email protected]>

Co-authored-by: Joshua Suskalo <[email protected]>

rutenkolk · 2025-01-04T20:50:57Z

I think work on this PR is nearing completion. As for the performance, here are some benchmark results:

Serializing a struct with `n` amount of `::mem/int` members:

linear scale:

logarithmic scale:

Deserializing a struct with `n` amount of `::mem/int` members:

linear scale:

logarithmic scale:

Serializing a struct with one `::mem/array` of `::mem/int`s of fixed size `n`

all individual ways to serialize, logarithmic scale:

comparison defstruct with raw arrays and vectors for arrays vs. defalias, logarithmic scale:

Deserializing a struct with one `::mem/array` of `::mem/int`s of fixed size `n`

all individual ways to deserialize, logarithmic scale:

comparison defstruct with raw arrays and vectors for arrays vs. defalias, logarithmic scale:

In all cases, a significant speedup has been achieved, often more than an order of magnitude, in special cases more than two. For native arrays, raw java arrays outperform vectors quite heavily. there may be room for improving this specific codepath, but it is still a noticeable improvement. For some cases, like raw arrays, the performance is in the realm of only a few nanoseconds and one of the biggest factors actually becomes the initial dispatch of the multimethod, so the actual time the de/serialization takes is probably hard to improve significantly further without changing aspects of how coffi itself operates.

rutenkolk · 2025-01-07T16:28:51Z

to document a last touch:

i moved with-c-layout to coffi.layout again and load-fileed it in coffi.mem right above the definition of defstruct (the latest possible time, so that it hopefully will cause as little problems as possible, should coffi.layout be developed further).

defstruct can still be called from coffi.mem and even after removing the dependency on coffi.layout in mem_test.clj no test fails, so i think this worked out just fine!

register-new-struct-deserialization

rutenkolk added 13 commits October 4, 2024 16:17

add write functions for arrays

557cd27

remove namespaced references

f96df71

add defstruct macro and helper functions

cf6dff3

copy with-c-layout to mem namespace

d04a9f6

remove namespace qualifiers from with-c-layout

df29b16

add c-layout to struct generation

b0cb0f2

add deserialization generation

8bfc156

add generate-serialize multimethod

c5d18e9

add serialization generation

37b74fc

fix nested types serdes

eea1b43

fix array handling for defstruct macro

f702096

add custom deftype for struct type generation

21c547c

add pprint impl for struct types

53a8435

rutenkolk marked this pull request as draft October 13, 2024 21:19

IGJoshua mentioned this pull request Oct 14, 2024

Defstruct Macro Added: #6

Closed

rutenkolk added 13 commits October 18, 2024 18:06

fix seq of new type and remove indirection

4f5b9fa

draft of new type generation

f08fa20

fix forEach reference

913c004

introduce custom vector iterator

cf2dece

add error message for invalid type usage

020e102

make defstruct robust against dangling and unbound vars

b6f1e44

improve error message

4c40804

fix map cons

ec418cb

fix struct map containsKey

003a737

fix struct assoc

9404ef4

fix struct entryAt

18679c4

add map functionaliy test for struct

e5cd228

implement java.util.Map and MapEquivalence

fd0f22f

rutenkolk added 6 commits December 28, 2024 01:08

typehint inline functions

7bcdb8c

fix write-byte typehint

ab8cc0c

remove necessity to create array when deserializing

18f5699

refactor array serdes & auto-choose copy methods

869e678

remove array-copy-method

c49dc79

fix array serialization

51dfbc3

rutenkolk marked this pull request as ready for review December 29, 2024 16:59

IGJoshua requested changes Jan 2, 2025

View reviewed changes

IGJoshua and others added 11 commits January 2, 2025 23:32

Use a once-only impl rather than with-typehints

d763b39

Signed-off-by: Kristin Rutenkolk <[email protected]>

Fix warning about defstruct redefinition

8ea1217

Signed-off-by: Kristin Rutenkolk <[email protected]>

Remove duplicate c-layout implementation

33e1a95

Signed-off-by: Kristin Rutenkolk <[email protected]>

Don't use underscore on used args

78d39b1

Signed-off-by: Kristin Rutenkolk <[email protected]>

fix mangled keyword in coffitype->typename

cb3e620

refactor multimethod dispatch function to use ffirst

10b8baa

use syntax quoted expression in generate-deserialize implementation f…

0dfb6bd

…or strings Co-authored-by: Joshua Suskalo <[email protected]>

refactor struct-vec-iterator to camel case

d24ebc7

refactor generate-struct-type to return the form via syntax quote

a6864fd

Co-authored-by: Joshua Suskalo <[email protected]>

fix unmatched parantheses

e25ad22

remove bulk deserialization for vectors

d2afb2b

rutenkolk changed the base branch from master to develop January 4, 2025 19:06

rutenkolk added 3 commits January 4, 2025 20:33

reverse type and fieldname in defstruct definition

5a9d156

fix order of type and fieldname for defstruct in tests

637f156

remove typename argument from typelist

06cd910

rutenkolk added 3 commits January 4, 2025 23:32

emit serde registration and omit padding from defstruct

1f5efb0

move with-c-layout back to layout.clj and load layout namespace from mem

8d29234

remove layout dependency from mem test

a6b7ece

allow global offset to be expression for

f9784b3

register-new-struct-deserialization

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add defstruct macro #14

Add defstruct macro #14

rutenkolk commented Oct 13, 2024 •

edited

Loading

IGJoshua commented Oct 14, 2024

IGJoshua commented Oct 14, 2024 •

edited

Loading

rutenkolk commented Dec 29, 2024 •

edited

Loading

IGJoshua left a comment

rutenkolk commented Jan 4, 2025

rutenkolk commented Jan 7, 2025

Add defstruct macro #14

Are you sure you want to change the base?

Add defstruct macro #14

Conversation

rutenkolk commented Oct 13, 2024 • edited Loading

serde registry

the generated type

with-c-layout

tests & benchmarks

IGJoshua commented Oct 14, 2024

IGJoshua commented Oct 14, 2024 • edited Loading

Arrays

serde registry

with-c-layout

rutenkolk commented Dec 29, 2024 • edited Loading

IGJoshua left a comment

Choose a reason for hiding this comment

rutenkolk commented Jan 4, 2025

Serializing a struct with n amount of ::mem/int members:

Deserializing a struct with n amount of ::mem/int members:

Serializing a struct with one ::mem/array of ::mem/ints of fixed size n

Deserializing a struct with one ::mem/array of ::mem/ints of fixed size n

rutenkolk commented Jan 7, 2025

rutenkolk commented Oct 13, 2024 •

edited

Loading

IGJoshua commented Oct 14, 2024 •

edited

Loading

rutenkolk commented Dec 29, 2024 •

edited

Loading

Serializing a struct with `n` amount of `::mem/int` members:

Deserializing a struct with `n` amount of `::mem/int` members:

Serializing a struct with one `::mem/array` of `::mem/int`s of fixed size `n`

Deserializing a struct with one `::mem/array` of `::mem/int`s of fixed size `n`