Operators work directly on raw arrays if most agents are active #843

RomeshA · 2025-01-17T04:54:21Z

Description

Operators on Arr and BoolArr use asnew() to return new Arr instances. These work along the lines of

Using self.values to create a numpy array with length equal to auids
Creating a corresponding array from other.values if other is an Arr
Creating an intermediate result array with length equal to auids
Creating a new empty array the same size as uids
Populating it sparsely with the result array based on auids

The intention is that the alternative of operating on every UID (i.e., on the raw arrays directly) would be unnecessarily slow if there are many inactive agents. However, in practice the overhead associated with creating the intermediate arrays and then sparsely inserting it is fairly significant. For simulations that do not have a significant proportion of inactive agents, it can be faster to just operate on every entry. In that case, no intermediate arrays need to be created - the output of the logical operation of the two raw arrays can be directly used as the raw array for the new Arr instances. In some simulations this can give a speedup of around 40%.

This PR thus adds an attribute for Arr that corresponds to whether this optimization should be used, with a threshold of 50% (i.e., it will use the raw entries if over half the agents are active). This threshold could be tuned as we get more use cases. There is a corresponding update to asnew to optionally take in a raw array, which will be used if provided.

There should be no changes necessary in any user code and simulation output should be the same

Checklist

Code commented & docstrings added
New tests were needed and have been added
A new version number was needed & changelog has been updated
A new PyPI version needs to be released

RomeshA · 2025-01-17T07:41:23Z

Going to mark this as draft as other changes in our code seem to have made this change have less impact, will continue profiling further

RomeshA added 2 commits January 17, 2025 14:31

Operators work directly on raw arrays if most agents are active

067757a

Improve logic

e7288a4

RomeshA requested a review from cliffckerr January 17, 2025 04:54

RomeshA marked this pull request as draft January 17, 2025 07:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Operators work directly on raw arrays if most agents are active #843

Operators work directly on raw arrays if most agents are active #843

RomeshA commented Jan 17, 2025

RomeshA commented Jan 17, 2025

Operators work directly on raw arrays if most agents are active #843

Are you sure you want to change the base?

Operators work directly on raw arrays if most agents are active #843

Conversation

RomeshA commented Jan 17, 2025

Description

Checklist

RomeshA commented Jan 17, 2025