Wouldn't it be more efficient to use in-memory queues which temporarily fall back to on-disk queues if there's a network issue?
When the connectivity issue is resolved, push the events from the on-disk queue back to the output integration. I've built software that did this using channels in Go, adding redundancy by caching data to disk whenever there was a problem writing to Elasticsearch with the go-elasticsearch bulk indexer, and it worked well. The software periodically checked connectivity to the cluster and resent the cached events once the issue was gone.
This could be done by having an option to flush the in-memory queue to the on-disk queue instead of dropping the events (a rough sketch of the pattern is below). It would reduce the resources required to run on-disk queues on busy production servers 24/7. On-disk queues on busy domain controllers or other critical systems might use more resources, which could prompt complaints from our IT teams. And in large multi-regional deployments it's too complicated to maintain separate policies just to run in-memory queues on a subset of endpoints.
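As a rough illustration of the fallback approach described above, here is a minimal Go sketch. The `sendToOutput` function, the `spill.ndjson` file, and the buffer sizes are hypothetical placeholders standing in for the real output client and on-disk queue; this is not how the agent's disk queue is implemented.

```go
package main

import (
	"bufio"
	"errors"
	"fmt"
	"os"
	"time"
)

// sendToOutput stands in for the real output client (e.g. a bulk indexer).
// Here it always fails, to simulate a connectivity issue.
var sendToOutput = func(event string) error {
	return errors.New("output unreachable")
}

// spillFile is a hypothetical on-disk queue location.
const spillFile = "spill.ndjson"

// spill appends an undeliverable event to the on-disk queue.
func spill(event string) error {
	f, err := os.OpenFile(spillFile, os.O_APPEND|os.O_CREATE|os.O_WRONLY, 0o600)
	if err != nil {
		return err
	}
	defer f.Close()
	_, err = fmt.Fprintln(f, event)
	return err
}

// replay re-sends spilled events once the output is healthy again.
func replay() {
	f, err := os.Open(spillFile)
	if err != nil {
		return // nothing spilled yet
	}
	defer f.Close()
	sc := bufio.NewScanner(f)
	for sc.Scan() {
		if err := sendToOutput(sc.Text()); err != nil {
			return // still unhealthy; keep the file and retry on the next tick
		}
	}
	os.Remove(spillFile) // everything delivered; drop the on-disk queue
}

func main() {
	events := make(chan string, 1024) // in-memory queue

	// Producer: enqueue a few events.
	go func() {
		for i := 0; i < 5; i++ {
			events <- fmt.Sprintf(`{"seq":%d}`, i)
		}
		close(events)
	}()

	// Consumer: try the output first, fall back to disk instead of dropping.
	for ev := range events {
		if err := sendToOutput(ev); err != nil {
			if spillErr := spill(ev); spillErr != nil {
				fmt.Println("dropping event:", spillErr)
			}
		}
	}

	// Periodic connectivity check: replay the spill file when the output recovers.
	ticker := time.NewTicker(5 * time.Second)
	defer ticker.Stop()
	for range ticker.C {
		replay()
	}
}
```

One trade-off of this pattern is that replaying the spill file gives at-least-once delivery, so duplicates are possible if the output fails partway through a replay.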
It's a good idea, but exploring that needs to come after making the disk queue available at all. Today there is definitely a performance penalty for writing to disk, since it requires serializing the event twice (once to disk, once to the output), but we haven't thoroughly quantified what it is. We may also be able to reduce that cost.
- Checklist to achieve `experimental` status
- `encryption_password` when disk queue encryption is enabled (elastic-agent#1656)
- Checklist to achieve `beta` status
- Checklist to achieve `ga` status