general performance issue #11

convolvatron · 2017-12-08T16:19:55Z

optimizations we are consider are

pipeline segmented transfers
connection bonding
iovecs to reduce copying
caching filehandles

note that (2) might obviate (1). (1) may in fact be necessary for correctness if the server evaluates out of order

convolvatron · 2017-12-08T16:52:53Z

inserts seem to involve a lot of small writes, probably btree pointers, lsns, etc. it would probably help to batch these between sync(). that may also lower the chances of corrupting the database on fault (quasi-atomic updates)

to address another comment in the source, this probably means

defining a synch mode
changing file write/create to take a default mode
allow a per file write to choose a different mode
exposing a flush

convolvatron · 2017-12-08T17:35:31Z

up the transfer chunking size

jssmith · 2017-12-08T18:46:00Z

Ok, good learnings + thoughts. Comments:

inserts seem to involve a lot of small writes...

Just to confirm I that I understand this: you expect we can get efficiencies from batching at the NFS protocol layer. I believe we already have the Nagle algorithm in TCP layer (though now that I think about it I'm not sure whether we want that - maybe we just do all of the batching ourselves).

up the transfer chunking size

Ah, yes, we should probably experiment with this. We definitely need to renumber these because this seems like low-hanging fruit.

I'll want to make sure that we have all of these changes flagged so that we can measure and compare performance under different approaches.

convolvatron · 2017-12-08T18:52:30Z

right. just like nagle. nagle though is probably going to be a little blind. in particular it flushes with a timer and not with an explicit sync.and i think(?) the size threshold is lower than the tcp mss. sure wrp flags. dynamic api is preferable to compile time.

convolvatron · 2017-12-08T19:30:54Z

oh right, nagle in this case wont help and in fact might be hurting because we block waiting for the remote response on each write

turn on NO_DELAY

convolvatron · 2017-12-09T18:19:48Z

note that pipelining segmented transfers might not get us anything, since i'm pretty sure sqlite will max out at a page size. if thats the <= segment size then it kind of doesn't matter (for this application)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

general performance issue #11

general performance issue #11

convolvatron commented Dec 8, 2017

convolvatron commented Dec 8, 2017 •

edited

Loading

convolvatron commented Dec 8, 2017 •

edited

Loading

jssmith commented Dec 8, 2017

convolvatron commented Dec 8, 2017

convolvatron commented Dec 8, 2017

convolvatron commented Dec 9, 2017

general performance issue #11

general performance issue #11

Comments

convolvatron commented Dec 8, 2017

convolvatron commented Dec 8, 2017 • edited Loading

convolvatron commented Dec 8, 2017 • edited Loading

jssmith commented Dec 8, 2017

convolvatron commented Dec 8, 2017

convolvatron commented Dec 8, 2017

convolvatron commented Dec 9, 2017

convolvatron commented Dec 8, 2017 •

edited

Loading

convolvatron commented Dec 8, 2017 •

edited

Loading