
[WIP] Benchmark MLOps.NET #320

Closed
wants to merge 6 commits into from

Conversation

Brett-Parker
Collaborator

Fixes #233

@Brett-Parker
Collaborator Author

@aslotte I have pushed this as a baseline. I have been investigating this one for a few hours and there doesn't appear to be a great way to do this.

The best I have come up with is:

Created a console app.

This app can be called from the command line via `dotnet MLOps.NET.Benchmarks.dll`.
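
For reference, the entry point is really just a thin wrapper around BenchmarkDotNet's switcher; a minimal sketch along those lines (class and namespace names are illustrative, not necessarily what is in the PR):

```csharp
using BenchmarkDotNet.Running;

namespace MLOps.NET.Benchmarks
{
    public class Program
    {
        public static void Main(string[] args)
        {
            // Discovers and runs every benchmark class in this assembly.
            // Using the switcher (rather than BenchmarkRunner.Run<T>()) also
            // gives us --list and --filter support on the command line.
            BenchmarkSwitcher.FromAssembly(typeof(Program).Assembly).Run(args);
        }
    }
}
```

With that in place, `dotnet MLOps.NET.Benchmarks.dll` (or `dotnet run -c Release`) runs the full suite.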

Results
(screenshot of benchmark results)

Comparisons

The only thing I can find for baselining and comparing is something like this:
https://github.com/dotnet/performance/tree/master/src/tools/ResultsComparer#sample-results

Create a baseline CSV report; then every time we run the benchmarks, the results are compared against that baseline.
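
Concretely (going by the ResultsComparer README linked above; the exact flags may differ): run the benchmarks with an exporter enabled, keep a committed baseline of the exported results, and on each subsequent run invoke the comparer with something like `dotnet run -- --base <baseline-folder> --diff <new-results-folder> --threshold 2%` to flag regressions beyond the threshold.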

I have used Moq here to put something in the file so we can see roughly what structure I was thinking of.

Before I continue I'd like some thoughts/feedback if possible.

@Brett-Parker Brett-Parker self-assigned this Aug 29, 2020
@Brett-Parker Brett-Parker changed the title Benchmark MLOps.NET [WIP]Benchmark MLOps.NET Aug 29, 2020
@aslotte
Owner

aslotte commented Aug 29, 2020

Awesome @Brett-Parker, I was actually just thinking of BenchmarkDotNet.
I think you're on the right track, and we can do this in phases: start by having benchmarks, then see how we can implement a comparison and a baseline, and eventually how to integrate some of this into our CI pipelines (that can be step two).

A couple of thoughts:

  1. Benchmarks should run against the real implementation, so we should create an MLOpsContext with access to a real database and a real model repository, as that will give us a good understanding of any actual bottlenecks.
  2. Given that, we probably want benchmarks for the various storage providers: CosmosDB, SQLServer and SQLite. I don't think we need to implement all of these at once, but we probably want some structure to separate them and their various combinations. I think we can keep one MLOps.NET.Benchmark project and have different folders, e.g.

```
SQLServer
├── LifeCycleCatalogBenchmark
├── DeploymentCatalogBenchmark
└── ...

SQLite
├── LifeCycleCatalogBenchmark
├── DeploymentCatalogBenchmark
└── ...

CosmosDb
├── LifeCycleCatalogBenchmark
├── DeploymentCatalogBenchmark
└── ...
```

Let me know if you can think of any other structure though, happy to bounce some ideas :)
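
To make point 1 concrete, here is a rough sketch of what a storage-specific benchmark (e.g. SQLite/LifeCycleCatalogBenchmark) could look like, assuming BenchmarkDotNet and an MLOpsBuilder-style setup; the builder and catalog method names below are placeholders and may differ from the actual MLOps.NET API:

```csharp
using System.Threading.Tasks;
using BenchmarkDotNet.Attributes;
using MLOps.NET;

namespace MLOps.NET.Benchmarks.SQLite
{
    public class LifeCycleCatalogBenchmark
    {
        private IMLOpsContext mlOpsContext;

        [GlobalSetup]
        public void Setup()
        {
            // Build a context against a real database and a real model
            // repository so the numbers reflect actual bottlenecks.
            // NOTE: UseSQLite/UseLocalFileModelRepository are assumptions here.
            mlOpsContext = new MLOpsBuilder()
                .UseSQLite()
                .UseLocalFileModelRepository()
                .Build();
        }

        [Benchmark]
        public async Task CreateRun()
        {
            // CreateRunAsync stands in for whatever the LifeCycle catalog
            // actually exposes for creating a run.
            await mlOpsContext.LifeCycle.CreateRunAsync("benchmark-experiment");
        }
    }
}
```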

@Brett-Parker
Collaborator Author

@aslotte thanks for your quick reply. I agree with everything. I'll take a look in the morning and implement something basic as a baseline.

@Brett-Parker
Collaborator Author

@aslotte Thoughts on this project structure?

(screenshot of the proposed project structure)

Each database will have its own setup and cleanup.

If you agree with this approach, I will start populating each provider with at least one benchmark for every catalog. Then this PR can be closed and I will open a new issue for the comparison and CI work.
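
For the setup/cleanup part, a minimal skeleton of the lifecycle hooks each storage-specific class would carry (the BenchmarkDotNet attributes are real; the class name and comment bodies are illustrative):

```csharp
using BenchmarkDotNet.Attributes;

public class CosmosDbLifeCycleCatalogBenchmark
{
    [GlobalSetup]
    public void Setup()
    {
        // Runs once before any benchmark in this class:
        // create the MLOpsContext and the benchmark database here.
    }

    [GlobalCleanup]
    public void Cleanup()
    {
        // Runs once after all benchmarks in this class have finished:
        // drop the benchmark database / clean up any test data here.
    }
}
```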

@aslotte
Owner

aslotte commented Aug 30, 2020

@Brett-Parker looks great, and it allows us to modify the structure as needed going forward. I like it. Just a heads up, there's one too many Ls in SQLite :)

Added LifeCycleCatalog Benchmark only
@Brett-Parker
Collaborator Author

@aslotte I have now added LifeCycleCatalog and completed the structure for this. I think this is now a good place to review this PR. I will create a new issue for expanding this to other catalogs.

(screenshot of the completed project structure)

Things still to do:

  • Add the other catalog benchmarks
  • Investigate BenchmarkDotNet's `--filter` option to allow benchmarking specific integrations (see the note below this list)
  • Benchmark cleanup
  • Add comparison to benchmarks (#321 - Benchmark comparison)
  • GitHub Action to flag whether a comparison result is acceptable or not
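
On the `--filter` item: since the entry point uses BenchmarkSwitcher, BenchmarkDotNet's glob filter should let us run only one provider's benchmarks, e.g. `dotnet MLOps.NET.Benchmarks.dll --filter '*SQLite*'` (exact pattern to be confirmed once the namespaces are settled).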

@Brett-Parker Brett-Parker requested a review from aslotte August 30, 2020 15:46
@Brett-Parker Brett-Parker changed the title [WIP]Benchmark MLOps.NET Benchmark MLOps.NET Aug 30, 2020
Removed unnecessary usings.
@Brett-Parker
Collaborator Author

@aslotte sorry, found mistakes. Rectified them now.

@aslotte
Owner

aslotte commented Aug 30, 2020

Great @Brett-Parker!
A couple of thoughts:

  • I think we can keep all config values in one appsettings.json
  • We can add a reference to MLOps.NET.Tests.Common to use the configuration builder there
  • Just like for the integration tests, we'll have the exact same benchmarks for the different storage providers; the only thing that changes is the setup. One way to make it faster to write new benchmarks across all storage providers would be to create a base class, LifeCycleCatalogBenchmarks.cs, where we store all the benchmarks (e.g. the one you just created for creating a run), keep the GlobalSetup in the storage-specific benchmark classes, and have them inherit from LifeCycleCatalogBenchmarks (see the sketch below).

That way we only need to write each benchmark once, and it will run for all storage providers.
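
A sketch of that base-class layout (same caveat as before: the MLOpsBuilder methods and the CreateRunAsync call are placeholders for the real MLOps.NET API):

```csharp
using System.Threading.Tasks;
using BenchmarkDotNet.Attributes;
using MLOps.NET;

// All benchmarks live in the base class and are written once.
public abstract class LifeCycleCatalogBenchmarks
{
    protected IMLOpsContext mlOpsContext;

    [Benchmark]
    public async Task CreateRun()
    {
        await mlOpsContext.LifeCycle.CreateRunAsync("benchmark-experiment");
    }
}

// The storage-specific classes only provide the setup.
public class SQLiteLifeCycleCatalogBenchmark : LifeCycleCatalogBenchmarks
{
    [GlobalSetup]
    public void Setup()
    {
        mlOpsContext = new MLOpsBuilder()
            .UseSQLite()                    // placeholder builder call
            .UseLocalFileModelRepository()  // placeholder builder call
            .Build();
    }
}

public class SQLServerLifeCycleCatalogBenchmark : LifeCycleCatalogBenchmarks
{
    [GlobalSetup]
    public void Setup()
    {
        // Connection string would come from the shared appsettings.json.
        mlOpsContext = new MLOpsBuilder()
            .UseSQLServer("<connection-string>")  // placeholder builder call
            .UseLocalFileModelRepository()
            .Build();
    }
}
```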

@aslotte
Owner

aslotte commented Aug 30, 2020

Let me know if that makes sense and if I was able to explain it properly. I think it should work, but I'm happy to bounce some ideas.

@Brett-Parker Brett-Parker changed the title Benchmark MLOps.NET [WIP] Benchmark MLOps.NET Sep 1, 2020
@aslotte
Owner

aslotte commented Sep 15, 2020

Did you intend to close this one @Brett-Parker?

@Brett-Parker Brett-Parker deleted the issue_233 branch October 3, 2020 13:31