Is your feature request related to a problem? Please describe.
Part of the model flow is to create and add data validation tests. These check the sanity of your data, e.g. that a column has a specific cardinality (say, only 5 distinct values), or that a numeric column stays within a specific range.
Creating these tests is pretty repetitive.
Describe the solution you'd like
Add a Fluent API to validate data structure.
I'm envisioning a syntax such as the following, in a new project, e.g. MLOps.NET.Data.Tests:
[TestMethod]
public void VerifyCardinalityOfColumn()
{
    // The column at `index` must contain exactly 3 distinct values.
    var mlOpsTestingContext = new MLOpsTestingContext();
    mlOpsTestingContext.WithData(pathToData)
        .HasColumn(index, x => x.WithCardinality(3))
        .Assert();
}
[TestMethod]
public void VerifyRangeOfColumn()
{
    // Every value in the column at `index` must lie between 0 and 10000.
    var mlOpsTestingContext = new MLOpsTestingContext();
    mlOpsTestingContext.WithData(pathToData)
        .HasColumn(index, x => x.WithRange(min: 0, max: 10000))
        .Assert();
}
[TestMethod]
public void VerifySchema()
{
    // The data set must have 10 columns and at least 5000 rows.
    var mlOpsTestingContext = new MLOpsTestingContext();
    mlOpsTestingContext.WithData(pathToData)
        .HasNumberOfColumns(10)
        .HasMinimumNumberOfRows(5000)
        .Assert();
}
[TestMethod]
public void VerifyColumnOnlyContainsApprovedValues()
{
    // The column at `index` may only contain values from the approved list.
    var mlOpsTestingContext = new MLOpsTestingContext();
    mlOpsTestingContext.WithData(pathToData)
        .HasColumn(index, x => x.WithValues(listOfApprovedValues))
        .Assert();
}
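To make the proposal more concrete, here is a rough sketch of how such a fluent context could be put together. Nothing below exists in MLOps.NET today: `MLOpsTestingContext`, `ColumnExpectation`, the `separator` and `hasHeader` parameters, and the choice to throw `InvalidOperationException` from `Assert()` are all assumptions made for illustration. It also assumes a plain delimited text file and splits lines naively, so quoted fields are not handled.

```csharp
using System;
using System.Collections.Generic;
using System.Globalization;
using System.IO;
using System.Linq;

// Hypothetical sketch; these types are not part of MLOps.NET.
public sealed class ColumnExpectation
{
    private readonly int index;
    private readonly List<Action<List<string>, List<string>>> checks =
        new List<Action<List<string>, List<string>>>();

    public ColumnExpectation(int index) { this.index = index; }

    // Expect exactly `expected` distinct values in the column.
    public ColumnExpectation WithCardinality(int expected)
    {
        checks.Add((values, errors) =>
        {
            int actual = values.Distinct().Count();
            if (actual != expected)
                errors.Add($"Column {index}: expected {expected} distinct values, found {actual}.");
        });
        return this;
    }

    // Expect every value to parse as a number within [min, max].
    public ColumnExpectation WithRange(double min, double max)
    {
        checks.Add((values, errors) =>
        {
            foreach (string value in values)
            {
                bool ok = double.TryParse(value, NumberStyles.Float,
                              CultureInfo.InvariantCulture, out double number)
                          && number >= min && number <= max;
                if (!ok) { errors.Add($"Column {index}: '{value}' is outside [{min}, {max}]."); break; }
            }
        });
        return this;
    }

    // Expect every value to be one of the approved values.
    public ColumnExpectation WithValues(IEnumerable<string> approved)
    {
        var allowed = new HashSet<string>(approved);
        checks.Add((values, errors) =>
        {
            string unexpected = string.Join(", ", values.Where(v => !allowed.Contains(v)).Distinct());
            if (unexpected.Length > 0)
                errors.Add($"Column {index}: unapproved values: {unexpected}.");
        });
        return this;
    }

    internal void Evaluate(List<string> values, List<string> errors)
    {
        foreach (var check in checks) check(values, errors);
    }
}

public sealed class MLOpsTestingContext
{
    private readonly List<string[]> rows = new List<string[]>();
    private readonly List<Action<List<string>>> checks = new List<Action<List<string>>>();

    // Naive loader: splits each non-empty line on the separator, no quoting support.
    public MLOpsTestingContext WithData(string path, char separator = ',', bool hasHeader = true)
    {
        rows.Clear();
        rows.AddRange(File.ReadLines(path).Skip(hasHeader ? 1 : 0)
            .Where(line => line.Length > 0).Select(line => line.Split(separator)));
        return this;
    }

    public MLOpsTestingContext HasNumberOfColumns(int expected)
    {
        checks.Add(errors =>
        {
            if (rows.Any(r => r.Length != expected))
                errors.Add($"Expected every row to have {expected} columns.");
        });
        return this;
    }

    public MLOpsTestingContext HasMinimumNumberOfRows(int minimum)
    {
        checks.Add(errors =>
        {
            if (rows.Count < minimum)
                errors.Add($"Expected at least {minimum} rows, found {rows.Count}.");
        });
        return this;
    }

    public MLOpsTestingContext HasColumn(int index, Action<ColumnExpectation> configure)
    {
        var expectation = new ColumnExpectation(index);
        configure(expectation);
        checks.Add(errors =>
        {
            if (rows.Any(r => r.Length <= index)) { errors.Add($"Column {index} is missing in some rows."); return; }
            expectation.Evaluate(rows.Select(r => r[index]).ToList(), errors);
        });
        return this;
    }

    // Runs all registered checks and reports every failure at once.
    public void Assert()
    {
        var errors = new List<string>();
        foreach (var check in checks) check(errors);
        if (errors.Count > 0)
            throw new InvalidOperationException(string.Join(Environment.NewLine, errors));
    }
}
```

Deferring all checks until `Assert()` means a single test run reports every violated expectation rather than stopping at the first one. A real implementation would probably throw the test framework's own assertion exception and read data through ML.NET rather than raw text lines.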
@lqdev, while working on the workshop material, I thought something like this would be super useful for creating data validation tests as part of the pipeline.