Benchmark

Machine:

This time, we only provide benchmark on CPU. In the near future, we will add benchmark on ARM and GPU.

System: CentOS 7 in Docker, for benchmark between Anakin and Tensorflow
System: CentOS 6.3, for benchmark between Anakin and Paddle

Counterpart of anakin :

The counterpart of Anakin is Tensorflow 1.8.0, which installed by Anaconda 4.5.4, run by Python 3.6

Benchmark Model

You can use pretrained model or the model trained by youself.

Please note that you should transform fluid model or others into anakin model with the help of external converter ->

Language model fluid model can be found here->
Chinese_ner fluid model can be found here->
text_classification fluid model can be found here->

We tested them on single-CPU with different thread numbers.

Anakin VS Tensorflow

language model in i7-7700

Latency (ms) of one batch

ThreadNum Tensorflow Anakin

1 5.64 2.44

2 8.29 4.44

4 14.23 9.91

6 19.83 15.51
Throughput (words/s)

ThreadNum Tensorflow Anakin

1 3459 8536

2 4772 9399

4 5498 8418

6 5764 8070

language model in E5-2620 v4

Latency (ms) of one batch

ThreadNum Tensorflow Anakin

1 6.31 2.84

2 7.94 2.678

4 8.66 4.32

6 12.33 7.12
Throughput (words/s)

ThreadNum Tensorflow Anakin

1 2890 7257

2 4726 15439

4 8659 18351

6 9414 17461

language model in E5-2650 v4

Latency (ms) of one batch

ThreadNum Tensorflow Anakin

1 3.69 2.84

2 4.62 2.85

4 7.78 3.48

6 13.54 4.79
Throughput (words/s)

ThreadNum Tensorflow Anakin

1 4456 7300

2 7522 14556

4 9580 22086

6 8664 23938

text_classfication model in i7-7700

Latency (ms) of one batch

ThreadNum Tensorflow Anakin

1 1.25 0.32

2 1.87 0.33

4 2.01 0.35

6 2.81 0.58
Throughput (words/s)

ThreadNum Tensorflow Anakin

1 12797 53506

2 17933 95898

4 31965 148427

6 31784 118684

text_classfication in E5-2620 v4

Latency (ms) of one batch

ThreadNum Tensorflow Anakin

1 3.89 0.58

2 3.77 0.61

4 3.05 0.62

6 3.84 0.66
Throughput (words/s)

ThreadNum Tensorflow Anakin

1 4281 28192

2 8804 49840

4 19949 89710

6 24798 116975

text_classfication in E5-2650 v4

Latency (ms) of one batch

ThreadNum Tensorflow Anakin

1 2.26 0.67

2 2.34 0.7

4 2.25 0.72

6 2.47 0.73
Throughput (words/s)

ThreadNum Tensorflow Anakin

1 6337 24636

2 12266 45368

4 24869 81952

6 34872 109993

chinese_ner model in i7-7700

Latency (ms) of one batch

ThreadNum Tensorflow Anakin

1 1.96 0.094

2 2.59 0.098

4 3.74 0.1

6 3.95 0.13
Throughput (words/s)

ThreadNum Tensorflow Anakin

1 8747 156564

2 13293 208484

4 18294 114348

6 25338 66480

chinese_ner in E5-2620 v4

Latency (ms) of one batch

ThreadNum Tensorflow Anakin

1 5.44 0.13

2 5.45 0.14

4 4.84 0.15

6 5.18 0.16
Throughput (words/s)

ThreadNum Tensorflow Anakin

1 4281 93527

2 8804 127232

4 19949 118649

6 24798 99553

chinese_ner in E5-2650 v4

Latency (ms) of one batch

ThreadNum Tensorflow Anakin

1 3.61 0.16

2 3.78 0.16

4 3.74 0.17

6 3.78 0.16
Throughput (words/s)

ThreadNum Tensorflow Anakin

1 4669 79225

2 8953 115761

4 18074 118696

6 26607 102044

Anakin VS PaddlePaddle/Fluid
We use private dataset and different QPS index in this benchmark.

language model in E5-2650 v4

Latency (ms) of one batch

ThreadNum Fluid Anakin

1 42.7418 1.93589

2 42.7418 2.49537

6 42.7734 3.14332

10 43.0721 4.55329

12 42.8501 5.09893
Throughput (sentence/s)

ThreadNum Fluid Anakin

1 23 504

2 46 762

6 134 1393

10 218 1556

12 260 1541

Chinese_ner model in E5-2650 v4

Latency (ms) of one batch

ThreadNum Fluid Anakin

1 0.380475 0.17034

4 0.380475 0.171143

6 0.380475 0.172688

10 0.380475 0.173269

12 0.380475 0.17668
Throughput (sentence/s)

ThreadNum Fluid Anakin

1 7844 5822

4 7844 11377

6 7844 29725

10 7844 41238

12 7844 42790

text_classfication model in E5-2650 v4

Latency (ms) of one batch

ThreadNum Fluid Anakin

1 1.48578 1.10088

4 1.54025 1.11258

6 1.68529 1.1257

10 1.9817 1.13267

12 2.21864 1.1429
Throughput (sentence/s)

ThreadNum Fluid Anakin

1 673 901

4 1289 1665

6 3458 4449

10 4875 6183

12 5265 6188

How to run those Benchmark models?

You can just run sh benchmark_tensorflow.sh and sh benchmark_anakin.sh

Get the model of caffe or fluid, convert model to anakin model, use net_test_*** to test your model.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README_CPU.md

README_CPU.md

Benchmark

Machine:

Counterpart of anakin :

Benchmark Model

language model in i7-7700

language model in E5-2620 v4

language model in E5-2650 v4

text_classfication model in i7-7700

text_classfication in E5-2620 v4

text_classfication in E5-2650 v4

chinese_ner model in i7-7700

chinese_ner in E5-2620 v4

chinese_ner in E5-2650 v4

language model in E5-2650 v4

Chinese_ner model in E5-2650 v4

text_classfication model in E5-2650 v4

How to run those Benchmark models?

ThreadNum	Fluid	Anakin
1	42.7418	1.93589
2	42.7418	2.49537
6	42.7734	3.14332
10	43.0721	4.55329
12	42.8501	5.09893

ThreadNum	Fluid	Anakin
1	0.380475	0.17034
4	0.380475	0.171143
6	0.380475	0.172688
10	0.380475	0.173269
12	0.380475	0.17668

ThreadNum	Fluid	Anakin
1	1.48578	1.10088
4	1.54025	1.11258
6	1.68529	1.1257
10	1.9817	1.13267
12	2.21864	1.1429

ThreadNum	Tensorflow	Anakin
1	5.64	2.44
2	8.29	4.44
4	14.23	9.91
6	19.83	15.51

ThreadNum	Tensorflow	Anakin
1	3459	8536
2	4772	9399
4	5498	8418
6	5764	8070

ThreadNum	Tensorflow	Anakin
1	6.31	2.84
2	7.94	2.678
4	8.66	4.32
6	12.33	7.12

ThreadNum	Tensorflow	Anakin
1	2890	7257
2	4726	15439
4	8659	18351
6	9414	17461

Files

README_CPU.md

Latest commit

History

README_CPU.md

File metadata and controls

Benchmark

Machine:

Counterpart of anakin :

Benchmark Model

language model in i7-7700

language model in E5-2620 v4

language model in E5-2650 v4

text_classfication model in i7-7700

text_classfication in E5-2620 v4

text_classfication in E5-2650 v4

chinese_ner model in i7-7700

chinese_ner in E5-2620 v4

chinese_ner in E5-2650 v4

language model in E5-2650 v4

Chinese_ner model in E5-2650 v4

text_classfication model in E5-2650 v4

How to run those Benchmark models?