Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug]: TPCH 1T OOM during daily regression on standlone mode #21272

Open
1 task done
aressu1985 opened this issue Jan 17, 2025 · 4 comments
Open
1 task done

[Bug]: TPCH 1T OOM during daily regression on standlone mode #21272

aressu1985 opened this issue Jan 17, 2025 · 4 comments
Assignees
Labels
kind/bug Something isn't working long-term-job severity/s0 Extreme impact: Cause the application to break down and seriously affect the use
Milestone

Comments

@aressu1985
Copy link
Contributor

Is there an existing issue for the same bug?

  • I have checked the existing issues.

Branch Name

main

Commit ID

1392f1f

Other Environment Information

- Hardware parameters: 64C 256G
- OS type:
- Others:

Actual Behavior

job link:
https://github.com/matrixorigin/mo-auto-test/actions/runs/12806494085/job/35714847133

Image

heap:

LOG_7c4dccb4-4d3c-41f8-b482-5251dc7a41bf_heap_0194730b-d243-73d7-8677-bd962349d79a.gz
LOG_7c4dccb4-4d3c-41f8-b482-5251dc7a41bf_malloc_0194730c-5644-7caf-becb-3ff07048b42a.gz
LOG_7c4dccb4-4d3c-41f8-b482-5251dc7a41bf_malloc_0194730b-e521-7efe-b1cc-91f8c9eae971.gz
LOG_7c4dccb4-4d3c-41f8-b482-5251dc7a41bf_heap_0194730c-4772-7779-bcac-e42832a33207.gz

Expected Behavior

No response

Steps to Reproduce

DAILY REGRESSION

Additional information

No response

@aressu1985 aressu1985 added kind/bug Something isn't working needs-triage severity/s0 Extreme impact: Cause the application to break down and seriously affect the use labels Jan 17, 2025
@aressu1985 aressu1985 added this to the 2.1.0 milestone Jan 17, 2025
@badboynt1
Copy link
Contributor

tpch1T很久都没oom过了 这几个profile也都很小,都在几g到十几g之间。 麻烦 @reusee 看一下?

@reusee
Copy link
Contributor

reusee commented Jan 21, 2025

@aressu1985 这个是怎么判断是OOM的呢?从日志看,只能确定是mo-service崩溃了,连接关闭了。但mo-service崩溃的原因,不一定是OOM。从内存metrics看,是没有上涨的。这个怎么看mo-service的日志?

@reusee
Copy link
Contributor

reusee commented Jan 21, 2025

后面第 #78 #79 次跑,都没有报这个错误。所以应该不是哪里用多了内存导致的,不然每次跑都会报错的:

https://github.com/matrixorigin/mo-auto-test/actions/runs/12826858905

https://github.com/matrixorigin/mo-auto-test/actions/runs/12843042858

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Something isn't working long-term-job severity/s0 Extreme impact: Cause the application to break down and seriously affect the use
Projects
None yet
Development

No branches or pull requests

4 participants