MNN:Sync: Sync Internal 3.0.4 #3170

Merged
merged 1 commit into from
Jan 22, 2025

jxt1234
Collaborator

@jxt1234 jxt1234 commented Jan 22, 2025

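  1. Convolution 3D is temporarily implemented by converting it to Convolution 2D; compared with the current im2col + GEMM approach, this improves speed and reduces memory usage.
  2. To limit the memory used by ConvolutionTranspose, the operator's maximum thread count is capped at 4 when memory is not set to high.
  3. Improve the accuracy of dynamic quantization for non-1x1 convolutions.
  4. Add a script that improves weight-quantization accuracy, with support for using different quantization parameters for convolutions.
  5. Refactor the LLM export code, splitting it into files by function.
  6. Expose the diffusion-related interfaces.
  7. Further optimize OpenCL performance for LLM inference.
  8. Optimize Vulkan Image inference performance and add an autotuning mechanism.
  9. Fix an issue where, with multiple Modules running on the GPU and operator fallback occurring, the second inference could crash or produce incorrect results.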
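
The thread cap in item 2 can be illustrated with a minimal sketch. This is not MNN's code: `MemoryMode`, `DECONV_MAX_THREADS`, and `deconv_thread_count` are hypothetical names introduced here, and the only facts taken from the description are the limit of 4 and the condition that memory is not set to high.

```python
from enum import Enum


class MemoryMode(Enum):
    """Illustrative memory settings; hypothetical, not MNN's actual enum."""
    LOW = "low"
    NORMAL = "normal"
    HIGH = "high"


# Cap taken from the PR description; the constant name is an assumption.
DECONV_MAX_THREADS = 4


def deconv_thread_count(requested: int, memory: MemoryMode) -> int:
    """Return the thread count a ConvolutionTranspose op would be allowed.

    The requested count is kept as-is when memory is HIGH; otherwise it is
    capped at DECONV_MAX_THREADS, since each extra thread is assumed to
    need its own scratch memory.
    """
    if requested < 1:
        raise ValueError("requested thread count must be at least 1")
    if memory is MemoryMode.HIGH:
        return requested
    return min(requested, DECONV_MAX_THREADS)
```

Under these assumptions, a request for 8 threads is reduced to 4 in the low and normal modes and left at 8 in high mode, while a request for 2 threads is never changed.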

@CLAassistant

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.


xiaying seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you already have a GitHub account, please add the email address used for this commit to your account.

@jxt1234 jxt1234 force-pushed the feature/sync branch 3 times, most recently from dc895c9 to c9104fb Compare January 22, 2025 07:54
@jxt1234 jxt1234 merged commit b23b55b into master Jan 22, 2025
18 of 19 checks passed
@jxt1234 jxt1234 deleted the feature/sync branch January 22, 2025 10:47