-
Notifications
You must be signed in to change notification settings - Fork 1.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
3b和7b模型加载都要超过一分钟以上,请完善。 #3181
Labels
User
The user ask question about how to use. Or don't use MNN correctly and cause bug.
Comments
关于加载:
|
谢谢!
获取Outlook for Android<https://aka.ms/AAb9ysg>
…________________________________
From: jxt1234 ***@***.***>
Sent: Tuesday, January 28, 2025 6:07:20 PM
To: alibaba/MNN ***@***.***>
Cc: ccbsdu ***@***.***>; Author ***@***.***>
Subject: Re: [alibaba/MNN] 3b和7b模型加载都要超过一分钟以上,请完善。 (Issue #3181)
关于加载:
1. OpenCL 加载会快一些
2. CPU 可以把 use_mmap 设成 true ,第一次加载后会产生 cache ,第二次加载后就快了。
关于模型格式:
3. 不会支持加载 GGUF 这类第三方推理用模型格式的加载,MNN 模型本身是完备的深度学习模型,基于开源模型,用 MNN 的工具转换成 MNN 模型推理即可。
4. 目前暂无增加 GGUF 转换到 MNN 工具的计划。
―
Reply to this email directly, view it on GitHub<#3181 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/BAMHZRBHI3MHJNJNSJX6DUD2M5JFRAVCNFSM6AAAAABV56M2W2VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDMMJYGU2DMMZUGY>.
You are receiving this because you authored the thread.Message ID: ***@***.***>
|
请问在哪里设置use_mmap =true |
修改 config.json : |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
3b和7b模型加载都要超过一分钟以上,请完善。我的安卓手机配置是最新的。另外,建议增加加载本地guff模型功能。目前的功能还是太简单,建议增加mermaid代码渲染功能,直接显示效果图。
The text was updated successfully, but these errors were encountered: