Install mkl for PyTorch and libjpeg/libpng for TorchVision #8248
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/8248
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures
As of commit c5dcd2d with merge base b1d76c9.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Looks like you've got conda library problems, but this is clearly a PR I should accept, contingent on you seeing the llava-runner job improve.
Let me look more into that conda issue. This is rather unexpected; I have done a rebase, so the failure is legit.
Just FYI, the linking error is from this hack: https://github.com/pytorch/executorch/blob/main/.ci/docker/common/install_conda.sh#L46-L58
The run time is around 1h20m now (https://github.com/pytorch/executorch/actions/runs/13229090225/job/36927181503) vs. 2h30m before (https://github.com/pytorch/executorch/actions/runs/13213708877/job/36890417988).
eval_llama-mmlu seems to have gone from 1.9h to under 9 minutes: https://hud.pytorch.org/hud/pytorch/executorch/main/1?per_page=50&name_filter=mmlu&mergeLF=true
#8173 raised these timeouts. Now that #8248 has landed to fix #8180, we should be able to lower them again. (I'm sending this early so I don't forget; double-check the llava-runner running time.)

ghstack-source-id: cb4c1691907b8bb46a504a2d8cbc00d12b1ef4a4
ghstack-comment-id: 2648474106
Pull Request resolved: #8339
While debugging the build issue on #8322 w.r.t. mkl, I uncovered a complex interaction between #8322, #8248 (which installs mkl), and https://github.com/pytorch/pytorch/blob/main/cmake/public/mkl.cmake from PyTorch. The error is as follows:

```
CMake Error at /opt/conda/envs/py_3.10/lib/cmake/mkl/MKLConfig.cmake:744 (add_library):  <-- This file comes from conda mkl
  add_library cannot create imported target "MKL::MKL" because another target
  with the same name already exists.
Call Stack (most recent call first):
  /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/share/cmake/Caffe2/public/mkl.cmake:1 (find_package)  <-- this is from PyTorch
  /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:106 (include)
  /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package)
  CMakeLists.txt:753 (find_package)
```

The conclusion is that, with mkl installed, there should be only one `find_package(Torch)` call, because the mkl target is defined globally while the `torch` target is only defined locally. So, this change adds an `if(NOT TARGET torch)` check to call `find_package(Torch)` only when needed.

### Testing

The change on top of #8322 looks like this: f705b01, https://github.com/pytorch/executorch/actions/runs/13278590926?pr=8399
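For illustration, the guard described above might look roughly like the sketch below. The exact surrounding CMakeLists.txt context and the `CONFIG REQUIRED` arguments are assumptions; only the `if(NOT TARGET torch)` idea comes from this comment.

```cmake
# TorchConfig.cmake transitively includes Caffe2/public/mkl.cmake, which runs
# MKLConfig.cmake from the conda mkl package and defines the GLOBAL imported
# target MKL::MKL. A second find_package(Torch) in the same build tree then
# fails with "another target with the same name already exists".
# The torch target itself is only defined in the scope that called
# find_package, so check for it before calling find_package(Torch) again.
if(NOT TARGET torch)
  find_package(Torch CONFIG REQUIRED)
endif()
```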
Fixes #8180
I should have caught earlier that we didn't have mkl here; without it, PyTorch performance on x86 CPU suffered.