Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cannot compile IPEX for Intel Arc A370M (DG2) - Compile errors #790

Open
staropram opened this issue Feb 24, 2025 · 8 comments
Open

Cannot compile IPEX for Intel Arc A370M (DG2) - Compile errors #790

staropram opened this issue Feb 24, 2025 · 8 comments
Assignees

Comments

@staropram
Copy link

Describe the issue

Dear Intel IPEX Gods/Godesses,

Have pity on a humble man of the digital path, striving to incise truth and meaning from the chaos of build hell. Help me, oh wise ones, to find the correct incantations.

I have been struggling for the last 2 days trying to get intel-extension-for-pytorch working/compiled without success.

I went down a few wrong avenues, tried the existing packages (no detection of A370M), tried to just clone the repo and build but without success. Then I discovered intel's official build instructions tada, and this compile_bundle.sh approach...

So where am I, let's begin there:

  • GPU: Intel Arc A370M (DG2)
  • OS: Lubuntu 24.10
  • Python: 3.12.7 (Virtualenv)
  • oneAPI: Installed in /home/ash/intel/oneapi/setvars.sh
  • GCC Version: 13 (also tried GCC 12)
  • Build Method: Source build (compile_bundle.sh)

compile_bundle.sh uses both pip and conda, I was using a python -m venv environment not conda, so then I had to PATH the bin dir in from miniconda3 in order for your script to install libpng via this on line 264: conda install -y conda-forge::libpng conda-forge::libjpeg-turbo. It looked like I got away with this conda venv inception trick, but do I need to go full jungle anaconda on this and start again?

So anyway, I'm running the script like this:

./compile_bundle.sh /home/ash/intel/oneapi dg2

Is that right, oneapi installed from the official installer and that's where he lives, and then dg2 is supposedly the right mapping for AOT for the A370M, is that all correct?

So here's the first important question: is this actually supposed to work with the A370M? If it works am I going to get a GPU accelerated pytorch on my A370M? I'm not training grok5 here, I'm just playing with some teeny tiny models, but any speedup would be very welcome. If it won't ever work then I may as well stop now and buy an old gaming laptop with an NVIDIA GPU .

So I run the compilation and it starts compiling 8509 objects, the laptop has an i7-12700H but it takes some time as you'd imagine, and I lost track but this was the failure point:

[750/780] Building CXX object src/gpu/intel/ocl/CMakeFiles/dnnl_gpu_intel_ocl.dir/stream_profiler.cpp.o [751/780] Building CXX object src/gpu/intel/ocl/CMakeFiles/dnnl_gpu_intel_ocl.dir/ref_reorder.cpp.o [752/780] Building CXX object src/gpu/intel/ocl/CMakeFiles/dnnl_gpu_intel_ocl.dir/reusable_softmax.cpp.o [753/780] Building CXX object src/gpu/intel/ocl/CMakeFiles/dnnl_gpu_intel_ocl.dir/simple_concat.cpp.o [754/780] Building CXX object src/gpu/intel/ocl/CMakeFiles/dnnl_gpu_intel_ocl.dir/rnn/rnn_utils.cpp.o [755/780] Building CXX object src/gpu/intel/ocl/CMakeFiles/dnnl_gpu_intel_ocl.dir/simple_sum.cpp.o [756/780] Building CXX object src/gpu/intel/ocl/CMakeFiles/dnnl_gpu_intel_ocl.dir/vectorized_lnorm.cpp.o [757/780] Building CXX object src/gpu/intel/ocl/CMakeFiles/dnnl_gpu_intel_ocl.dir/rnn/ref_postgemm.cpp.o [758/780] Building CXX object src/gpu/intel/ocl/CMakeFiles/dnnl_gpu_intel_ocl.dir/vectorized_resampling.cpp.o [759/780] Building CXX object src/gpu/intel/ocl/CMakeFiles/dnnl_gpu_intel_ocl.dir/ocl_kernel_list.cpp.o In file included from /home/ash/build/pytorch/build/xpu_mkldnn_proj-prefix/src/xpu_mkldnn_proj-build/src/gpu/intel/ocl/ocl_kernel_list.cpp:22: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/ocl/kernel_utils.hpp:24: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/ocl/ocl_gpu_engine.hpp:48:14: warning: 'dnnl::impl::gpu::intel::ocl::ocl_gpu_engine_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 48 | status_t create_stream(stream_t **stream, cl_command_queue queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs 'cl_command_queue' (aka '_cl_command_queue *')) 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ 1 warning generated. [760/780] Building CXX object src/sycl/CMakeFiles/dnnl_sycl.dir/sycl_stream_submit_cpu_primitive.cpp.o [761/780] Building CXX object src/gpu/intel/ocl/CMakeFiles/dnnl_gpu_intel_ocl.dir/rnn/rnn_reorders.cpp.o [762/780] Building CXX object src/sycl/CMakeFiles/dnnl_sycl.dir/sycl_compat.cpp.o In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_compat.cpp:21: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_utils.hpp:23: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/ocl/ocl_gpu_engine.hpp:48:14: warning: 'dnnl::impl::gpu::intel::ocl::ocl_gpu_engine_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 48 | status_t create_stream(stream_t **stream, cl_command_queue queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs 'cl_command_queue' (aka '_cl_command_queue *')) 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_compat.cpp:37: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:63:14: warning: 'dnnl::impl::sycl::sycl_engine_base_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 63 | status_t create_stream(stream_t **stream, ::sycl::queue &queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs '::sycl::queue &') 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ 2 warnings generated. [763/780] Building CXX object src/sycl/CMakeFiles/dnnl_sycl.dir/capi/capi_memory.cpp.o In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/capi/capi_memory.cpp:25: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine.hpp:30: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/sycl/sycl_gpu_engine.hpp:23: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:26: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/ocl/ocl_gpu_engine.hpp:48:14: warning: 'dnnl::impl::gpu::intel::ocl::ocl_gpu_engine_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 48 | status_t create_stream(stream_t **stream, cl_command_queue queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs 'cl_command_queue' (aka '_cl_command_queue *')) 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/capi/capi_memory.cpp:25: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine.hpp:30: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/sycl/sycl_gpu_engine.hpp:23: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:63:14: warning: 'dnnl::impl::sycl::sycl_engine_base_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 63 | status_t create_stream(stream_t **stream, ::sycl::queue &queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs '::sycl::queue &') 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ 2 warnings generated. [764/780] Building CXX object src/sycl/CMakeFiles/dnnl_sycl.dir/sycl_buffer_memory_storage.cpp.o In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_buffer_memory_storage.cpp:17: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_buffer_memory_storage.hpp:26: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_memory_storage_base.hpp:21: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/sycl/sycl_types.hpp:24: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_compat.hpp:20: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_utils.hpp:23: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/ocl/ocl_gpu_engine.hpp:48:14: warning: 'dnnl::impl::gpu::intel::ocl::ocl_gpu_engine_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 48 | status_t create_stream(stream_t **stream, cl_command_queue queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs 'cl_command_queue' (aka '_cl_command_queue *')) 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_buffer_memory_storage.cpp:18: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:63:14: warning: 'dnnl::impl::sycl::sycl_engine_base_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 63 | status_t create_stream(stream_t **stream, ::sycl::queue &queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs '::sycl::queue &') 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ 2 warnings generated. [765/780] Building CXX object src/sycl/CMakeFiles/dnnl_sycl.dir/capi/capi_primitive.cpp.o In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/capi/capi_primitive.cpp:25: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine.hpp:30: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/sycl/sycl_gpu_engine.hpp:23: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:26: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/ocl/ocl_gpu_engine.hpp:48:14: warning: 'dnnl::impl::gpu::intel::ocl::ocl_gpu_engine_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 48 | status_t create_stream(stream_t **stream, cl_command_queue queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs 'cl_command_queue' (aka '_cl_command_queue *')) 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/capi/capi_primitive.cpp:25: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine.hpp:30: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/sycl/sycl_gpu_engine.hpp:23: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:63:14: warning: 'dnnl::impl::sycl::sycl_engine_base_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 63 | status_t create_stream(stream_t **stream, ::sycl::queue &queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs '::sycl::queue &') 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ 2 warnings generated. [766/780] Building CXX object src/gpu/intel/ocl/CMakeFiles/dnnl_gpu_intel_ocl.dir/rnn/ref_rnn.cpp.o [767/780] Building CXX object src/sycl/CMakeFiles/dnnl_sycl.dir/capi/capi_stream.cpp.o In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/capi/capi_stream.cpp:23: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine.hpp:30: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/sycl/sycl_gpu_engine.hpp:23: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:26: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/ocl/ocl_gpu_engine.hpp:48:14: warning: 'dnnl::impl::gpu::intel::ocl::ocl_gpu_engine_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 48 | status_t create_stream(stream_t **stream, cl_command_queue queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs 'cl_command_queue' (aka '_cl_command_queue *')) 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/capi/capi_stream.cpp:23: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine.hpp:30: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/sycl/sycl_gpu_engine.hpp:23: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:63:14: warning: 'dnnl::impl::sycl::sycl_engine_base_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 63 | status_t create_stream(stream_t **stream, ::sycl::queue &queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs '::sycl::queue &') 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ 2 warnings generated. [768/780] Building CXX object src/sycl/CMakeFiles/dnnl_sycl.dir/level_zero_utils.cpp.o In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/level_zero_utils.cpp:17: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/level_zero_utils.hpp:25: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_compat.hpp:20: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_utils.hpp:23: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/ocl/ocl_gpu_engine.hpp:48:14: warning: 'dnnl::impl::gpu::intel::ocl::ocl_gpu_engine_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 48 | status_t create_stream(stream_t **stream, cl_command_queue queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs 'cl_command_queue' (aka '_cl_command_queue *')) 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/level_zero_utils.cpp:46: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:63:14: warning: 'dnnl::impl::sycl::sycl_engine_base_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 63 | status_t create_stream(stream_t **stream, ::sycl::queue &queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs '::sycl::queue &') 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ 2 warnings generated. [769/780] Building CXX object src/sycl/CMakeFiles/dnnl_sycl.dir/sycl_engine.cpp.o In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine.cpp:17: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine.hpp:30: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/sycl/sycl_gpu_engine.hpp:23: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:26: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/ocl/ocl_gpu_engine.hpp:48:14: warning: 'dnnl::impl::gpu::intel::ocl::ocl_gpu_engine_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 48 | status_t create_stream(stream_t **stream, cl_command_queue queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs 'cl_command_queue' (aka '_cl_command_queue *')) 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine.cpp:17: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine.hpp:30: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/sycl/sycl_gpu_engine.hpp:23: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:63:14: warning: 'dnnl::impl::sycl::sycl_engine_base_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 63 | status_t create_stream(stream_t **stream, ::sycl::queue &queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs '::sycl::queue &') 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ 2 warnings generated. [770/780] Building CXX object src/sycl/CMakeFiles/dnnl_sycl.dir/capi/capi_engine.cpp.o In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/capi/capi_engine.cpp:22: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine.hpp:30: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/sycl/sycl_gpu_engine.hpp:23: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:26: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/ocl/ocl_gpu_engine.hpp:48:14: warning: 'dnnl::impl::gpu::intel::ocl::ocl_gpu_engine_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 48 | status_t create_stream(stream_t **stream, cl_command_queue queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs 'cl_command_queue' (aka '_cl_command_queue *')) 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/capi/capi_engine.cpp:22: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine.hpp:30: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/sycl/sycl_gpu_engine.hpp:23: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:63:14: warning: 'dnnl::impl::sycl::sycl_engine_base_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 63 | status_t create_stream(stream_t **stream, ::sycl::queue &queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs '::sycl::queue &') 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ 2 warnings generated. [771/780] Building CXX object src/sycl/CMakeFiles/dnnl_sycl.dir/sycl_memory_storage_base.cpp.o In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_memory_storage_base.cpp:17: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_memory_storage_base.hpp:21: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/sycl/sycl_types.hpp:24: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_compat.hpp:20: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_utils.hpp:23: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/ocl/ocl_gpu_engine.hpp:48:14: warning: 'dnnl::impl::gpu::intel::ocl::ocl_gpu_engine_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 48 | status_t create_stream(stream_t **stream, cl_command_queue queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs 'cl_command_queue' (aka '_cl_command_queue *')) 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_memory_storage_base.cpp:18: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_stream.hpp:28: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/sycl/sycl_gpu_engine.hpp:23: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:63:14: warning: 'dnnl::impl::sycl::sycl_engine_base_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 63 | status_t create_stream(stream_t **stream, ::sycl::queue &queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs '::sycl::queue &') 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ 2 warnings generated. [772/780] Building CXX object src/sycl/CMakeFiles/dnnl_sycl.dir/sycl_device_info.cpp.o In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_device_info.cpp:18: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_compat.hpp:20: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_utils.hpp:23: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/ocl/ocl_gpu_engine.hpp:48:14: warning: 'dnnl::impl::gpu::intel::ocl::ocl_gpu_engine_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 48 | status_t create_stream(stream_t **stream, cl_command_queue queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs 'cl_command_queue' (aka '_cl_command_queue *')) 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_device_info.cpp:24: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/sycl/sycl_gpu_engine.hpp:23: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:63:14: warning: 'dnnl::impl::sycl::sycl_engine_base_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 63 | status_t create_stream(stream_t **stream, ::sycl::queue &queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs '::sycl::queue &') 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ 2 warnings generated. [773/780] Building CXX object src/sycl/CMakeFiles/dnnl_sycl.dir/sycl_engine_base.cpp.o In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.cpp:17: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:26: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/ocl/ocl_gpu_engine.hpp:48:14: warning: 'dnnl::impl::gpu::intel::ocl::ocl_gpu_engine_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 48 | status_t create_stream(stream_t **stream, cl_command_queue queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs 'cl_command_queue' (aka '_cl_command_queue *')) 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.cpp:17: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:63:14: warning: 'dnnl::impl::sycl::sycl_engine_base_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 63 | status_t create_stream(stream_t **stream, ::sycl::queue &queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs '::sycl::queue &') 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ 2 warnings generated. [774/780] Building CXX object src/sycl/CMakeFiles/dnnl_sycl.dir/stream_profiler.cpp.o In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/stream_profiler.cpp:24: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_stream.hpp:28: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/sycl/sycl_gpu_engine.hpp:23: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:26: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/ocl/ocl_gpu_engine.hpp:48:14: warning: 'dnnl::impl::gpu::intel::ocl::ocl_gpu_engine_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 48 | status_t create_stream(stream_t **stream, cl_command_queue queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs 'cl_command_queue' (aka '_cl_command_queue *')) 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/stream_profiler.cpp:24: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_stream.hpp:28: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/sycl/sycl_gpu_engine.hpp:23: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:63:14: warning: 'dnnl::impl::sycl::sycl_engine_base_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 63 | status_t create_stream(stream_t **stream, ::sycl::queue &queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs '::sycl::queue &') 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ 2 warnings generated. [775/780] Building CXX object src/sycl/CMakeFiles/dnnl_sycl.dir/sycl_stream.cpp.o In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_stream.cpp:17: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_stream.hpp:28: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/sycl/sycl_gpu_engine.hpp:23: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:26: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/ocl/ocl_gpu_engine.hpp:48:14: warning: 'dnnl::impl::gpu::intel::ocl::ocl_gpu_engine_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 48 | status_t create_stream(stream_t **stream, cl_command_queue queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs 'cl_command_queue' (aka '_cl_command_queue *')) 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_stream.cpp:17: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_stream.hpp:28: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/sycl/sycl_gpu_engine.hpp:23: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:63:14: warning: 'dnnl::impl::sycl::sycl_engine_base_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 63 | status_t create_stream(stream_t **stream, ::sycl::queue &queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs '::sycl::queue &') 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ 2 warnings generated. [776/780] Building CXX object src/sycl/CMakeFiles/dnnl_sycl.dir/sycl_stream_cpu_thunk.cpp.o In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_stream_cpu_thunk.cpp:23: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_stream.hpp:28: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/sycl/sycl_gpu_engine.hpp:23: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:26: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/ocl/ocl_gpu_engine.hpp:48:14: warning: 'dnnl::impl::gpu::intel::ocl::ocl_gpu_engine_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 48 | status_t create_stream(stream_t **stream, cl_command_queue queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs 'cl_command_queue' (aka '_cl_command_queue *')) 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_stream_cpu_thunk.cpp:23: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_stream.hpp:28: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/sycl/sycl_gpu_engine.hpp:23: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:63:14: warning: 'dnnl::impl::sycl::sycl_engine_base_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 63 | status_t create_stream(stream_t **stream, ::sycl::queue &queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs '::sycl::queue &') 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ 2 warnings generated. [777/780] Building CXX object src/sycl/CMakeFiles/dnnl_sycl.dir/sycl_usm_memory_storage.cpp.o In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_usm_memory_storage.cpp:17: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_usm_memory_storage.hpp:24: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:26: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/ocl/ocl_gpu_engine.hpp:48:14: warning: 'dnnl::impl::gpu::intel::ocl::ocl_gpu_engine_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 48 | status_t create_stream(stream_t **stream, cl_command_queue queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs 'cl_command_queue' (aka '_cl_command_queue *')) 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_usm_memory_storage.cpp:17: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_usm_memory_storage.hpp:24: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:63:14: warning: 'dnnl::impl::sycl::sycl_engine_base_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 63 | status_t create_stream(stream_t **stream, ::sycl::queue &queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs '::sycl::queue &') 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ 2 warnings generated. [778/780] Building CXX object src/sycl/CMakeFiles/dnnl_sycl.dir/sycl_utils.cpp.o In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_utils.cpp:17: In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_utils.hpp:23: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/ocl/ocl_gpu_engine.hpp:48:14: warning: 'dnnl::impl::gpu::intel::ocl::ocl_gpu_engine_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 48 | status_t create_stream(stream_t **stream, cl_command_queue queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs 'cl_command_queue' (aka '_cl_command_queue *')) 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_utils.cpp:20: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/sycl/sycl_engine_base.hpp:63:14: warning: 'dnnl::impl::sycl::sycl_engine_base_t::create_stream' hides overloaded virtual function [-Woverloaded-virtual] 63 | status_t create_stream(stream_t **stream, ::sycl::queue &queue); | ^ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/common/engine.hpp:89:34: note: hidden overloaded virtual function 'dnnl_engine::create_stream' declared here: type mismatch at 2nd parameter ('dnnl::threadpool_interop::threadpool_iface *' vs '::sycl::queue &') 89 | virtual dnnl::impl::status_t create_stream(dnnl::impl::stream_t **stream, | ^ 2 warnings generated. [779/780] Building CXX object src/gpu/intel/jit/CMakeFiles/dnnl_gpu_intel_jit.dir/gemm/gen_gemm_kernel_generator.cpp.o /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp: In member function ‘void dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::doAlternateCRemainder(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::Gen9]’: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:8825:48: warning: writing 2 bytes into a region of size 0 [-Wstringop-overflow=] 8825 | FOR_EACH_C header[q] = state.ra.alloc_range(hregs); | ~~~~~~~~~~~~~~~~~~~~^~~~~~~ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:8823:14: note: at offset 4 into destination object ‘header’ of size 4 8823 | GRFRange header[2]; | ^~~~~~ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp: In member function ‘void dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::doAlternateCRemainder(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::Gen11]’: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:8825:48: warning: writing 2 bytes into a region of size 0 [-Wstringop-overflow=] 8825 | FOR_EACH_C header[q] = state.ra.alloc_range(hregs); | ~~~~~~~~~~~~~~~~~~~~^~~~~~~ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:8823:14: note: at offset 4 into destination object ‘header’ of size 4 8823 | GRFRange header[2]; | ^~~~~~ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp: In member function ‘void dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::doAlternateCRemainder(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::XeLP]’: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:8825:48: warning: writing 2 bytes into a region of size 0 [-Wstringop-overflow=] 8825 | FOR_EACH_C header[q] = state.ra.alloc_range(hregs); | ~~~~~~~~~~~~~~~~~~~~^~~~~~~ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:8823:14: note: at offset 4 into destination object ‘header’ of size 4 8823 | GRFRange header[2]; | ^~~~~~ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp: In member function ‘void dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::doAlternateCRemainder(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::XeHP]’: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:8825:48: warning: writing 2 bytes into a region of size 0 [-Wstringop-overflow=] 8825 | FOR_EACH_C header[q] = state.ra.alloc_range(hregs); | ~~~~~~~~~~~~~~~~~~~~^~~~~~~ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:8823:14: note: at offset 4 into destination object ‘header’ of size 4 8823 | GRFRange header[2]; | ^~~~~~ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp: In member function ‘void dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::doAlternateCRemainder(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::XeHPG]’: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:8825:48: warning: writing 2 bytes into a region of size 0 [-Wstringop-overflow=] 8825 | FOR_EACH_C header[q] = state.ra.alloc_range(hregs); | ~~~~~~~~~~~~~~~~~~~~^~~~~~~ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:8823:14: note: at offset 4 into destination object ‘header’ of size 4 8823 | GRFRange header[2]; | ^~~~~~ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp: In member function ‘void dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::doAlternateCRemainder(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::XeHPC]’: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:8825:48: warning: writing 2 bytes into a region of size 0 [-Wstringop-overflow=] 8825 | FOR_EACH_C header[q] = state.ra.alloc_range(hregs); | ~~~~~~~~~~~~~~~~~~~~^~~~~~~ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:8823:14: note: at offset 4 into destination object ‘header’ of size 4 8823 | GRFRange header[2]; | ^~~~~~ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp: In member function ‘void dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::doAlternateCRemainder(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::Xe2]’: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:8825:48: warning: writing 2 bytes into a region of size 0 [-Wstringop-overflow=] 8825 | FOR_EACH_C header[q] = state.ra.alloc_range(hregs); | ~~~~~~~~~~~~~~~~~~~~^~~~~~~ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:8823:14: note: at offset 4 into destination object ‘header’ of size 4 8823 | GRFRange header[2]; | ^~~~~~ In file included from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/ngen/ngen.hpp:37, from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/jit_generator.hpp:23, from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.hpp:31, from /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:25: In member function ‘void ngen::GRFRange::invalidate()’, inlined from ‘void ngen::RegisterAllocator::safeRelease(RD&) [with RD = ngen::GRFRange]’ at /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/../ngen/ngen_register_allocator.hpp:159:61, inlined from ‘bool dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::gemmAccessC(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::Gen9]’ at /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:17957:33: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/ngen/ngen_core.hpp:1373:34: warning: writing 1 byte into a region of size 0 [-Wstringop-overflow=] 1373 | void invalidate() { len = invalidLen; } | ~~~~^~~~~~~~~~~~ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp: In member function ‘bool dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::gemmAccessC(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::Gen9]’: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:17899:18: note: at offset 5 into destination object ‘C_addr0’ of size 4 17899 | GRFRange C_addr0[2], C_addr0Unmasked[2]; | ^~~~~~~ In member function ‘void ngen::GRFRange::invalidate()’, inlined from ‘void ngen::RegisterAllocator::safeRelease(RD&) [with RD = ngen::GRFRange]’ at /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/../ngen/ngen_register_allocator.hpp:159:61, inlined from ‘bool dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::gemmAccessC(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::Gen11]’ at /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:17957:33: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/ngen/ngen_core.hpp:1373:34: warning: writing 1 byte into a region of size 0 [-Wstringop-overflow=] 1373 | void invalidate() { len = invalidLen; } | ~~~~^~~~~~~~~~~~ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp: In member function ‘bool dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::gemmAccessC(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::Gen11]’: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:17899:18: note: at offset 5 into destination object ‘C_addr0’ of size 4 17899 | GRFRange C_addr0[2], C_addr0Unmasked[2]; | ^~~~~~~ In member function ‘void ngen::GRFRange::invalidate()’, inlined from ‘void ngen::RegisterAllocator::safeRelease(RD&) [with RD = ngen::GRFRange]’ at /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/../ngen/ngen_register_allocator.hpp:159:61, inlined from ‘bool dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::gemmAccessC(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::XeLP]’ at /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:17957:33: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/ngen/ngen_core.hpp:1373:34: warning: writing 1 byte into a region of size 0 [-Wstringop-overflow=] 1373 | void invalidate() { len = invalidLen; } | ~~~~^~~~~~~~~~~~ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp: In member function ‘bool dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::gemmAccessC(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::XeLP]’: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:17899:18: note: at offset 5 into destination object ‘C_addr0’ of size 4 17899 | GRFRange C_addr0[2], C_addr0Unmasked[2]; | ^~~~~~~ In member function ‘void ngen::GRFRange::invalidate()’, inlined from ‘void ngen::RegisterAllocator::safeRelease(RD&) [with RD = ngen::GRFRange]’ at /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/../ngen/ngen_register_allocator.hpp:159:61, inlined from ‘bool dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::gemmAccessC(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::XeHP]’ at /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:17957:33: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/ngen/ngen_core.hpp:1373:34: warning: writing 1 byte into a region of size 0 [-Wstringop-overflow=] 1373 | void invalidate() { len = invalidLen; } | ~~~~^~~~~~~~~~~~ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp: In member function ‘bool dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::gemmAccessC(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::XeHP]’: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:17899:18: note: at offset 5 into destination object ‘C_addr0’ of size 4 17899 | GRFRange C_addr0[2], C_addr0Unmasked[2]; | ^~~~~~~ In member function ‘void ngen::GRFRange::invalidate()’, inlined from ‘void ngen::RegisterAllocator::safeRelease(RD&) [with RD = ngen::GRFRange]’ at /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/../ngen/ngen_register_allocator.hpp:159:61, inlined from ‘bool dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::gemmAccessC(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::XeHPG]’ at /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:17957:33: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/ngen/ngen_core.hpp:1373:34: warning: writing 1 byte into a region of size 0 [-Wstringop-overflow=] 1373 | void invalidate() { len = invalidLen; } | ~~~~^~~~~~~~~~~~ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp: In member function ‘bool dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::gemmAccessC(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::XeHPG]’: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:17899:18: note: at offset 5 into destination object ‘C_addr0’ of size 4 17899 | GRFRange C_addr0[2], C_addr0Unmasked[2]; | ^~~~~~~ In member function ‘void ngen::GRFRange::invalidate()’, inlined from ‘void ngen::RegisterAllocator::safeRelease(RD&) [with RD = ngen::GRFRange]’ at /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/../ngen/ngen_register_allocator.hpp:159:61, inlined from ‘bool dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::gemmAccessC(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::XeHPC]’ at /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:17957:33: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/ngen/ngen_core.hpp:1373:34: warning: writing 1 byte into a region of size 0 [-Wstringop-overflow=] 1373 | void invalidate() { len = invalidLen; } | ~~~~^~~~~~~~~~~~ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp: In member function ‘bool dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::gemmAccessC(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::XeHPC]’: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:17899:18: note: at offset 5 into destination object ‘C_addr0’ of size 4 17899 | GRFRange C_addr0[2], C_addr0Unmasked[2]; | ^~~~~~~ In member function ‘void ngen::GRFRange::invalidate()’, inlined from ‘void ngen::RegisterAllocator::safeRelease(RD&) [with RD = ngen::GRFRange]’ at /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/../ngen/ngen_register_allocator.hpp:159:61, inlined from ‘bool dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::gemmAccessC(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::Xe2]’ at /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:17957:33: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/ngen/ngen_core.hpp:1373:34: warning: writing 1 byte into a region of size 0 [-Wstringop-overflow=] 1373 | void invalidate() { len = invalidLen; } | ~~~~^~~~~~~~~~~~ /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp: In member function ‘bool dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::gemmAccessC(COperation, const dnnl::impl::gpu::intel::jit::GEMMProblem&, const dnnl::impl::gpu::intel::jit::GEMMStrategy&, dnnl::impl::gpu::intel::jit::GEMMState&) [with ngen::Core hw = ngen::Core::Xe2]’: /home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp:17899:18: note: at offset 5 into destination object ‘C_addr0’ of size 4 17899 | GRFRange C_addr0[2], C_addr0Unmasked[2]; | ^~~~~~~ [780/780] Linking CXX static library src/libdnnl.a ninja: build stopped: subcommand failed. Building wheel torch-2.5.0a0+gita8d6afb -- Building version 2.5.0a0+gita8d6afb cmake -GNinja -DBUILD_PYTHON=True -DBUILD_TEST=True -DCMAKE_BUILD_TYPE=Release -DCMAKE_INSTALL_PREFIX=/home/ash/build/pytorch/torch -DCMAKE_PREFIX_PATH=/home/ash/venv/lila/lib/python3.12/site-packages;'/home/ash/venv/lila/bin/../':/home/ash/intel/oneapi/pti/0.10/lib/cmake/pti:/home/ash/intel/oneapi/mkl/2025.0/lib/cmake:/home/ash/intel/oneapi/compiler/2025.0:/home/ash/intel/oneapi/tbb/2022.0/env/..:/home/ash/intel/oneapi/pti/0.10/lib/cmake/pti:/home/ash/intel/oneapi/mkl/2025.0/lib/cmake:/home/ash/intel/oneapi/ipp/2022.0/lib/cmake/ipp:/home/ash/intel/oneapi/dpl/2022.7/lib/cmake/oneDPL:/home/ash/intel/oneapi/dnnl/2025.0/lib/cmake:/home/ash/intel/oneapi/dal/2025.0:/home/ash/intel/oneapi/compiler/2025.0:/home/ash/intel/oneapi/tbb/2022.0/env/..:/home/ash/intel/oneapi/pti/0.10/lib/cmake/pti:/home/ash/intel/oneapi/mkl/2025.0/lib/cmake:/home/ash/intel/oneapi/ipp/2022.0/lib/cmake/ipp:/home/ash/intel/oneapi/dpl/2022.7/lib/cmake/oneDPL:/home/ash/intel/oneapi/dnnl/2025.0/lib/cmake:/home/ash/intel/oneapi/dal/2025.0:/home/ash/intel/oneapi/compiler/2025.0 -DGLIBCXX_USE_CXX11_ABI=1 -DPython_EXECUTABLE=/home/ash/venv/lila/bin/python -DTORCH_BUILD_VERSION=2.5.0a0+gita8d6afb -DUSE_CUDA=0 -DUSE_MPI=0 -DUSE_NUMA=0 -DUSE_NUMPY=True -DUSE_STATIC_MKL=1 /home/ash/build/pytorch

I have no idea what's gone wrong here. Any help would be much appreciated.

Ash

@jingxu10
Copy link
Contributor

Hi,

  1. Could you share more info with this script?
  2. We don't see the real error message. Could you upload the full compilation log?
  3. Which IPEX branch did you compile against? Please try 2.5.10.

@alexsin368
Copy link
Contributor

@staropram Please follow the instructions from @jingxu10. The collect_env.py script tells us what versions of all libraries you have installed. The full compilation log tells us what went wrong during compilation.

In general, these are the steps to install IPEX, where the link you provided is correct. For Arc A370M, you need to compile from source since there isn't a pre-built binary for it. https://pytorch-extension.intel.com/installation?platform=gpu&version=v2.5.10%2Bxpu&os=linux%2Fwsl2&package=source

Just to be 100% certain, the compile script for IPEX v2.5.10+xpu is here: https://github.com/intel/intel-extension-for-pytorch/blob/v2.5.10%2Bxpu/scripts/compile_bundle.sh

@staropram
Copy link
Author

Thank you both for the help, here's the collect_env.py output:

collect_env.log

@jingxu10 I was using "2.5.10" above.

I will build again from scratch using the exact compile bundle linked above from a conda env instead (just in case) and upload the build log.

@staropram
Copy link
Author

OK, I have tried again using the exact compile_bundle.sh link above, a brand new conda env.

Here is

collect_env.log

and

build.log

Any help you could give me would be appreciated. Thanks.

@staropram
Copy link
Author

This might be a red herring, I asked chatgpt to analyse the log file and it thinks this is the error:

/home/ash/build/pytorch/third_party/ideep/mkl-dnn/src/gpu/intel/jit/gemm/gen_gemm_kernel_generator.cpp: In member function ‘bool dnnl::impl::gpu::intel::jit::gemm_kernel_generator_t::gemmAccessC(...)’:
...
note: at offset 5 into destination object ‘C_addr0’ of size 4
17899 | GRFRange C_addr0[2], C_addr0Unmasked[2];

Although ninja doesn't actually say anything until the linking step:

[780/780] Linking CXX static library src/libdnnl.a
ninja: build stopped: subcommand failed.

Is chatgpt sending us down a rabbit hole here or is it onto something?

Otherwise, does the script have an easy way to produce more verbose output?

I'll think about that while I await your observations of the above log and chatgpt's idea.

Ash

@alexsin368
Copy link
Contributor

alexsin368 commented Mar 5, 2025

@staropram from your collect_env.log, your GCC version is 14.2.0. Can you try with GCC 11?

@staropram
Copy link
Author

staropram commented Mar 9, 2025

I just get other errors, before we go down that route, can I please ask are you using icpx for host and device compilation?

Or are you using gcc, which version? Clang?

@alexsin368
Copy link
Contributor

Just GCC 11 is sufficient. If you're still getting errors with that, please share with us the full compilation log.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants