Anybody running an Intel Arc A750 or A770? #1923
Replies: 5 comments · 18 replies
-
This is gaming oriented, but it seems like it's about half the speed of a 3090: https://gpu.userbenchmark.com/Compare/Nvidia-RTX-3090-vs-Intel-Arc-A770/4081vsm1850973 It also has 33% less memory. However, it costs about half as much. Assuming the OpenCL performance is in line with the gaming performance, it could possibly make sense to get two of them and use something like GGML's GPU-splitting feature. However, the cards have a 250 W TDP, so that's a huge amount of power. The cheapest one I found was $339; used 3090s are $700-800. I didn't find much about OpenCL usage, but it seems like the card is pretty well supported on Linux.
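For reference, a two-card split like the one suggested above might look like this with llama.cpp's GPU offload and tensor-split options. This is only a sketch: the binary name, model path, and even split ratio are placeholders, and flag names have varied across llama.cpp versions.

```shell
# Hypothetical two-A770 invocation (paths and ratios are examples only).
# -ngl 99 offloads all layers to the GPUs;
# --tensor-split 1,1 divides the tensors evenly between the two cards.
./llama-cli -m ./models/llama-13b-q4_0.gguf \
    -ngl 99 \
    --tensor-split 1,1 \
    -p "Hello"
```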
-
I checked out the specs on these recently. Yes, price for performance is very good. Though am I right to assume they use 50-100% more electricity than an equivalent NVIDIA card? I'm SUPER impressed with the performance-per-watt of my M2 Mac. The Amazon Inferentia etc. ARM chips are also super energy-efficient.
-
I have an A380 (ASRock Challenger) and tried llama.cpp. 7B (Vicuna 1.0, Q4_0) produced a terrible result of 240 ms/tok. (Arch Linux, E5-2670 v3 with DDR4-2133 32 GB)
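To put the reported latency in more familiar terms, the conversion from ms/token to tokens/second is just a reciprocal (the 240 ms/tok figure is taken from the comment above):

```python
def ms_per_tok_to_tok_per_s(ms_per_tok: float) -> float:
    """Convert a per-token latency in milliseconds to tokens per second."""
    return 1000.0 / ms_per_tok

# 240 ms/tok works out to roughly 4.2 tokens per second.
print(f"{ms_per_tok_to_tok_per_s(240):.1f} tok/s")
```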
-
Run Llama-2 13B very fast, locally, on a low-cost Intel Arc GPU, iGPU, or CPU: https://youtu.be/FRWy7rzOsRs
-
Here is a video on how to run it using SYCL; it's faster than CLBlast:
https://www.youtube.com/watch?v=Q7t4CmziaqA
…On Mon, 19 Aug 2024 at 07:25, Neo Zhang Jianyu ***@***.***> wrote:
@delock <https://github.com/delock>
You could try with llama.cpp for SYCL backend, refer to
https://github.com/ggerganov/llama.cpp/blob/master/docs/backend/SYCL.md
It's faster than CLBlast on Intel GPU.
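The SYCL build referenced in that docs page generally follows the pattern below. This is a sketch, not the authoritative recipe: the `GGML_SYCL` flag name and the oneAPI install path are assumptions that vary with the llama.cpp version and your system, so check docs/backend/SYCL.md for your checkout.

```shell
# Sketch of a SYCL build of llama.cpp with Intel's oneAPI compilers.
# Load the oneAPI toolchain (path assumes the default install location).
source /opt/intel/oneapi/setvars.sh

# Configure with the SYCL backend enabled, using the icx/icpx compilers.
cmake -B build -DGGML_SYCL=ON \
      -DCMAKE_C_COMPILER=icx -DCMAKE_CXX_COMPILER=icpx

# Build the release binaries.
cmake --build build --config Release -j
```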
-
If anyone is running these cards for their VRAM capacity, what is your experience like? How many tokens per second are you getting through OpenCL offloading? Does it work with UIs like oobabooga, and is it worth getting one?
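For anyone wanting to measure this themselves, a CLBlast (OpenCL) build of llama.cpp prints its eval timings when a run finishes, which is exactly the number asked about here. A rough sketch, with the model path and layer count as placeholders (the binary was called `main` in older builds; the environment variable selects Intel's OpenCL platform):

```shell
# Hypothetical OpenCL run on an Arc card; -ngl moves layers to the GPU.
# At exit, llama.cpp prints "eval time ... ms per token" statistics.
GGML_OPENCL_PLATFORM=Intel ./main -m ./models/model-q4_0.gguf -ngl 32 -p "Hi"
```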