Add support for Idefics 3 #2503

stelterlab · 2024-09-07T12:27:41Z

Model description

Please add support for HuggingFaceM4/Idefics3-8B-Llama3 in TGI:

Idefics3 is an open multimodal model that accepts arbitrary sequences of image and text inputs and produces text outputs. The model can answer questions about images, describe visual content, create stories grounded on multiple images, or simply behave as a pure language model without visual inputs.
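
For illustration, an "arbitrary sequence of image and text inputs" could be represented as an ordered list of parts. This is only a minimal sketch in plain Python: the `TextPart` and `ImagePart` classes, the `flatten_prompt` helper, the `<image>` placeholder, and the example URLs are all hypothetical and not taken from transformers or TGI.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass
class TextPart:
    text: str


@dataclass
class ImagePart:
    url: str  # hypothetical: where the image would be loaded from


Part = Union[TextPart, ImagePart]


def flatten_prompt(parts: List[Part], image_token: str = "<image>") -> str:
    """Join the parts in order, emitting a placeholder where each image sits."""
    pieces = []
    for part in parts:
        if isinstance(part, ImagePart):
            pieces.append(image_token)  # the image itself travels separately
        else:
            pieces.append(part.text)
    return " ".join(pieces)


def count_images(parts: List[Part]) -> int:
    """Number of images in the interleaved sequence."""
    return sum(isinstance(p, ImagePart) for p in parts)


# Two images interleaved with text, in arbitrary order.
parts = [
    ImagePart("https://example.com/a.png"),
    TextPart("What differs between this image"),
    ImagePart("https://example.com/b.png"),
    TextPart("and this one?"),
]
print(flatten_prompt(parts))
```

A text-only list (no `ImagePart` entries) corresponds to the "pure language model" case mentioned above.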

Open source status

The model implementation is available
The model weights are available

Provide useful links for the implementation

Well, the necessary changes to the transformers library are just waiting for review in this PR:

huggingface/transformers#32473

as of the time of writing this model request.

As the model/finetune and the transformers lib are made by the same famous company, I would assume there should be no big problems. ;-)

ErikKaum · 2024-09-09T09:12:50Z

Hi @stelterlab 👋

We have a PR in the making, no big problems indeed ;) but we are a bit constrained on bandwidth at the moment, so it's not moving as fast as we'd like.

efenocchi · 2024-09-09T10:51:45Z

Hi @ErikKaum I noticed a bug, could you check my last comment in the PR?

ErikKaum · 2024-09-09T11:22:43Z

Hi @efenocchi I'm unfortunately not super well versed in the transformers library. I'd consider reaching out to the people in the conversation you have in the repo 👍

ErikKaum added the "new model" label (request for integration of a new model) on Sep 9, 2024
