Bump transformers from 4.36.0 to 4.41.0 #218

dependabot · 2024-05-20T16:06:07Z

Bumps transformers from 4.36.0 to 4.41.0.

Release notes

v4.41.0: Phi3, JetMoE, PaliGemma, VideoLlava, Falcon2, FalconVLM & GGUF support

New models

Phi3

The Phi-3 model was proposed in Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone by Microsoft.

TLDR; Phi-3 introduces new ROPE scaling methods, which seems to scale fairly well! A 3b and a Phi-3-mini is available in two context-length variants—4K and 128K tokens. It is the first model in its class to support a context window of up to 128K tokens, with little impact on quality.

Phi-3 by @gugarosa in huggingface/transformers#30423

JetMoE

JetMoe-8B is an 8B Mixture-of-Experts (MoE) language model developed by Yikang Shen and MyShell. JetMoe project aims to provide a LLaMA2-level performance and efficient language model with a limited budget. To achieve this goal, JetMoe uses a sparsely activated architecture inspired by the ModuleFormer. Each JetMoe block consists of two MoE layers: Mixture of Attention Heads and Mixture of MLP Experts. Given the input tokens, it activates a subset of its experts to process them. This sparse activation schema enables JetMoe to achieve much better training throughput than similar size dense models. The training throughput of JetMoe-8B is around 100B tokens per day on a cluster of 96 H100 GPUs with a straightforward 3-way pipeline parallelism strategy.

Add JetMoE model by @yikangshen in huggingface/transformers#30005

PaliGemma

PaliGemma is a lightweight open vision-language model (VLM) inspired by PaLI-3, and based on open components like the SigLIP vision model and the Gemma language model. PaliGemma takes both images and text as inputs and can answer questions about images with detail and context, meaning that PaliGemma can perform deeper analysis of images and provide useful insights, such as captioning for images and short videos, object detection, and reading text embedded within images.

More than 120 checkpoints are released see the collection here !

Add PaliGemma by @molbap in huggingface/transformers#30814

VideoLlava

Video-LLaVA exhibits remarkable interactive capabilities between images and videos, despite the absence of image-video pairs in the dataset.

💡 Simple baseline, learning united visual representation by alignment before projection With the binding of unified visual representations to the language feature space, we enable an LLM to perform visual reasoning capabilities on both images and videos simultaneously. 🔥 High performance, complementary learning with video and image Extensive experiments demonstrate the complementarity of modalities, showcasing significant superiority when compared to models specifically designed for either images or videos.

Add Video Llava by @zucchini-nlp in huggingface/transformers#29733

Falcon 2 and FalconVLM:

... (truncated)

Commits

4c6c45b Release: v4.41.0
e9a8041 update release script (#30880)
0a9300f Support arbitrary processor (#30875)
57edd84 [whisper] fix multilingual fine-tuning (#30865)
977ce58 Fix dependencies for image classification example (#30842)
3802e78 Enable device map (#30870)
57c965a Remove deprecated logic and warnings (#30743)
3d7d3a8 TEST: Add llama logits tests (#30835)
15c74a2 Fix VideoLlava imports (#30867)
4e17e7d TST / Quantization: Reverting to torch==2.2.1 (#30866)
Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR
@dependabot recreate will recreate this PR, overwriting any edits that have been made to it
@dependabot merge will merge this PR after your CI passes on it
@dependabot squash and merge will squash and merge this PR after your CI passes on it
@dependabot cancel merge will cancel a previously requested merge and block automerging
@dependabot reopen will reopen this PR if it is closed
@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
@dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Bumps [transformers](https://github.com/huggingface/transformers) from 4.36.0 to 4.41.0. - [Release notes](https://github.com/huggingface/transformers/releases) - [Commits](huggingface/transformers@v4.36.0...v4.41.0) --- updated-dependencies: - dependency-name: transformers dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]>

Bump transformers from 4.36.0 to 4.41.0

dependabot bot added dependencies Pull requests that update a dependency file python Pull requests that update Python code labels May 20, 2024

dependabot bot mentioned this pull request May 20, 2024

Bump transformers from 4.36.0 to 4.40.2 #214

Closed

dependabot bot had a problem deploying to production May 20, 2024 16:06 Failure

kyegomez merged commit b2b4d0d into master May 21, 2024
23 of 44 checks passed

dependabot bot deleted the dependabot/pip/transformers-4.41.0 branch May 21, 2024 00:21

kyegomez added a commit that referenced this pull request Sep 3, 2024

Merge pull request #218 from kyegomez/dependabot/pip/transformers-4.41.0

63374d8

Bump transformers from 4.36.0 to 4.41.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bump transformers from 4.36.0 to 4.41.0 #218

Bump transformers from 4.36.0 to 4.41.0 #218

dependabot bot commented on behalf of github May 20, 2024

Bump transformers from 4.36.0 to 4.41.0 #218

Bump transformers from 4.36.0 to 4.41.0 #218

Conversation

dependabot bot commented on behalf of github May 20, 2024

v4.41.0: Phi3, JetMoE, PaliGemma, VideoLlava, Falcon2, FalconVLM & GGUF support

New models

Phi3

JetMoE

PaliGemma

VideoLlava

Falcon 2 and FalconVLM: