Pydantic v2 Migration #54

movchan74 · 2024-02-20T16:17:02Z

Summary:
This PR migrates the codebase to Pydantic >= 2.0.

Key Changes:

Updated Pydantic to >=2.0
Updated Ray to >=2.9
Updated vLLVM to >= 0.3.0
Updated Pytorch to 2.1.2
Updated CUDA to 12.1
Updated the codebase to use the new Pydantic API
Increased the required memory for the LLaMa 2 7b Chat from 10GB to 13GB because new version of vLLM consumes more memory. There is an open issue in vLLM about this. Note that now vLLM requires 13GB of memory to run so if gets allocated on the GPU with less than 13GB of memory, it will fail. And because Ray still does not support GPU memory aware scheduling, we cannot enforce this requirement in the code. There is an open PR in Ray with the proposal to add GPU memory aware scheduling.

Related Issues: #49, #35

HRashidi

LGTM

aana/api/request_handler.py

evanderiel

Overall good. Just a couple of general things:

Why are we using typing_extensions instead of typing? The canonical module is typing; typing_extensions is a non-standard package. If we do want to use typing_extensions, then we need to add it to the dependencies in pyproject.toml
In the future this could maybe be split into a couple of different PRs - one for updating dependencies and fixing breaking changes, one for changing the config from MappingProxyType to ConfigDict, etc, and maybe a separate PR just for updating the test cache.

aana/configs/db.py

movchan74 · 2024-02-23T16:15:10Z

In the future this could maybe be split into a couple of different PRs - one for updating dependencies and fixing breaking changes, one for changing the config from MappingProxyType to ConfigDict, etc, and maybe a separate PR just for updating the test cache.

It's all part of the same upgrade, doesn't make sense for me to split it.
"changing the config from MappingProxyType to ConfigDict" was part of the pydantic migration.
"updating the test cache" was also part of the migration. The cache was using old pydantic models and the tests were failing because of it.

movchan74 · 2024-02-26T16:22:47Z

@evanderiel @HRashidi I've updated the submodule for Mobius Pipeline and merged the main branch. Please, take another look.

HRashidi

👍🏽

movchan74 added 4 commits February 16, 2024 09:33

Migration to Pydantic v2

d4dd46d

Updated cache

b74a863

Ignore ImportError for test cache

4a83de4

Update mobius-pipeline subproject commit

f68c19c

movchan74 requested review from HRashidi and evanderiel February 20, 2024 16:17

movchan74 self-assigned this Feb 20, 2024

Add chat_template and enforce_eager options to docsting of VLLMConfig

ea52411

HRashidi approved these changes Feb 21, 2024

View reviewed changes

aana/api/request_handler.py Outdated Show resolved Hide resolved

evanderiel approved these changes Feb 23, 2024

View reviewed changes

aana/configs/db.py Show resolved Hide resolved

movchan74 added 2 commits February 26, 2024 15:58

Updated Mobius Pipeline submodule commit

cddfd71

Merge branch 'main' into pydantic_v2_migration

55ce3bc

movchan74 requested review from HRashidi and evanderiel February 26, 2024 16:21

HRashidi approved these changes Feb 26, 2024

View reviewed changes

movchan74 merged commit 6ac2c78 into main Feb 26, 2024
2 checks passed

movchan74 deleted the pydantic_v2_migration branch February 26, 2024 16:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pydantic v2 Migration #54

Pydantic v2 Migration #54

movchan74 commented Feb 20, 2024

HRashidi left a comment

evanderiel left a comment

movchan74 commented Feb 23, 2024

movchan74 commented Feb 26, 2024

HRashidi left a comment

Pydantic v2 Migration #54

Pydantic v2 Migration #54

Conversation

movchan74 commented Feb 20, 2024

HRashidi left a comment

Choose a reason for hiding this comment

evanderiel left a comment

Choose a reason for hiding this comment

movchan74 commented Feb 23, 2024

movchan74 commented Feb 26, 2024

HRashidi left a comment

Choose a reason for hiding this comment