Added OLMo support to builder.py #1061

shobrienDMA · 2024-11-12T14:38:53Z

No description provided.

shobrienDMA · 2024-11-25T09:32:13Z

@microsoft-github-policy-service agree company="AMD"

BowenBao · 2024-12-10T17:16:38Z

@kunal-vaishnavi ptal, thanks!

kunal-vaishnavi · 2024-12-10T22:06:49Z

Thanks for the contribution! Does OLMo run end-to-end with the ONNX Runtime GenAI tokenizer?

Can you also update the following places?

Add OLMo to the repo README and the model builder README to show that OLMo is now supported
Add OLMo to the CIs

onnxruntime-genai/test/python/_test_utils.py

Lines 55 to 77 in 0f59a90

    
           def get_model_paths(): 
        
               hf_paths = { 
        
                   "phi-2": "microsoft/phi-2", 
        
                   # "phi-3-mini": "microsoft/Phi-3-mini-128k-instruct", 
        
               } 
        
               ci_data_path = os.path.join("/", "data", "ortgenai_pytorch_models") 
        
               if not os.path.exists(ci_data_path): 
        
                   return {}, hf_paths 
        
               # Note: If a model has over 4B parameters, please add a quantized version 
        
               # to `ci_paths` instead of `hf_paths` to reduce file size and testing time. 
        
               ci_paths = { 
        
                   "llama-2": os.path.join(ci_data_path, "Llama-2-7B-Chat-GPTQ"), 
        
                   "llama-3": os.path.join(ci_data_path, "Meta-Llama-3-8B-AWQ"), 
        
                   "mistral-v0.2": os.path.join(ci_data_path, "Mistral-7B-Instruct-v0.2-GPTQ"), 
        
                   # "phi-2": os.path.join(ci_data_path, "phi2"), 
        
                   # "gemma-2b": os.path.join(ci_data_path, "gemma-1.1-2b-it"), 
        
                   "gemma-7b": os.path.join(ci_data_path, "gemma-7b-it-awq"), 
        
                   # "phi-3-mini": os.path.join(ci_data_path, "phi3-mini-128k-instruct"), 
        
               } 
        
               return ci_paths, hf_paths

The models in hf_paths are downloaded from Hugging Face, and the models in ci_paths are currently uploaded to /data/ortgenai_pytorch_models in the Linux CUDA CI VM.

onnxruntime-genai/.github/workflows/linux-gpu-x64-build.yml

Line 122 in 0f59a90

--volume /data/ortgenai_pytorch_models:/data/ortgenai_pytorch_models \

You can add it to hf_paths for now. If you can also add Qwen to the CIs, that would be helpful.

shobrienDMA · 2024-12-12T13:45:15Z

That is be updated as requested now. It runs end to end and I've also added Qwen to the CI list.

kunal-vaishnavi · 2024-12-17T05:35:25Z

Thank you for adding the changes. The end-to-end tests in the CIs appear to be failing due to the transformers version. Can you pin it to v4.44.2?

onnxruntime-genai/test/python/requirements.txt

Line 9 in 9055e68

transformers

shobrienDMA · 2024-12-17T16:58:44Z

This should be good to go!

kunal-vaishnavi · 2024-12-18T19:42:48Z

After some further investigation, it appears that the tokenizer CI failure is happening because the tokenizer for OLMo is not currently supported in ONNX Runtime Extensions. Once the support is added, the main branch of ONNX Runtime GenAI can be merged into this PR to integrate the changes.

kunal-vaishnavi · 2025-01-08T22:45:50Z

The ONNX Runtime Extensions PR has been merged now. You can update its commit ID in deps.txt to pull in those changes.

onnxruntime-genai/cmake/deps.txt

Line 17 in 41c2543

    
           onnxruntime_extensions;https://github.com/microsoft/onnxruntime-extensions.git;4e10ee046a2f035351f3fe88740bd8215a18fdb9

shobrienDMA · 2025-01-13T13:23:19Z

Hi @kunal-vaishnavi thanks for letting me know, I've bumped that dependency.

src/models/model.cpp

src/python/py/models/builder.py

…y fake layernorm Co-authored-by: Tim Costigan <[email protected]> Co-authored-by: Tim Costigan <[email protected]>"

…yerNorm process

… and set then in our override Co-authored-by: Tim Costigan <[email protected]> Co-authored-by: Tim Costigan <[email protected]>

…exist, which caused errors in model_qa.py

shobrienDMA marked this pull request as ready for review November 25, 2024 10:16

kunal-vaishnavi mentioned this pull request Dec 10, 2024

adding OLMo to the list of Decoder Only Models #1060

Closed

kunal-vaishnavi reviewed Jan 13, 2025

View reviewed changes

src/models/model.cpp Outdated Show resolved Hide resolved

kunal-vaishnavi reviewed Jan 13, 2025

View reviewed changes

src/python/py/models/builder.py Outdated Show resolved Hide resolved

kunal-vaishnavi added the 0.6.0 label Jan 13, 2025

shobrienDMA force-pushed the shobrien/add-olmo-builder-support branch from 8257b5b to 0829f0a Compare January 16, 2025 11:28

shobrienDMA and others added 7 commits January 17, 2025 09:57

adding OLMO to the list of Decoder Only Models

4a3767c

Added OLMoModel Class and config.architecture detection, and temporar…

b472b78

…y fake layernorm Co-authored-by: Tim Costigan <[email protected]> Co-authored-by: Tim Costigan <[email protected]>"

Comment out our hack, modify the OLMo class to attempt to skip the La…

dff41a2

…yerNorm process

add olmo builder support

ec3a40f

Pulled the layernorm.weight and layernorm.bias values from the config…

4bf139b

… and set then in our override Co-authored-by: Tim Costigan <[email protected]> Co-authored-by: Tim Costigan <[email protected]>

fix new issue where bos_token_id was always set to None if it didn't …

d0495f2

…exist, which caused errors in model_qa.py

Update readmes and add OLMo and Qwen to the CI tests_utils

5b3e7f1

shobrienDMA force-pushed the shobrien/add-olmo-builder-support branch from 0829f0a to 5b3e7f1 Compare January 17, 2025 09:59

kunal-vaishnavi approved these changes Jan 18, 2025

View reviewed changes

kunal-vaishnavi merged commit 471e715 into microsoft:main Jan 18, 2025
12 of 14 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added OLMo support to builder.py #1061

Added OLMo support to builder.py #1061

shobrienDMA commented Nov 12, 2024

shobrienDMA commented Nov 25, 2024

BowenBao commented Dec 10, 2024

kunal-vaishnavi commented Dec 10, 2024

shobrienDMA commented Dec 12, 2024 •

edited

Loading

kunal-vaishnavi commented Dec 17, 2024

shobrienDMA commented Dec 17, 2024

kunal-vaishnavi commented Dec 18, 2024

kunal-vaishnavi commented Jan 8, 2025

shobrienDMA commented Jan 13, 2025

Added OLMo support to builder.py #1061

Added OLMo support to builder.py #1061

Conversation

shobrienDMA commented Nov 12, 2024

shobrienDMA commented Nov 25, 2024

BowenBao commented Dec 10, 2024

kunal-vaishnavi commented Dec 10, 2024

shobrienDMA commented Dec 12, 2024 • edited Loading

kunal-vaishnavi commented Dec 17, 2024

shobrienDMA commented Dec 17, 2024

kunal-vaishnavi commented Dec 18, 2024

kunal-vaishnavi commented Jan 8, 2025

shobrienDMA commented Jan 13, 2025

shobrienDMA commented Dec 12, 2024 •

edited

Loading