Enable vortex style fp8 as an option in evo2 #12464

Open
wants to merge 2 commits into
base: main
from


jstjohn
Collaborator

@jstjohn jstjohn commented Mar 3, 2025

This particular form of FP8 is needed to replicate a number of results in the Evo2 paper.

PR Type:

  • New Feature
  • Bugfix
  • Documentation

Without it we get random AUCs; with it we can recover the results reported in the Arc notebooks.

Signed-off-by: John St John <[email protected]>
@jstjohn jstjohn force-pushed the jstjohn/evo2-enable-vortex-inference branch from 2da79b9 to 5afbbad March 3, 2025 23:53
class _te:
    """If this dummy module is accessed, a not imported error will be raised."""

    def __getattribute__(self, name: str) -> None:

Check notice: Code scanning / CodeQL

Non-standard exception raised in special method (Note)

Function always raises builtin-class ImportError; raise AttributeError instead.

Copilot Autofix AI about 8 hours ago

To fix the problem, we need to modify the __getattribute__ method to raise an AttributeError instead of an ImportError. This change will ensure that the method conforms to the standard protocol for attribute access in Python. The rest of the functionality should remain the same, and the warning message about the missing transformer_engine module will still be logged.
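A minimal sketch of why the exception type matters here. The class names and the probed attribute name are hypothetical stand-ins, not code from this PR. It relies only on documented Python behavior: `getattr` with a default and `hasattr` swallow `AttributeError`, but let any other exception propagate.

```python
class RaisesImportError:
    """Hypothetical placeholder mirroring the pre-patch behavior."""

    def __getattribute__(self, name: str):
        # Non-standard for attribute access: callers probing attributes do not expect this.
        raise ImportError("transformer_engine not installed.")


class RaisesAttributeError:
    """Hypothetical placeholder following the attribute-access protocol."""

    def __getattribute__(self, name: str):
        # Assumption: the install hint is kept in the message so it is not lost.
        raise AttributeError(
            f"placeholder has no attribute {name!r}; transformer_engine not installed"
        )


def probe(obj) -> str:
    """Report how getattr-with-default behaves for a placeholder object."""
    try:
        # "fp8_autocast" is only an illustrative attribute name.
        value = getattr(obj, "fp8_autocast", None)
    except ImportError:
        return "ImportError escaped getattr"
    return "default returned" if value is None else "attribute found"


import_result = probe(RaisesImportError())
attribute_result = probe(RaisesAttributeError())
has_attr = hasattr(RaisesAttributeError(), "fp8_autocast")
print(import_result, attribute_result, has_attr)
```

One trade-off to weigh before accepting the autofix: with `AttributeError`, feature probes such as `hasattr` quietly return `False`, so the hint that `transformer_engine` is missing only reaches the user if it is carried in the exception message or logged elsewhere.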

Suggested changeset 1
nemo/collections/llm/gpt/model/megatron/hyena/hyena_mixer.py

Autofix patch

Run the following command in your local git repository to apply this patch
cat << 'EOF' | git apply
diff --git a/nemo/collections/llm/gpt/model/megatron/hyena/hyena_mixer.py b/nemo/collections/llm/gpt/model/megatron/hyena/hyena_mixer.py
--- a/nemo/collections/llm/gpt/model/megatron/hyena/hyena_mixer.py
+++ b/nemo/collections/llm/gpt/model/megatron/hyena/hyena_mixer.py
@@ -62,3 +62,3 @@
             """Not imported: te. An error will be raised if this is called like a module."""
-            raise ImportError("transformer_engine not installed. Using default recipe.")
+            raise AttributeError(f"'_te' object has no attribute '{name}'")
 
EOF

Copilot is powered by AI and may make mistakes. Always verify output.
@jstjohn jstjohn enabled auto-merge (squash) March 4, 2025 00:07
@jstjohn jstjohn mentioned this pull request Mar 4, 2025
9 tasks
@jstjohn jstjohn added feature request/PR for a new feature Run CICD and removed Run CICD labels Mar 4, 2025
@ko3n1g ko3n1g added Run CICD and removed Run CICD labels Mar 4, 2025
Labels
feature request/PR for a new feature Run CICD
3 participants