Adopt new Llama 3.1 HF names #1357

Merged
wizeng23 merged 6 commits into main from wizeng/llama
Feb 4, 2025

wizeng23 (Contributor) commented Feb 4, 2025

Description

Llama 3.1 8B Instruct was previously named meta-llama/Meta-Llama-3.1-8B-Instruct on Hugging Face. It has since been renamed to meta-llama/Llama-3.1-8B-Instruct, which matches the naming scheme used for Llama 3.2 and later. The page for the old name now redirects to the new one: https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct.

We make the following renames:

  • meta-llama/Meta-Llama-3.1-8B -> meta-llama/Llama-3.1-8B
  • meta-llama/Meta-Llama-3.1-8B-Instruct -> meta-llama/Llama-3.1-8B-Instruct
  • meta-llama/Meta-Llama-3.1-70B-Instruct -> meta-llama/Llama-3.1-70B-Instruct
  • meta-llama/Meta-Llama-3.1-405B-Instruct -> meta-llama/Llama-3.1-405B-Instruct
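
For anyone applying the same renames to their own configs, the list above can be sketched as a small lookup table. This is an illustrative helper, not the script used in this PR; the names `RENAMES` and `adopt_new_names` are hypothetical, and only the four model IDs listed above are rewritten.

```python
import re

# Old -> new Hugging Face model IDs, copied from the rename list above.
RENAMES = {
    "meta-llama/Meta-Llama-3.1-8B": "meta-llama/Llama-3.1-8B",
    "meta-llama/Meta-Llama-3.1-8B-Instruct": "meta-llama/Llama-3.1-8B-Instruct",
    "meta-llama/Meta-Llama-3.1-70B-Instruct": "meta-llama/Llama-3.1-70B-Instruct",
    "meta-llama/Meta-Llama-3.1-405B-Instruct": "meta-llama/Llama-3.1-405B-Instruct",
}

# Try longer names first, and require that the match is not followed by another
# name character, so "...-8B" is never matched inside "...-8B-Instruct".
_PATTERN = re.compile(
    "(?:"
    + "|".join(re.escape(old) for old in sorted(RENAMES, key=len, reverse=True))
    + r")(?![\w-])"
)


def adopt_new_names(text: str) -> str:
    """Return text with every listed old model ID replaced by its new ID."""
    return _PATTERN.sub(lambda match: RENAMES[match.group(0)], text)
```

Model IDs that are not in the table, such as a 70B base model, are left untouched, so the helper does not rename anything beyond what this PR lists.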

I also updated references to Llama 2 and Llama 3 to use the latest versions instead.
I tested one of the configs to verify that it still works.

Before submitting

  • This PR only changes documentation. (You can ignore the following checks in that case)
  • Did you read the contributor guideline Pull Request guidelines?
  • Did you link the issue(s) related to this PR in the section above?
  • Did you add / update tests where needed?

@@ -58,32 +58,32 @@ Configure the model architecture and loading using the {py:obj}`~oumi.core.confi
 ```yaml
 model:
   # Required
-  model_name: "meta-llama/Llama-2-7b-hf" # Model ID or path (REQUIRED)
+  model_name: "meta-llama/Llama-3.3-70B-Instruct" # Model ID or path (REQUIRED)

Collaborator:
Are you intentionally changing from 7b -> 70b?

Contributor Author (wizeng23):
Nope, that's my bad. Thanks for the catch!
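
The kind of slip caught in this thread, where a model ID update also changes the model size, could be flagged automatically. The following is a minimal sketch under stated assumptions: `param_size` and `size_changed` are hypothetical helpers that are not part of this PR or of Oumi, and they assume the size appears in the ID as a number followed by `b` or `B` (for example `7b` or `70B`).

```python
import re
from typing import Optional

# A number followed by "b"/"B" that is not part of a version string such as "3.3".
_SIZE = re.compile(r"(?<![\w.])(\d+(?:\.\d+)?)[bB](?![a-zA-Z0-9])")


def param_size(model_id: str) -> Optional[float]:
    """Return the parameter count in billions parsed from a model ID, if any."""
    match = _SIZE.search(model_id)
    return float(match.group(1)) if match else None


def size_changed(old_id: str, new_id: str) -> bool:
    """Return True when the two model IDs advertise different sizes."""
    return param_size(old_id) != param_size(new_id)
```

A check like this only prompts a human review. An intentional move from a 7B model to its 8B successor would be flagged as well, so it should not block a change on its own.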

@wizeng23 wizeng23 merged commit 82617b4 into main Feb 4, 2025
3 checks passed
@wizeng23 wizeng23 deleted the wizeng/llama branch February 4, 2025 02:15
3 participants