Adopt new Llama 3.1 HF names #1357

wizeng23 · 2025-02-04T01:46:28Z

Description

Llama 3.1 8B Instruct was previously named meta-llama/Meta-Llama-3.1-8B-Instruct on Hugging Face. It has since been renamed to meta-llama/Llama-3.1-8B-Instruct, matching the naming scheme used for Llama 3.2 and later. The page for the old name now redirects to the new one: https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct.

We make the following renames:

meta-llama/Meta-Llama-3.1-8B -> meta-llama/Llama-3.1-8B
meta-llama/Meta-Llama-3.1-8B-Instruct -> meta-llama/Llama-3.1-8B-Instruct
meta-llama/Meta-Llama-3.1-70B-Instruct -> meta-llama/Llama-3.1-70B-Instruct
meta-llama/Meta-Llama-3.1-405B-Instruct -> meta-llama/Llama-3.1-405B-Instruct
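
For anyone applying the same renames to their own configs, the mapping above could be sketched as a small Python helper. This is only an illustration and is not part of this PR: `RENAMES` and `rename_model_ids` are hypothetical names, and the sketch assumes model IDs appear verbatim in the text being rewritten.

```python
import re

# Old -> new Hugging Face model IDs, copied from the rename list above.
RENAMES = {
    "meta-llama/Meta-Llama-3.1-8B": "meta-llama/Llama-3.1-8B",
    "meta-llama/Meta-Llama-3.1-8B-Instruct": "meta-llama/Llama-3.1-8B-Instruct",
    "meta-llama/Meta-Llama-3.1-70B-Instruct": "meta-llama/Llama-3.1-70B-Instruct",
    "meta-llama/Meta-Llama-3.1-405B-Instruct": "meta-llama/Llama-3.1-405B-Instruct",
}

# Try longer IDs first, and require that the match is not followed by another
# ID character, so an ID with an unlisted suffix is left untouched.
_PATTERN = re.compile(
    "(?:"
    + "|".join(re.escape(old) for old in sorted(RENAMES, key=len, reverse=True))
    + r")(?![\w-])"
)


def rename_model_ids(text: str) -> str:
    """Replace every old Llama 3.1 model ID in `text` with its new name (hypothetical helper)."""
    return _PATTERN.sub(lambda match: RENAMES[match.group(0)], text)
```

Already-renamed IDs and IDs outside the list (for example, ones with an extra suffix) pass through unchanged, so the helper can be run repeatedly over the same file.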

I also updated references to Llama 2 or Llama 3 to use the latest versions instead.
I tested one of the configs to verify that it still works.

Before submitting

This PR only changes documentation. (You can ignore the following checks in that case)
Did you read the contributor guideline Pull Request guidelines?
Did you link the issue(s) related to this PR in the section above?
Did you add / update tests where needed?

taenin · 2025-02-04T01:52:12Z

docs/user_guides/train/configuration.md

@@ -58,32 +58,32 @@ Configure the model architecture and loading using the {py:obj}`~oumi.core.confi
 ```yaml
 model:
  # Required
-  model_name: "meta-llama/Llama-2-7b-hf"    # Model ID or path (REQUIRED)
+  model_name: "meta-llama/Llama-3.3-70B-Instruct"    # Model ID or path (REQUIRED)


Are you intentionally changing from 7b -> 70b?

Nope, that's my bad. Thanks for the catch!

docs/user_guides/infer/configuration.md

tests/unit/utils/test_torch_naming_heuristics.py

wizeng23 added 3 commits February 3, 2025 17:21

Update HF model name Meta-LLama to Llama

23ded04

a

9434e68

a

0f3bb2f

wizeng23 requested review from optas, oelachqar, taenin, jgreer013 and nikg4 February 4, 2025 01:46

taenin reviewed Feb 4, 2025

docs/user_guides/infer/configuration.md (outdated, resolved)

oelachqar approved these changes Feb 4, 2025

wizeng23 added 3 commits February 3, 2025 18:01

a

fa1da3f

a

747c36f

merge main

2867406

taenin reviewed Feb 4, 2025

tests/unit/utils/test_torch_naming_heuristics.py (resolved)

taenin approved these changes Feb 4, 2025

wizeng23 merged commit 82617b4 into main Feb 4, 2025
3 checks passed

wizeng23 deleted the wizeng/llama branch February 4, 2025 02:15
