Start using $SKYPILOT_NUM_GPUS_PER_NODE in SkyPilot config #90

Merged: nikg4 merged 2 commits into main from xrdaukar/SKYPILOT_NUM_GPUS_PER_NODE on Jun 18, 2024

@nikg4 (Collaborator) commented Jun 18, 2024

Use $SKYPILOT_NUM_GPUS_PER_NODE in the SkyPilot config to reduce the number of manual edits needed when the GPU count changes.

Also updates README.md.

Towards OPE-16
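
For illustration only, here is a rough Python sketch of the idea: read the GPU count from the environment SkyPilot provides instead of hard-coding it. This is not the change made in this PR, which edits the SkyPilot config itself. The helper name `build_torchrun_args`, the script name `train.py`, the port `8008`, and the local fallback defaults are all assumptions made for the example.

```python
import os
from typing import List, Mapping


def build_torchrun_args(script: str, env: Mapping[str, str] = os.environ) -> List[str]:
    """Build a torchrun command line from SkyPilot-provided environment variables.

    Hypothetical helper for illustration; not part of this PR.
    """
    # SkyPilot sets these on every node of a cluster. The fallback values
    # are assumptions so that the sketch also runs outside SkyPilot.
    gpus_per_node = int(env.get("SKYPILOT_NUM_GPUS_PER_NODE", "1"))
    num_nodes = int(env.get("SKYPILOT_NUM_NODES", "1"))
    node_rank = int(env.get("SKYPILOT_NODE_RANK", "0"))

    # SKYPILOT_NODE_IPS holds one IP per line; the first entry is the head node.
    node_ips = env.get("SKYPILOT_NODE_IPS", "").split() or ["127.0.0.1"]

    return [
        "torchrun",
        f"--nnodes={num_nodes}",
        f"--node_rank={node_rank}",
        f"--nproc_per_node={gpus_per_node}",
        f"--master_addr={node_ips[0]}",
        "--master_port=8008",  # arbitrary port chosen for the example
        script,
    ]


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch has no side effects.
    print(" ".join(build_torchrun_args("train.py")))
```

With this approach, changing the accelerator count in the resources section should not require touching the launch command.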

@nikg4 nikg4 requested review from wizeng23, taenin and oelachqar June 18, 2024 20:57

linear bot commented Jun 18, 2024

OPE-16 Investigate FSDP implementation for LeMa platform

Which implementation & abstraction should we use?

  • torch native
  • accelerate
  • apex
  • megatron
  • others?

@nikg4 nikg4 merged commit 20f8e1d into main Jun 18, 2024
1 check passed
@nikg4 nikg4 deleted the xrdaukar/SKYPILOT_NUM_GPUS_PER_NODE branch June 18, 2024 21:43