
Comments on the "dry run" #12

@ffund

Description

Creating the template:

  • Sanitize and/or give guidance on appropriate project names
  • Ask the user to create a repo named for the project, and supply the repo URL
  • Don't ask for lease name at template creation time.
    • Instead, direct people to use the project name as the prefix for the lease name.
  • Ask which site to put data buckets at; provide a sane default if the user doesn't know or care
  • Some people may be GPU-agnostic (don't care what type of GPU) and should be able to use both GPU types
  • In general, give some additional guidance to help people make good decisions

Workflow:

  • After creating the template, we should advise putting the newly generated project in a GitHub repo
  • The user runs the template generator once; everything they will need must go into it at that point
  • General comment: minimize friction by having the user run each step only at the time it is required

chi materials:

  • In notebook 0, remove angle brackets around project name
  • At the end of notebook 0: remove "next steps: launch GPU instance"
  • At the end of notebook 0: add a section that shows how to check on the newly created buckets using the Horizon GUI
  • Notebook 1: instead of a single fixed lease name, let the lease name start with the project name, and write code to get the active lease whose name starts with the project name (see the sketch after this list)
  • Notebook 1: the project name was not substituted when mounting buckets
  • Notebook 1: the instance name should include the project name instead of mltrain
  • There should be a Notebook 1 equivalent for no GPU, and also for a VM instance (with an NVIDIA GPU and with no GPU)
  • Need to be able to mount object store buckets even if they are at a different site than the compute instance
  • Might need to adjust mount settings for the data bucket (e.g., caching) so that loading data is not very slow (see the mount sketch after this list)
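
For the lease-name item above, a minimal sketch of how Notebook 1 might find the active lease whose name starts with the project name. It assumes the python-chi client factories (`chi.blazar()`) and a `PROJECT_NAME` variable set earlier in the notebook; treat it as a starting point, not the final implementation.

```python
import chi

chi.use_site("CHI@UC")        # whichever site the lease was created at
PROJECT_NAME = "my-project"   # placeholder; set earlier in the notebook

# List leases visible to this project and keep the active one(s)
# whose name starts with the project name.
blazar = chi.blazar()
matching = [
    lease for lease in blazar.lease.list()
    if lease["name"].startswith(PROJECT_NAME) and lease["status"] == "ACTIVE"
]
if not matching:
    raise RuntimeError(f"No active lease found with prefix {PROJECT_NAME!r}")
lease = matching[0]
print("Using lease:", lease["name"], lease["id"])
```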
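
For the cross-site mounting and mount-settings items, a sketch assuming rclone is used to mount the bucket, with a remote already configured in rclone.conf. The remote name `chi_tacc`, bucket name, mount point, and cache sizes are placeholders; the VFS cache options are one way to make data loading less slow, not a tuned recommendation.

```python
import subprocess

# Mount a data bucket that lives at a different site than the compute
# instance; the rclone remote ("chi_tacc" here) determines which site's
# object store is used, independent of where this instance runs.
subprocess.run(
    [
        "rclone", "mount",
        "chi_tacc:object-persist-my-project", "/mnt/data",
        "--read-only",                  # data bucket is read-only on the instance
        "--allow-other",                # let containers and other users read it
        "--vfs-cache-mode", "full",     # cache file contents locally on first read
        "--vfs-cache-max-size", "20G",  # bound the size of the local cache
        "--dir-cache-time", "1h",       # avoid re-listing the bucket constantly
        "--daemon",                     # keep the mount alive in the background
    ],
    check=True,
)
```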

Post-chi materials:

  • Need a new notebook after Notebook 1, for setting up environment variables, building container images, and starting containers
    • If I don't have or need a Hugging Face (HF) token, it should do something sane
    • Specify HF_HOME where the object store bucket for datasets is mounted, but also specify HF_TOKEN_PATH in an ephemeral location (see the environment sketch after this list)
  • The whole Docker workflow assumes NVIDIA hardware and PyTorch
  • Don't automatically assume Lightning
  • When installing the MLflow Python client in the Dockerfile, match the MLflow server version
  • Only log in to GitHub when there is something to push
  • mlflow.set_experiment: use the project name (see the MLflow sketch after this list)
  • Git context
  • In examples, don't put everything in one cell
  • There should be notebook and src examples for each
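
For the HF_HOME / HF_TOKEN_PATH item, a minimal sketch of how the new notebook might set these before building and starting containers. The paths are placeholders; the point is that the dataset/model cache lives on the mounted object store bucket while the token stays in an ephemeral location, and that a missing token is handled gracefully.

```python
import os
from pathlib import Path

# Placeholder paths: /mnt/data is the mounted object store bucket,
# /tmp is ephemeral local storage on the instance.
os.environ["HF_HOME"] = "/mnt/data/hf_home"    # HF cache lives on the bucket
os.environ["HF_TOKEN_PATH"] = "/tmp/hf_token"  # token never lands in the bucket

hf_token = os.environ.get("HF_TOKEN", "")
if hf_token:
    token_path = Path(os.environ["HF_TOKEN_PATH"])
    token_path.write_text(hf_token)
    token_path.chmod(0o600)
else:
    # No token available (or needed): public models and datasets still work,
    # so continue with anonymous access instead of failing.
    print("No HF token set; continuing with anonymous access.")
```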
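
For the MLflow items, a sketch of using the project name as the experiment name, plus one way to check that the client version matches the server before pinning it in the Dockerfile. The tracking URI and project name are placeholders, and the version check assumes a standard `mlflow server` deployment that reports its version at `/version`.

```python
import mlflow
import requests

PROJECT_NAME = "my-project"                      # placeholder
TRACKING_URI = "http://mlflow.example.org:8000"  # placeholder

mlflow.set_tracking_uri(TRACKING_URI)

# Assumption: the MLflow tracking server exposes its version at /version.
server_version = requests.get(f"{TRACKING_URI}/version", timeout=10).text.strip()
if server_version != mlflow.__version__:
    print(f"Client {mlflow.__version__} != server {server_version}: "
          f"pin 'mlflow=={server_version}' in the Dockerfile.")

# Use the project name as the experiment name, per the note above.
mlflow.set_experiment(PROJECT_NAME)
with mlflow.start_run():
    mlflow.log_param("example_param", 1)
```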
