[Backend + AI Server] Train job #159

Open
5 of 6 tasks
Benvii opened this issue Jun 27, 2022 · 1 comment

Benvii commented Jun 27, 2022

We need to:

  • Provide a Bash script that opens an SSH tunnel from the manager to the AI trainer machine and starts the AI trainer runner service
  • Have a service file and Ansible setup so that it can be deployed on the GPU virtual machine
  • Uncompress the dataset from the FTP server
  • Start donkeycar train
  • Compress the resulting models and upload them to the FTP server
  • Add some kind of "waiting for model job" state on the car?
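
A rough sketch of the trainer-side steps from the list (uncompress, train, compress, upload), in Python since donkeycar is a Python project. Everything here is an assumption for illustration: the function names are hypothetical, the dataset is assumed to be a tar archive, and the `donkey train --tub ... --model ...` invocation should be checked against the installed donkeycar version. The training command is only built, not executed.

```python
import ftplib
import tarfile
from pathlib import Path


def extract_dataset(archive: Path, workdir: Path) -> Path:
    """Unpack a dataset archive (assumed to be a tarball) into workdir."""
    workdir.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive, "r:*") as tar:
        tar.extractall(workdir)
    return workdir


def build_train_command(tub_dir: Path, model_path: Path) -> list[str]:
    """Build the training command line; flags are assumed, verify them."""
    return ["donkey", "train", "--tub", str(tub_dir), "--model", str(model_path)]


def compress_models(models_dir: Path, archive: Path) -> Path:
    """Pack everything under models_dir into a gzip-compressed tarball."""
    with tarfile.open(archive, "w:gz") as tar:
        tar.add(models_dir, arcname=models_dir.name)
    return archive


def upload_archive(ftp: ftplib.FTP, archive: Path) -> None:
    """Upload the archive over an already logged-in FTP connection."""
    with archive.open("rb") as fh:
        ftp.storbinary(f"STOR {archive.name}", fh)
```

The runner service could pass the result of `build_train_command` to `subprocess.run(..., check=True)` between the extract and compress steps, so a failed training run stops the job before anything is uploaded.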

We might need an abstract JobManager with JobManagerCar and JobManagerIA implementations.
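
One possible shape for that abstraction, as a minimal sketch. Only the three class names come from this issue; the `run`, `prepare` and `execute` methods, the `log` attribute, and the step descriptions are hypothetical placeholders, not an existing API.

```python
from abc import ABC, abstractmethod


class JobManager(ABC):
    """Shared job flow; subclasses supply the side-specific steps."""

    def __init__(self) -> None:
        self.log: list[str] = []  # hypothetical: records the steps taken

    def run(self, job_id: str) -> str:
        """Run one job: prepare first, then execute, and return a status."""
        self.prepare(job_id)
        return self.execute(job_id)

    @abstractmethod
    def prepare(self, job_id: str) -> None:
        """Get the inputs of the job ready."""

    @abstractmethod
    def execute(self, job_id: str) -> str:
        """Do the actual work and return a status string."""


class JobManagerCar(JobManager):
    """Car side: send the dataset, then wait for the trained model."""

    def prepare(self, job_id: str) -> None:
        self.log.append(f"{job_id}: compress and upload dataset")

    def execute(self, job_id: str) -> str:
        self.log.append(f"{job_id}: wait for model")
        return "waiting"


class JobManagerIA(JobManager):
    """Trainer side: fetch the dataset, train, send the models back."""

    def prepare(self, job_id: str) -> None:
        self.log.append(f"{job_id}: download and uncompress dataset")

    def execute(self, job_id: str) -> str:
        self.log.append(f"{job_id}: train, compress and upload models")
        return "done"
```

Keeping the ordering in `JobManager.run` means both sides follow the same job flow, and the "waiting for model job" state on the car would be one status value returned by `JobManagerCar.execute`.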

Benvii added the enhancement (New feature or request) label Jun 27, 2022
Benvii added this to the V2 - IA for everyone milestone Jun 27, 2022
Benvii self-assigned this Jun 27, 2022
Benvii mentioned this issue Sep 24, 2022

Benvii commented Sep 24, 2022

Mostly done in #175

Still need the service files for the trainer daemon.
