Livepeer.Cloud SPE - Proposal #2 - Enable Single Orchestrator AI Job Testing Support for Gateway Nodes #3241

mikezupper · 2024-11-07T14:10:04Z

What does this pull request do? Explain your changes. (required)
Adds features that enable AI job testing through gateway nodes. The gateway node has several hard-coded timeout and cache values that need to be configurable so that a gateway can send an AI job to a specific orchestrator for testing.

Specific updates (required)

Introduce several new startup flags to enable a gateway node to support AI Job testing.
- aiTesterGateway - a boolean that enables the gateway node to bypass AI Session caching. Defaults to false to prevent any behavior changes to the default gateway node.
- aiSessionTimeout - a duration value that allows the AI session timeouts to be configured to a desired value. The default is 600s to match the existing hard-coded value.
- webhookRefreshInterval - a duration value that allows the orchWebhookUrl cached responses to be configured to a desired value. The default is 60s to match the existing hard-coded value.
- LIVEPEER_OS_HTTP_TIMEOUT - an environment variable rather than a flag, because the code that reads it is standalone and cannot use the common livepeer flags. It is a duration value that allows the download timeout for AI assets (.mp4 files, etc.) to be configured to a desired value. The default is 4s to match the existing hard-coded value.
A new HTTP endpoint was added to fetch the AI capabilities of each orchestrator (/getOrchestratorAICapabilities). This endpoint provides the AI Job Tester with information on the AI models available on all orchestrators.

How did you test each of these updates (required)
Each of the new flags and timeout values was manually tested in our development environments. They are also deployed to the testing and production Livepeer.Cloud SPE AI Gateway nodes.

Does this pull request close any open issues?
No

Checklist:

[ X] Read the contribution guide
[ X] make runs successfully
[ X] All tests in ./test.sh pass
README and other documentation updated
Pending changelog updated

thomshutt · 2024-11-25T12:43:54Z

core/os.go

+
+	// Return the HTTP client with the calculated timeout
+	return &http.Client{
+		Transport: &http.Transport{TLSClientConfig: &tls.Config{InsecureSkipVerify: true}},


@mikezupper Why are we skipping TLS here?

core/os.go

@@ -76,7 +103,7 @@
 func downloadDataHTTP(ctx context.Context, uri string) ([]byte, error) {
 	clog.V(common.VERBOSE).Infof(ctx, "Downloading uri=%s", uri)
 	started := time.Now()
-	resp, err := httpc.Get(uri)
+	resp, err := osHttpClient.Get(uri)


rickstaa · 2024-11-22T00:46:11Z

@thomshutt this pull request overlaps with #3246, which followed #3052. It can be merged once #3246 is merged and this pull request is rebased.

leszko · 2025-01-02T08:57:19Z

@mikezupper @thomshutt what's the plan for this PR? Do we plan to review/merge/productionize it?

I can review and help with that, but I'd like to know what's the plan in the context of the AI Video work.

Livepeer.Cloud SPE - Proposal #2 - Enable Single Orchestrator AI Job …

1606ad3

…Tests

mikezupper mentioned this pull request Nov 7, 2024

Call-05 Agenda - 2024-11-07 livepeer/project-management#77

Open

github-advanced-security bot found potential problems Nov 7, 2024

thomshutt requested review from rickstaa, j0sh and leszko November 25, 2024 12:43
