[OPIK-415] Compute traces cost based on token usage #703

BorisTkachenko · 2024-11-22T17:00:37Z

Details

Computes trace cost based on token usage

Resolves #OPIK-415

Testing

Added integration tests

Documentation

https://www.notion.so/cometml/Opik-Span-Cost-tracking-13c7124010a380a7ad36ced128a26dc7

thiagohora · 2024-11-24T20:20:01Z

apps/opik-backend/src/main/java/com/comet/opik/domain/SpanDAO.java

@@ -581,7 +581,7 @@ AND id in (
            """;

    private static final String ESTIMATED_COST_VERSION = "1.0";
-    private static final BigDecimal ZERO_COST = new BigDecimal("0.00000000");
+    public static final BigDecimal ZERO_COST = new BigDecimal("0.00000000");


Suggested change

public static final BigDecimal ZERO_COST = new BigDecimal("0.00000000");

public static final BigDecimal ZERO_COST = new BigDecimal.ZERO;

thiagohora · 2024-11-24T20:21:35Z

apps/opik-backend/src/main/java/com/comet/opik/domain/TraceDAO.java

@@ -747,6 +753,9 @@ private Publisher<Trace> mapToDto(Result result) {
                        .filter(it -> !it.isEmpty())
                        .orElse(null))
                .usage(row.get("usage", Map.class))
+                .totalEstimatedCost(row.get("total_estimated_cost", BigDecimal.class).equals(ZERO_COST)


We probably should use compareTo

Yes, now it's a must, since we changed to BigDecimal.ZERO.
equals works only for the same precision comparison and is not robust. Good catch.

thiagohora · 2024-11-25T14:45:11Z

apps/opik-backend/src/main/java/com/comet/opik/domain/SpanDAO.java

@@ -581,7 +581,7 @@ AND id in (
            """;

    private static final String ESTIMATED_COST_VERSION = "1.0";
-    private static final BigDecimal ZERO_COST = new BigDecimal("0.00000000");
+    public static final BigDecimal ZERO_COST = BigDecimal.ZERO;


No need for this constant just use the one from BigDecimal

thiagohora · 2024-11-25T14:46:14Z

apps/opik-backend/src/test/java/com/comet/opik/api/resources/v1/priv/TracesResourceTest.java

+                    .toBuilder()
+                    .id(null)
+                    .projectName(projectName)
+                    .usage(Map.of("completion_tokens", 200 * 5L, "prompt_tokens", 300 * 5L, "total_tokens", 4 * 5L))


Can we generate random values to make the test more robust?

No we can't. It should be related to spans to assert and also usage keys should be specific for cost calculation and can't be random.

I mean something like this:

Spans

.usage(Map.of("completion_tokens", factory. manufacturePojo(Integer.class), "prompt_tokens", factory. manufacturePojo(Integer.class), "total_tokens", factory. manufacturePojo(Integer.class)))

Then, in the traces, we just group by usage name and calculate the avg expected

I updated per your request. But could you please explain how hardcoded usage values might introduce flakiness?

thiagohora · 2024-11-26T08:17:23Z

apps/opik-backend/src/test/java/com/comet/opik/api/resources/v1/priv/TracesResourceTest.java

+            assertThat(traceExpectedCost.compareTo(BigDecimal.ZERO) == 0 ?
+                    createdTrace.totalEstimatedCost() == null :
+                    traceExpectedCost.compareTo(createdTrace.totalEstimatedCost()) == 0)
+                    .isEqualTo(true);


Suggested change

assertThat(traceExpectedCost.compareTo(BigDecimal.ZERO) == 0 ?

createdTrace.totalEstimatedCost() == null :

traceExpectedCost.compareTo(createdTrace.totalEstimatedCost()) == 0)

.isEqualTo(true);

var actual = createdTrace.totalEstimatedCost();

traceExpectedCost = traceExpectedCost.compareTo(BigDecimal.ZERO) == 0 ? null : traceExpectedCost;

assertThat(actual)

.usingRecursiveComparison(RecursiveComparisonConfiguration.builder()

.withComparatorForType(BigDecimal::compareTo, BigDecimal.class)

.build())

.isEqualTo(traceExpectedCost);

BorisTkachenko self-assigned this Nov 22, 2024

BorisTkachenko requested a review from a team as a code owner November 22, 2024 17:00

BorisTkachenko force-pushed the boryst/OPIK-415-tracing-compute-traces-cost-based-on-token-usage branch from 00b21ea to dd0b37f Compare November 22, 2024 17:01

thiagohora reviewed Nov 25, 2024

View reviewed changes

BorisTkachenko requested a review from thiagohora November 25, 2024 08:48

BorisTkachenko force-pushed the boryst/OPIK-415-tracing-compute-traces-cost-based-on-token-usage branch 3 times, most recently from d8a8ec8 to 2be4465 Compare November 25, 2024 14:34

thiagohora reviewed Nov 25, 2024

View reviewed changes

BorisTkachenko force-pushed the boryst/OPIK-415-tracing-compute-traces-cost-based-on-token-usage branch from 2be4465 to 9215e66 Compare November 25, 2024 15:19

BorisTkachenko requested a review from thiagohora November 25, 2024 15:20

BorisTkachenko force-pushed the boryst/OPIK-415-tracing-compute-traces-cost-based-on-token-usage branch from 9215e66 to 6a2f37b Compare November 25, 2024 16:33

Borys Tkachenko added 4 commits November 26, 2024 09:10

OPIK-415 Compute traces cost based on token usage

aa55ef6

Fix comments

1e27e2e

fix comment

e171b44

Refactor test

2ea154e

BorisTkachenko force-pushed the boryst/OPIK-415-tracing-compute-traces-cost-based-on-token-usage branch from 6a2f37b to 2ea154e Compare November 26, 2024 08:10

thiagohora approved these changes Nov 26, 2024

View reviewed changes

BorisTkachenko merged commit 1d68993 into main Nov 26, 2024
7 checks passed

BorisTkachenko deleted the boryst/OPIK-415-tracing-compute-traces-cost-based-on-token-usage branch November 26, 2024 08:16

thiagohora reviewed Nov 26, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[OPIK-415] Compute traces cost based on token usage #703

[OPIK-415] Compute traces cost based on token usage #703

BorisTkachenko commented Nov 22, 2024

thiagohora Nov 24, 2024

BorisTkachenko Nov 25, 2024

thiagohora Nov 24, 2024

BorisTkachenko Nov 25, 2024

thiagohora Nov 25, 2024

BorisTkachenko Nov 25, 2024

thiagohora Nov 25, 2024

BorisTkachenko Nov 25, 2024

thiagohora Nov 25, 2024 •

edited

Loading

BorisTkachenko Nov 25, 2024

thiagohora Nov 26, 2024

	public static final BigDecimal ZERO_COST = new BigDecimal("0.00000000");
	public static final BigDecimal ZERO_COST = new BigDecimal.ZERO;

-            assertThat(traceExpectedCost.compareTo(BigDecimal.ZERO) == 0 ?
-                    createdTrace.totalEstimatedCost() == null :
-                    traceExpectedCost.compareTo(createdTrace.totalEstimatedCost()) == 0)
-                    .isEqualTo(true);
+             var actual = createdTrace.totalEstimatedCost();
+             traceExpectedCost = traceExpectedCost.compareTo(BigDecimal.ZERO) == 0 ? null : traceExpectedCost;
+            assertThat(actual)
+                      .usingRecursiveComparison(RecursiveComparisonConfiguration.builder()
+                            .withComparatorForType(BigDecimal::compareTo, BigDecimal.class)
+                            .build())
+                    .isEqualTo(traceExpectedCost);

[OPIK-415] Compute traces cost based on token usage #703

[OPIK-415] Compute traces cost based on token usage #703

Conversation

BorisTkachenko commented Nov 22, 2024

Details

Testing

Documentation

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

thiagohora Nov 25, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

thiagohora Nov 25, 2024 •

edited

Loading