now fetching operatorId dynamically in economicCollector #54

samlaf · 2023-10-24T00:57:12Z

Motivation

The EigenDA team was running into errors because the economicCollector fetched the operatorId from the EigenDA registry coordinator in its constructor, which runs before the node starts. The operator only gets registered in the start() function, which runs later, so the economicCollector never picks up the operatorId and keeps erroring on every collect call.

Solution

This PR moves fetching the operatorId out of the constructor and into the collect call. This adds one round trip of latency to the node on every collect (every ~15 sec), but it fixes the problem. Another solution would be to cache the operatorId once a collect call has fetched it successfully, but this could break if we later allow the operatorId to change.
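
To make the ordering issue concrete, here is a minimal sketch, not the actual collector code. `fakeRegistry`, `GetOperatorID`, `NewCollector` and `errNotRegistered` are hypothetical names made up for illustration; the real collector talks to the registry coordinator through its own client. The constructor makes no registry call, so it does not matter that the operator is registered later; each `Collect` pays one lookup instead.

```go
package main

import (
	"errors"
	"fmt"
)

// OperatorID stands in for the 32-byte operator identifier.
type OperatorID [32]byte

// errNotRegistered is a hypothetical error for an operator that is not registered yet.
var errNotRegistered = errors.New("operator not registered")

// fakeRegistry is a hypothetical in-memory stand-in for the registry coordinator.
type fakeRegistry struct {
	ids map[string]OperatorID
}

// GetOperatorID returns the ID for an address, or errNotRegistered if unknown.
func (r *fakeRegistry) GetOperatorID(addr string) (OperatorID, error) {
	id, ok := r.ids[addr]
	if !ok {
		return OperatorID{}, errNotRegistered
	}
	return id, nil
}

// Collector looks the operator ID up at collect time rather than in its constructor.
type Collector struct {
	registry *fakeRegistry
	addr     string
}

// NewCollector deliberately makes no registry call: the operator may not exist yet.
func NewCollector(r *fakeRegistry, addr string) *Collector {
	return &Collector{registry: r, addr: addr}
}

// Collect performs one registry lookup per call.
func (c *Collector) Collect() (OperatorID, error) {
	return c.registry.GetOperatorID(c.addr)
}

func main() {
	reg := &fakeRegistry{ids: map[string]OperatorID{}}
	c := NewCollector(reg, "0xabc") // built before the operator registers

	_, err := c.Collect()
	fmt.Println("before registration:", err)

	reg.ids["0xabc"] = OperatorID{1} // simulates registration during start()
	id, err := c.Collect()
	fmt.Println("after registration:", id[0], err)
}
```

The first `Collect` fails and the second succeeds without rebuilding the collector, which is the behaviour the constructor-time fetch could not provide.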

Open questions

metrics/collectors/economic/economic.go

jianoaix

It looks reasonable to cache the operatorID, as it's something extremely rare (and difficult) to change?

jianoaix · 2023-10-24T03:21:42Z

metrics/collectors/economic/economic.go

+			for quorumIdx, quorumNum := range quorumNums {
+				// TODO: this is stupid.. when AVSs scale to have 5K operators we'll be running through a bunch of operators
+				// we should instead just call registryCoordinator.getQuorumBitmapIndicesByOperatorIdsAtBlockNumber
+				// and stakeRegistry.getStakeForOperatorIdForQuorumAtBlockNumber directly


Why don't we do these two per-operator fetches?

Because I was lazy at the time. Fixed: 858ae16
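
For readers following the TODO in the hunk above, a rough in-memory analogy of the two approaches is sketched here. Everything in it (`operatorStake`, `scanForStake`, `stakeIndex`, `stakeFor`) is hypothetical and only illustrates the cost difference; it is not the contract binding, and the real fix queries the registry coordinator and stake registry by operator ID.

```go
package main

import "fmt"

// OperatorID stands in for the 32-byte operator identifier.
type OperatorID [32]byte

// operatorStake is a hypothetical record for one operator in one quorum.
type operatorStake struct {
	ID    OperatorID
	Stake uint64
}

// scanForStake walks every operator in a quorum until it finds the one we want.
// Its cost grows with the number of operators in the quorum.
func scanForStake(operators []operatorStake, id OperatorID) (uint64, bool) {
	for _, op := range operators {
		if op.ID == id {
			return op.Stake, true
		}
	}
	return 0, false
}

// stakeKey identifies one operator in one quorum.
type stakeKey struct {
	Quorum uint8
	ID     OperatorID
}

// stakeIndex models a source that can answer a per-operator query directly.
type stakeIndex map[stakeKey]uint64

// stakeFor returns the stake for one operator in one quorum without scanning.
func (s stakeIndex) stakeFor(quorum uint8, id OperatorID) (uint64, bool) {
	stake, ok := s[stakeKey{Quorum: quorum, ID: id}]
	return stake, ok
}

func main() {
	me := OperatorID{42}
	operators := []operatorStake{
		{ID: OperatorID{1}, Stake: 10},
		{ID: me, Stake: 25},
	}
	index := stakeIndex{
		stakeKey{Quorum: 0, ID: OperatorID{1}}: 10,
		stakeKey{Quorum: 0, ID: me}:            25,
	}

	scanned, _ := scanForStake(operators, me)
	direct, _ := index.stakeFor(0, me)
	fmt.Println("scan:", scanned, "direct:", direct)
}
```

Both paths return the same stake; the difference is that the scan touches every operator while the keyed query touches one entry.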

samlaf · 2023-10-24T18:39:01Z

It looks reasonable to cache the operatorID, as it's something extremely rare (and difficult) to change?

changed to cache: 5b942b6
@shrimalmadhur @jianoaix should be good to go.

shrimalmadhur · 2023-10-24T19:09:08Z

metrics/collectors/economic/economic.go

@@ -110,6 +108,17 @@ func (ec *Collector) Describe(ch chan<- *prometheus.Desc) {
 	// ch <- ec.delegatedShares
 }

+func (ec *Collector) cacheOperatorIdIfNotCached() error {


cacheOperatorIdIfNotCached is a weird name. Doesn't "cache" already mean that if a key is not present it gets cached? Wouldn't just cacheOperatorId be a good name for this?

Not in this case because we only cache the very first time this is called. If it's already cached we don't update the cache.

Maybe maybeCacheOperator, or cacheOperatorIfNeeded? Feel free to find another name, but I don't think cacheOperatorId works.

Renamed to initOperatorId as we discussed on slack: fa42170

jianoaix · 2023-10-25T00:08:09Z

metrics/collectors/economic/economic.go

@@ -110,6 +108,17 @@ func (ec *Collector) Describe(ch chan<- *prometheus.Desc) {
 	// ch <- ec.delegatedShares
 }

+func (ec *Collector) cacheOperatorIdIfNotCached() error {


IMO just something like getOperatorID() will be good. The cache is internal detail/optimization that the caller doesn't need to care about.
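
One possible shape for that suggestion, as a sketch only: an accessor that returns the ID and keeps the cache private. The `fetch` field, the `cached` flag and `errNotRegistered` are hypothetical names, not the merged code. The mutex is there on the assumption that the accessor is reached from a Prometheus `Collect`, which can be called concurrently. A failed lookup is not cached, so the call after registration retries.

```go
package main

import (
	"errors"
	"fmt"
	"sync"
)

// OperatorID stands in for the 32-byte operator identifier.
type OperatorID [32]byte

// errNotRegistered is a hypothetical error for an operator that is not registered yet.
var errNotRegistered = errors.New("operator not registered")

// Collector hides the operator ID cache behind getOperatorID.
type Collector struct {
	mu     sync.Mutex
	fetch  func() (OperatorID, error) // hypothetical registry lookup
	id     OperatorID
	cached bool
}

// getOperatorID returns the cached ID, fetching it on first successful use.
func (c *Collector) getOperatorID() (OperatorID, error) {
	c.mu.Lock()
	defer c.mu.Unlock()
	if c.cached {
		return c.id, nil
	}
	id, err := c.fetch()
	if err != nil {
		// Do not cache failures, so the next call tries again.
		return OperatorID{}, err
	}
	c.id, c.cached = id, true
	return id, nil
}

func main() {
	registered := false
	calls := 0
	c := &Collector{fetch: func() (OperatorID, error) {
		calls++
		if !registered {
			return OperatorID{}, errNotRegistered
		}
		return OperatorID{9}, nil
	}}

	_, err := c.getOperatorID()
	fmt.Println("before registration:", err)

	registered = true
	for i := 0; i < 3; i++ {
		id, _ := c.getOperatorID()
		fmt.Println("operator id byte:", id[0])
	}
	fmt.Println("registry calls:", calls)
}
```

In this run the lookup happens twice: once for the failed call and once for the first successful call. The trade-off raised in the PR description still applies: if the operatorId were ever allowed to change, a cache like this would go stale.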

samlaf requested review from shrimalmadhur and jianoaix October 24, 2023 00:57

shrimalmadhur reviewed Oct 24, 2023

samlaf force-pushed the samlaf/fix-economic-collector branch from b2bde68 to 7d096fb Compare October 24, 2023 03:11

jianoaix reviewed Oct 24, 2023

samlaf added 3 commits October 24, 2023 11:24

now fetching operatorId dynamically in economicCollector

a8930d9

fix economic collect error logic

147eb5c

fix test

973343f

samlaf force-pushed the samlaf/fix-economic-collector branch from 7d096fb to 973343f Compare October 24, 2023 18:24

samlaf added 2 commits October 24, 2023 11:30

make economic collector more efficient (less stupid)

858ae16

caching operatorId in economic collector

5b942b6

fix bug

593a6ac

shrimalmadhur reviewed Oct 24, 2023

shrimalmadhur previously approved these changes Oct 24, 2023

renamed cacheOperatorIfNotCached -> initOperatorId

fa42170

samlaf dismissed shrimalmadhur’s stale review via fa42170 October 24, 2023 22:32

jianoaix approved these changes Oct 25, 2023

samlaf merged commit 71ec295 into master Oct 25, 2023
3 checks passed

samlaf deleted the samlaf/fix-economic-collector branch October 25, 2023 01:56
