Add MongoDB query timeout #3158

jnm · 2021-04-28T01:53:30Z

We have an issue where zombie Mongo queries are able to run for several hours—long after any client that requested them has perished. Unfortunately, a global query timeout seems impossible, but maybe we could wrap pymongo somehow so that it sends maxTimeMS with all queries: https://stackoverflow.com/a/60542564/2402324

Whatever solution we arrive at we should implement in KoBoCAT as well.

Somewhat related to kobotoolbox/kobocat#696 in that both that and this work together to cause MongoDB slowdowns, which lead to users getting 502s

The text was updated successfully, but these errors were encountered:

jnm · 2021-05-05T07:21:55Z

Elevated the priority because the servers are really struggling under the load.

As I mentioned earlier, I don't see a way to set maxTimeMS on every query used with our pymongo MONGO_DB, but maybe the easiest thing to do is to make a helper function that wraps MONGO_DB.instances.find() and adds the maxTimeMS argument.

The limit should be CELERY_TASK_TIME_LIMIT (converted to milliseconds) + some grace period

kpi/kobo/settings/base.py

Lines 422 to 426 in 7edbc13

    
           # Default to a 30-minute soft time limit and a 35-minute hard time limit 
        
           CELERY_TASK_TIME_LIMIT = int(os.environ.get('CELERYD_TASK_TIME_LIMIT', 2100)) 
        
           CELERY_TASK_SOFT_TIME_LIMIT = int(os.environ.get( 
        
               'CELERYD_TASK_SOFT_TIME_LIMIT', 1800))

jnm · 2021-05-05T08:09:37Z

It's more of a sysadmin thing, but for reference, here's a quick and really-very-dirty 🙈 method to reduce load from runaway queries:

root@mongo:/# while true; do mongo -u root -p "$MONGO_INITDB_ROOT_PASSWORD" admin --eval 'db.currentOp(true).inprog.forEach(function(op){ if(op.secs_running > 2110) { print(op.opid); db.killOp(op.opid) } });'; sleep 10; done

jnm assigned JacquelineMorrissette May 5, 2021

jnm added the high priority To be done soon label May 5, 2021

This was referenced May 12, 2021

Added max_time_ms to mongo queries kobotoolbox/kobocat#710

Merged

Mongo query timeout #3206

Closed

Mongo query timeout #3210

Merged

noliveleger closed this as completed in kobotoolbox/kobocat#710 May 19, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add MongoDB query timeout #3158

Add MongoDB query timeout #3158

jnm commented Apr 28, 2021 •

edited

Loading

jnm commented May 5, 2021

jnm commented May 5, 2021 •

edited

Loading

Add MongoDB query timeout #3158

Add MongoDB query timeout #3158

Comments

jnm commented Apr 28, 2021 • edited Loading

jnm commented May 5, 2021

jnm commented May 5, 2021 • edited Loading

jnm commented Apr 28, 2021 •

edited

Loading

jnm commented May 5, 2021 •

edited

Loading