- There is currently no automatic deploy
- When deploying, on server:
- git pull
- sudo docker compose --compatibility up --build -d
- Then restart containers if necessary
- sudo docker compose restart backend loader
- New municipalities are probably added to the issue tracker.
- Identify supplier (Notubiz / go / etc)
- Open the relevant
ocd_backend/sources
file. - Duplicate bottom municipality
- Set key (equal to shortname, dashes allowed)
- Allmanak code: https://allmanak.nl/cat/1/Gemeenten (https://allmanak.nl/cat/32/Waterschappen), search name, get ID from URL
- For CBS code: https://www.cbs.nl/nl-nl/onze-diensten/methoden/classificaties/overig/gemeentelijke-indelingen-per-jaar/indeling-per-jaar/gemeentelijke-indeling-op-1-januari-2024
- Optionally set
source_name
if municipality name can't be properly derived from shortname. - Optionally set
municipality_prefix
if municipality has multiple suppliers per region. - The next step depends on supplier, see below
- Push to master
ssh
towolf
(ask Breyten)- sh to
redis
(see redis) select 1
for setting individual municipalitiesset "ori.{supplier}.{key}" "all daily monthly"
add municipalityset _all.start_date
set start date for a new run (e.g. when some specific run has to be done - use 2010-01-01 for historic runs)set _all.end_date
to today (xxxx-xx-xx format)exit
- see #starting-a-run below to sh into
backend-${id}
- start the extraction process for the new municipality
sudo docker exec ori_backend_1 ./manage.py extract process all --source_path=ori.notubiz.weesp
. They will be set in a list forcelery
, which means that they will be processed in time. - You can track the progress in the logs under /var/lib/docker/containers for ori_backend_1 and ori_loader_1.
- Update the status per municipality (importing, finished) in the github issue tracker.
Some useful commands to see queues (run from ori_backend_1):
- celery -A ocd_backend.app status
- celery -A ocd_backend.app inspect active
- celery -A ocd_backend.app inspect scheduled
- celery -A ocd_backend.app inspect reserved
- celery -A ocd_backend.app inspect stats
- celery -A ocd_backend.app inspect active_queues To see number of tasks currently waiting:
- sudo docker exec ori_redis_1 redis-cli llen $'loaders\x06\x163' To see details of e.g. first job:
- sudo docker exec ori_redis_1 redis-cli lindex $'loaders\x06\x163' 0 If a task failed with an exception and is queued to be retried it is placed in unacked: sudo docker exec -it ori_redis_1 redis-cli hgetall unacked See also unacked_index (contains the time the task were added): sudo docker exec -it ori_redis_1 redis-cli zrange unacked_index 0 -1 WITHSCORES
- For finding
ibabs_sitename
, google foribabs ${municipality_name}
and derive it from the URL - Duplicate
- Exclude / include are rarely required, but can be useful if one instance is shared across municipalitites
- Usually trying a
base_url
that makes sense works fine
- Go to redis (see devops), set source value to
archived
for the older one.
- Not enough available disk space can cause downtime. Elastic starts to have issues at 80% disk usage - it starts moving stuff to other instances. Fix this by making the disk larger and copying the contents.
- When dealing with IBabs issues, use SoapUI.
- When finding logs for a municipality, use the GCP querybuilder with
textPayload:municipality
- If an error was made in a key when adding a new source, a new run may create a new index in Elastic. The old index, which is probably
still empty, can be removed, but the new index may not yet show up. To fix this make sure that the e.g. municipality is retrieved again:
- get the
item_id
for the municipality (e.g. using alog
statement when retrieving in development) - delete the row with this
item_id
from theItemHash
table - rerun the import
- get the
- ocd_backend contains most logic
- bin
- models Use OpenGov definitions
- sources contain municipalities config
- extractor Responsible for fetching data
- loader Responsible for writing data (e.g. elastic and linked-delta's)
- transformers Responsible for mapping data
- enrichers Extracting text from PDFs, adding locations, adding themes
This project uses cert-manager
for creating certificates.
sudo docker exec -it ori_elastic_1 sh
get a shell in a running elastic container- Send an HTTP DELETE to the ID:
curl -X DELETE 0.0.0.0:9200/${index}/${type}/${id}
, e.g.curl -X DELETE 0.0.0.0:9200/ori_vlaardingen_20190809125128/_doc/1234567
Sometimes VNG wants a list of municipalities. You can use ./fetch_municipalities.sh
to fetch the data for all suppliers and output it in a format suitable for Excel sheets.