Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add ingestor tool #147

Closed
wants to merge 8 commits into from
Closed

Add ingestor tool #147

wants to merge 8 commits into from

Conversation

Marie59
Copy link
Collaborator

@Marie59 Marie59 commented Dec 4, 2024

Hi all,

New tool to aggregate and harmonized the Biogeochemical data from Argo raw data and from Gliders.
This is part of one of FAIR-EASE pilots (Ocean Biogeochemical) in which @alban.sizun and @clement.weber !

  • I invite you to check the lats section on help see if it's okay with you.
  • Also if the name and the small description at the beginning is good enough to make the tool findable
  • And also the help line in the input is it good ?

@bgruening Is this enough for the test part ? I nerver know how much is enough.

Let's see what crashes ! and thanks for the reviews :)

@bgruening
Copy link
Collaborator

Tests can never be enough ;-)
If roughly every conditional, input output compilation is covered it's ok. This tool LGTM! 🥳

@Marie59
Copy link
Collaborator Author

Marie59 commented Dec 4, 2024

Ah @bgruening I see that the copernicus marine is making this PR bug ... can I skip the test on it for this PR ?

@bgruening
Copy link
Collaborator

One reason more to have those tools in a separate directory :-)

@Marie59
Copy link
Collaborator Author

Marie59 commented Dec 4, 2024

One reason more to have those tools in a separate directory :-)

Oups .... OKay I keep in mind for next time !

@Marie59
Copy link
Collaborator Author

Marie59 commented Dec 4, 2024

Okay so I guess now everything that is related to this new tool is working

@detocjerome
Copy link

I guess the tool will not in charge to collect data. So call it Ingestor will cause incomprehension
I propose for Galaxy purpose to name it convert_to_bgc_netcdf

@Marie59
Copy link
Collaborator Author

Marie59 commented Dec 4, 2024

I guess the tool will not in charge to collect data. So call it Ingestor will cause incomprehension I propose for Galaxy purpose to name it convert_to_bgc_netcdf

  • convert_to_bgc_netcdf would more fit for an id not a tool name let's say id=convert_to_bgc_netcdf
  • name=Netcdf aggregator
  • description=for marine biogeochemical data
    (Knowing that in the tool panel whats going to be seen is: " Netcdf aggregator for marine biogeochemical data "

@cwebear
Copy link

cwebear commented Dec 4, 2024

Hello Marie and Jérôme,
This tool takes in entry files of Argo or Glider data (T,S or BGC, or both) (provided by the user on galaxy) and returns this set of files harmonized (i.e renamed and variables filtered) following qcv platform convention and aggragated (i.e, all the files having different shapes, are aligned to create a single file readable as a dataset).

For the needs of Galaxy I suggest
id = harmonize_insitu_to_netcdf
name = qcv_app_harmonizer
description = Harmonizer and aggregator of insitu marine physical and biogeochimical data. This tools works for Argo or Gliders datafiles following CMEMS convention.

What do you think ?

@detocjerome
Copy link

I always have a kind of suppressed admiration for people who can get everyone to agree.
In other words OK
I hope you enjoy this discovery
https://youtu.be/II1vza8955g

@detocjerome
Copy link

Otherwise, this isn't the place to say this any more than it is to share the music, but it's coming along nicely.

@Marie59
Copy link
Collaborator Author

Marie59 commented Dec 4, 2024

Hello Marie and Jérôme, This tool takes in entry files of Argo or Glider data (T,S or BGC, or both) (provided by the user on galaxy) and returns this set of files harmonized (i.e renamed and variables filtered) following qcv platform convention and aggragated (i.e, all the files having different shapes, are aligned to create a single file readable as a dataset).

For the needs of Galaxy I suggest id = harmonize_insitu_to_netcdf name = qcv_app_harmonizer description = Harmonizer and aggregator of insitu marine physical and biogeochimical data. This tools works for Argo or Gliders datafiles following CMEMS convention.

What do you think ?

All good for me @cwebear I would just change the name to sometihing more user friendly like "QCV harmonizer" ?
that way in the end the user will see
"QCV harmonizer and aggregator of insitu marine physical and biogeochimical data"

@Marie59 Marie59 mentioned this pull request Dec 5, 2024
@Marie59
Copy link
Collaborator Author

Marie59 commented Dec 5, 2024

I am fixing my folder issue (to make something cleaner), the PR keeps going here #148

@Marie59 Marie59 closed this Dec 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants