-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Relocate dependencies #39
Comments
Seems reasonable but this specific issue (with lineToLineDelimiterStats) is from gbif-common. |
I'd encourage that anything that is expected to be used widely as a lib needs to keep dependencies to a minimum and consider relocating (especially things like jackson, guava, commons-lang which are really volatile across versions). Where only a few really basic utils are needed (e.g. strings being null or empty) might it be worth even considering adding that code natively? |
Fully agree with the goal to keep dependencies minimal.
|
registry-metadata is pretty bad and should intuitively not be necessary. |
The DatasetParser is used to return a GBIF-API Dataset from an Archive: https://github.com/gbif/dwca-io/blob/master/src/main/java/org/gbif/dwca/io/Archive.java#L232 |
After removing the DatasetParser and Dataset we have this (test dependencies omitted)
|
…ather than EML. Many fewer dependencies now: #39
unfortunately this has increased considerably as #47 noticed, mostly via the gbif-commons dependency:
Maybe we can exclude all of simple-odf and poi-ooxml? Or do we need to be able to parse excel sheets in dwca-io? Replacing apache digester with some less heavy xml parsing is also pretty simple for the meta.xml part. E.g. I have used Stax in the CoL+ project, which pulls in the woodstax parser. |
gbif-commons's text handling has the ability to parse Excel and OpenDocument spreadsheets. This can safely be excluded by most users; it's probably appropriate to refactor it into a submodule of gbif-common. However, I think we ourselves do use it to gracefully accept a CSV or spreadsheet with DWC headers. |
In fact, we don't — not here anyway, the things that do that convert the spreadsheets to CSV before passing them on to DWCA-IO. I've moved the spreadsheet handling classes to a new module, so this is now back down to where it was in March.
|
I suggest this project use minimal dependencies and relocate them to avoid e.g.:
The text was updated successfully, but these errors were encountered: