Commit

add extraction to outs
tomlue committed Oct 26, 2024
1 parent c790bb8 commit 8f7c75b
Showing 3 changed files with 12 additions and 2 deletions.
9 changes: 8 additions & 1 deletion dvc.yaml
@@ -23,4 +23,11 @@ stages:
     outs:
       - brick/riskder.pdf:
           persist: true
-      - brick/riskder.parquet
+      - brick/riskder.parquet
+
+  extract_data:
+    cmd: python stages/03_data_extractor.py
+    deps:
+      - brick/riskder.pdf
+    outs:
+      - brick/extraction.parquet
3 changes: 3 additions & 0 deletions notebook.py
@@ -0,0 +1,3 @@
+import pandas as pd
+
+results = pd.read_parquet('brick/riskder.parquet')
2 changes: 1 addition & 1 deletion stages/03_data_extractor.py
@@ -128,5 +128,5 @@ def extract_testing_results(pdf_path):
continue


-aggdf.to_csv('extraction.csv')
+aggdf.to_csv('brick/extraction.csv')
# endregion
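
Judging only from the lines visible in this diff, the `extract_data` stage declares `brick/extraction.parquet` under `outs`, while the changed line in `stages/03_data_extractor.py` writes `brick/extraction.csv`. The script may write the parquet file elsewhere, outside this hunk. As a rough sketch for checking that kind of mismatch, the hypothetical helper below (`missing_outputs` is not part of the repository or of DVC) reports declared outputs that are absent on disk after a stage has run:

```python
from pathlib import Path
import tempfile


def missing_outputs(declared_outs, root="."):
    """Return the declared output paths that do not exist under root."""
    root = Path(root)
    return [out for out in declared_outs if not (root / out).exists()]


if __name__ == "__main__":
    # Simulate the layout visible in this diff inside a scratch directory.
    with tempfile.TemporaryDirectory() as tmp:
        brick = Path(tmp) / "brick"
        brick.mkdir()
        # What the script writes, per the changed line in 03_data_extractor.py.
        (brick / "extraction.csv").write_text("placeholder\n")
        # What the extract_data stage declares under outs in dvc.yaml.
        declared = ["brick/extraction.parquet"]
        print(missing_outputs(declared, tmp))
```

If the helper returns a non-empty list for a stage, either the `outs` entry or the path the script writes to would need to change so the two agree.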
