calzone: a python package for measuring calibration in probabilistic models

calzone is a comprehensive Python package for calculating and visualizing metrics for assessing the calibration of models with probabilistic output.

Features

Supports multiple calibration metrics including Spiegelhalter's Z-test, Expected Calibration Error (ECE), Maximum Calibration Error (MCE), Hosmer-Lemeshow (HL) test, Cox regression analysis, and Loess regression analysis.
Provides tools for creating reliability diagrams and ROC curves.
Offers equal-space and equal-frequency binning options.
Provides bootstrapped confidence intervals for each calibration metric.
Supports prevelance adjustment to account for prevalance differences between enriched data and population data.
Extends metrics to multiclass classification problems with one-vs-rest or top-class calculations.

To accurately assess the calibration of machine learning models, it is essential to have a comprehensive and representative dataset with sufficient coverage of the prediction space. The calibration metrics are not meaningful if the dataset is not representative of true intended population.

Installation

You can install calzone using pip:

pip install calzone-tool

Usage

run python cal_metrics.py -h to see the help information and usage. To use the package in your Python code, please refer to the examples in the documentation pages.

A GUI is available by running python GUI_cal_metrics.py. Support for the GUI is experiment and requires additional dependencies (i.e., nicegui).

Documentation

For a detailed manual and API reference, please visit our documentation page.

Support

If you encounter any issues or have questions about the package, please open an issue request or contact:

Disclaimer

This software and documentation (the "Software") were developed at the Food and Drug Administration (FDA) by employees of the Federal Government in the course of their official duties. Pursuant to Title 17, Section 105 of the United States Code, this work is not subject to copyright protection and is in the public domain. Permission is hereby granted, free of charge, to any person obtaining a copy of the Software, to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, or sell copies of the Software or derivatives, and to permit persons to whom the Software is furnished to do so. FDA assumes no responsibility whatsoever for use by other parties of the Software, its source code, documentation or compiled executables, and makes no guarantees, expressed or implied, about its quality, reliability, or any other characteristic. Further, use of this code in no way implies endorsement by the FDA or confers any advantage in regulatory decisions. Although this software can be redistributed and/or modified freely, we ask that any derivative works bear some notice that they are derived from it, and any modified versions bear some notice that they have been modified.

Name		Name	Last commit message	Last commit date
Latest commit History 153 Commits
.github/workflows		.github/workflows
calzone		calzone
docs		docs
example_data		example_data
paper		paper
tests		tests
.gitignore		.gitignore
.readthedocs.yaml		.readthedocs.yaml
GUI_cal_metrics.py		GUI_cal_metrics.py
LICENSE		LICENSE
README.md		README.md
cal_metrics.py		cal_metrics.py
citation.cff		citation.cff
gui.png		gui.png
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

calzone: a python package for measuring calibration in probabilistic models

Features

Installation

Usage

Documentation

Support

Disclaimer

About

Releases

Packages

Contributors 2

Languages

License

DIDSR/calzone

Folders and files

Latest commit

History

Repository files navigation

calzone: a python package for measuring calibration in probabilistic models

Features

Installation

Usage

Documentation

Support

Disclaimer

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages