Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Function summarize_target test suite #32

Open
HrayrMuradyan opened this issue Jan 14, 2025 · 1 comment
Open

Function summarize_target test suite #32

HrayrMuradyan opened this issue Jan 14, 2025 · 1 comment
Assignees
Labels
planning Includes tasks related to brainstorming, deciding project scope, and high-level planning
Milestone

Comments

@HrayrMuradyan
Copy link
Collaborator

Plan the summarize_target function test suite.

@HrayrMuradyan HrayrMuradyan added the planning Includes tasks related to brainstorming, deciding project scope, and high-level planning label Jan 14, 2025
@HrayrMuradyan HrayrMuradyan added this to the Milesone 2 milestone Jan 14, 2025
@HrayrMuradyan HrayrMuradyan moved this from Ready to In progress in @summarease-project-board Jan 14, 2025
@Green-zy
Copy link
Collaborator

Expected Use Cases:

when target_type="categorical":
  • correctly calacuate the average proportion of each calss

  • check if the classes are balanced based on the given threshold

    when target_type="numerical":

  • correctly visualize the distribution of a numerical target variable

Edge Cases:

  • Empty dataset input.
  • Missing or non-existent target column in the dataset.
  • Extremely imbalanced class distribution for categorical targets (e.g., one class has 99% of the observations).
  • Numerical target variable with all identical values, making distribution visualization non-informative.

Erroneous/Adversarial Inputs:

  • Invalid target_type value (not within {"categorical", "numerical"}).
  • Providing a non-numeric target column when target_type="numerical".
  • Specifying a non-categorical target column when target_type="categorical".
  • Passing a threshold value outside the valid range (negative or greater than 1).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
planning Includes tasks related to brainstorming, deciding project scope, and high-level planning
Projects
Status: In progress
Development

No branches or pull requests

2 participants