models classification accuracy eval

github-actions[bot] edited this page Aug 29, 2023 · 10 revisions

classification-accuracy-eval

Overview

Description: This flow illustrates how to evaluate the performance of a classification system. It compares each prediction to the groundtruth, assigns a "Correct" or "Incorrect" grade, and aggregates the grades into metrics such as accuracy, which reflects how well the system classifies the data.

Brief description: Measuring the performance of a classification system by comparing its outputs to groundtruth.

### What you will learn

In this flow, you will learn:

- how to compose a point-based evaluation flow, in which you can calculate point-wise metrics.
- how to log metrics.

#### Evaluate a classification flow

There are two ways to evaluate a classification flow.

* Run a classification flow and an evaluation flow together
  * step 1: create or clone a classification flow
  * step 2: select bulk test and fill in the variants, then click Next
  * step 3: fill in the test data, then click Next
  * step 4: on the evaluation settings page, select 'Classification Accuracy Evaluation' from 'Sample evaluation flows', map the evaluation flow's inputs to the inputs or outputs of the flow being evaluated, then click Next
  * step 5: review the run settings and submit
* Run 'Classification Accuracy Evaluation' on an existing classification flow run
  * step 1: create a bulk test run of the classification flow and submit it
  * step 2: click 'View run history' to open the page listing all submitted runs, then select a bulk test in the bulk runs panel to open its details page
  * step 3: click 'New evaluation', then select one or more variants and 'Classification Accuracy Evaluation' from 'Sample evaluation flows'. Then set the connections and input mappings, and submit

### Tools used in this flow

- Python Tool
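The grading and aggregation described above can be sketched in plain Python. This is a minimal illustration, not the flow's actual source: the function names, the case-insensitive comparison, the two-decimal rounding, and the `log_metric` stand-in (which only writes to a dictionary) are all assumptions made for the example. In a real evaluation flow, the metric would be recorded through the metric-logging facility available to the Python tool.

```python
from typing import Dict, List


def grade(groundtruth: str, prediction: str) -> str:
    """Point-wise step: grade one prediction against its groundtruth label."""
    # Assumption: labels match when equal after trimming whitespace and ignoring case.
    same = groundtruth.strip().lower() == prediction.strip().lower()
    return "Correct" if same else "Incorrect"


def log_metric(name: str, value: float, store: Dict[str, float]) -> None:
    """Hypothetical stand-in for a metric logger; it only records into a dict."""
    store[name] = value


def calculate_accuracy(grades: List[str], store: Dict[str, float]) -> float:
    """Aggregation step: turn the per-row grades into a single accuracy metric."""
    if not grades:
        raise ValueError("cannot compute accuracy over zero grades")
    accuracy = round(grades.count("Correct") / len(grades), 2)
    log_metric("accuracy", accuracy, store)
    return accuracy


if __name__ == "__main__":
    # Illustrative (groundtruth, prediction) pairs, not real test data.
    rows = [("cat", "Cat"), ("dog", "cat"), ("bird", "bird"), ("dog", "dog ")]
    grades = [grade(g, p) for g, p in rows]
    metrics: Dict[str, float] = {}
    calculate_accuracy(grades, metrics)
    print(grades)
    print(metrics)
```

Keeping the per-row grade separate from the aggregate mirrors the point-based structure of the flow: the grade runs once per data row, while the accuracy calculation runs once over all grades.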

Version: 1

View in Studio: https://ml.azure.com/registries/azureml/models/classification-accuracy-eval/version/1

Properties

is-promptflow: True

promptflow.section: gallery

promptflow.type: evaluate

promptflow.name: Classification Accuracy Eval

promptflow.description: Measuring the performance of a classification system by comparing its outputs to groundtruth.

inference-min-sku-spec: 2|0|14|28

inference-recommended-sku: Standard_DS3_v2
