Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

UC: Category Optimization #65

Open
Tracked by #61 ...
jawache opened this issue Jun 5, 2024 · 1 comment
Open
Tracked by #61 ...

UC: Category Optimization #65

jawache opened this issue Jun 5, 2024 · 1 comment
Assignees
Labels
Action Item documentation Improvements or additions to documentation

Comments

@jawache
Copy link
Contributor

jawache commented Jun 5, 2024

Since a rating is linked to a category of products, and the selection of which category a product lives in is mostly subjective (is AirTable a Database or a Spreadsheet? Is Grammarly an AI Copilot or a Word Processor?), you might re-categorize your product into one where your rating for the same score is more favorable.

This is also challenging. How do we decide the buckets without information? New products come on the market with overlapping feature sets, which makes comparison very challenging.

Equivalence

  • Publishing a book on Amazon under an obscure category so you can claim to be top of the charts with minimal sales.
  • This is an area where there is a big difference between the digital and physical worlds. It's very hard to claim a Fridge Freezer is an Oven, so Energy Star does not have this problem. The categorization of a software product can be a very subjective decision. Most organizations spend significant time trying to differentiate from their competitors, so they are well placed to put forward arguments for recategorization or category creation.

Counter

  • First gather data through encouraging transparent reporting. If the label was initially just a record of transparency, like a food ingredients label, with no ratings then there is no need for categorization. Once we have a large body of public, open, transparent data, we can find useful clusterings, and perhaps other forms of categorization will surface once we have the data.
  • Categorize based on objective traits rather than features.
  • Categories directly linked to Functional Units. For instance if in the LLM space the Functional Unit is "Prompt" as per the SCI for AI specification, then moving categories to say Word Processor would change the Functional Unit to perhaps "Character".
@chrisxie-fw
Copy link
Contributor

This is related to the definition of the 1st step of the SCER specification: Categorization - what does Categorization mean? What constitute a software category? What are the key aspects or components that a category of software or AI models is composed of, so that apples are compared with apples, not oranges? The base SCER spec identified a number of key aspects/components of a software categorization, such as purpose, function, platform, end user. The SCER for LLMs spec identified a number of aspects as well, such as model type (text, image, voice, etc), parameter size (3b, 7b, etc), use/tasks (multi model etc).

The implementers of SCER framework may decide what a category will concretely be. For example, in the case of greencoding.ai, users can choose which LLMs to issue the prompt to, and a SCER rating/labeling can be returned based on all the prompts the user has issued to whatever types of LLMs (llama2, llama3, mistral, gemma, tinyllama etc).
image

@seanmcilroy29 seanmcilroy29 mentioned this issue Jul 3, 2024
18 tasks
@seanmcilroy29 seanmcilroy29 mentioned this issue Aug 7, 2024
18 tasks
@seanmcilroy29 seanmcilroy29 mentioned this issue Sep 4, 2024
23 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Action Item documentation Improvements or additions to documentation
Projects
None yet
Development

No branches or pull requests

4 participants