Diablo IV Item OCR Library - README.md

Introduction

This library aims to provide developers with an easy-to-use Optical Character Recognition (OCR) system, specially designed for recognizing and interpreting in-game items in Diablo. Built using Node and the power of tesseract.js, this library is perfect for browser-based applications.

Features

Built using Node and tesseract.js
Works seamlessly in the browser.
Supports multiple languages for a broad range of players.
Accounts for varying tooltip sizes, making it versatile.
Designed to recognize items even under different color blind settings.
Converts the recognized item into a structured JSON representation, in line with our game data packages.

Requirements

1. System & Environment

Node.js (Version XX or higher)
A modern web browser (Chrome, Firefox, Safari, etc.)

2. Game Data Packages

Make sure the game data packages are available and up-to-date. This ensures accurate JSON representation.

Links

Discord support channels

#project-discussion

#project-forum

Sanctuary Team projects

Installation & Setup

NOTE: These steps are an example as of 8/24/2023. Official steps be updated in the future.

Install the package
```
yarn install
```

Include in your project

const diabloOCR = require('diablo4trading-ocr');

Setup game data packages

Ensure the game data packages are located in a reachable directory. Refer to the config documentation section for specifying the path.

Usage

Initiate the OCR process

let itemImage = document.getElementById('item-image');  // Get your item image

diabloOCR.recognize(itemImage).then(data => {
    console.log(data);  // JSON representation of the item
});

Language & Settings Configuration

The library supports easy configuration for different languages, tooltip sizes, and color blind settings. Refer to the config documentation section for a detailed guide.

Processing Strategy

Image Submission Image is passed to the library.
Post Processing Any required post-processing of the image is done to enhance readability and recognition.
Edge Detection Attempt to detect the edges of the tooltip to determine the boundary of relevant content.
Bullet Point Detection Detect bullet points on the image. This helps in identifying item properties and features.
Image Breakdown The image is broken down into smaller pieces when required. This strategy assists in dealing with complex affixes and ensuring accurate recognition.
Text Scanning Scan the text content of each image segment and any remaining pieces.
Language Determination Determine the language of the recognized text to ensure proper translation and representation.
Item Property Extraction After all previous steps, item properties are determined and extracted for further use.

Essential Dependencies

diablosnaps

tesseract.js

Game Data Packages Dependency

For accurate JSON representation, this project depends on packages from the repository diablosnaps. While these packages are essential for the OCR system, there are a few considerations to be made:

Maintenance Concerns: With the evolution of Diablo IV — encompassing new game features, seasons, and expansions — the data from diablosnaps must remain current. Yet, the sustained upkeep of this repository isn't guaranteed.

Forking & Alternatives: In the event that the primary repository becomes inactive or outdated, the community may opt to fork it or search for other alternatives. Such actions, however, might bring forth new challenges, such as maintaining the fork or smoothly integrating alternate data sources.

Risks: Depending on an external repository carries inherent risks. Alterations or discontinuations in the dependency could impact this OCR library's functionality.

Proposed Strategy & Outreach:

With release of new Diablo IV features, season mechanics, affix's, item names, item types... etc. Montiroing diablosnaps repository for signs of outdated data.

Test Coverage

Strategy:

Our approach to ensuring the accuracy and reliability of the Diablo Item OCR Library hinges on comprehensive test coverage. Here's a step-by-step breakdown of our test coverage strategy:

Executing tests: We are using jest for our unit test frame work, current test scripts are listed below.
- test:unit: this command will trigger the unit test suite

   yarn test:unit

test: this command will be an e2e test triggering all test suites currently only triggering test:unit

   yarn test

test:coverage: generates coverage will report to console and html reports into a coverage folder.

  yarn test:coverage

Fixture Data Collection:
- We maintain a curated set of fixture data, comprised of a series of known item images. These images are representative of different scenarios, languages, tooltip sizes, and color blind settings to ensure our library's broad applicability.
Automated Testing:
- For each image in the fixture data set, we have an associated known output (i.e., a predetermined JSON representation of the item).
- During testing, our test cases loop through each image in the fixture data, process it using the OCR library, and then generate an output.
Asserting Accuracy:
- The generated output is then compared or "asserted" against the known output. Any discrepancies are flagged, allowing us to pinpoint potential issues in the OCR recognition or processing.
Continuous Integration:
- To maintain the highest level of reliability, these tests are routinely run, especially when any changes or updates are made to the library. This ensures that any modifications do not introduce regressions or unexpected behaviors.

Contribution to Tests:

If you come across any in-game items that you believe would make valuable additions to our fixture data set, please consider contributing! It helps us to continually refine and enhance the accuracy of the library.

Contributing

Feel free to fork, modify, and send in pull requests. The main aim is to enhance the library's accuracy and support for various in-game items. Check the CONTRIBUTING.md file for detailed guidelines.

Support

For any issues or feature requests, open an issue on GitHub.

License

MIT License. See the LICENSE file for more details.

Thank you for choosing and supporting the Diablo Item OCR Library. Happy coding! 🎮🔥

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
src		src
test		test
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
dprint.json		dprint.json
jest.config.ts		jest.config.ts
package.json		package.json
tsconfig.json		tsconfig.json
yarn.lock		yarn.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Diablo IV Item OCR Library - README.md

Introduction

Features

Requirements

1. System & Environment

2. Game Data Packages

Links

Installation & Setup

Usage

Processing Strategy

Essential Dependencies

Game Data Packages Dependency

Test Coverage

Strategy:

Contribution to Tests:

Contributing

Support

License

About

Releases

Packages

Contributors 3

Languages

License

wenqu/diablo4trading-ocr

Folders and files

Latest commit

History

Repository files navigation

Diablo IV Item OCR Library - README.md

Introduction

Features

Requirements

1. System & Environment

2. Game Data Packages

Links

Installation & Setup

Usage

Processing Strategy

Essential Dependencies

Game Data Packages Dependency

Test Coverage

Strategy:

Contribution to Tests:

Contributing

Support

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages