Skip to content
This repository has been archived by the owner on Jul 13, 2023. It is now read-only.
/ capacitor-ocr Public archive
forked from bendyworks/cap-ml

Text Detection Capacitor Plugin

License

Notifications You must be signed in to change notification settings

dtarnawsky/capacitor-ocr

 
 

Repository files navigation


Image To Text

@capacitor-community/image-to-text

Capacitor plugin for image to text processing using Apple's Vision Framework for iOS and MLKit's Vision Framework for Android..

Credits

This project was forked from the Cap ML plugin written by Vennela Kodali. It was refactored and converted to Capacitor 4.

Installation

npm install @capacitor-community/image-to-text

Usage

There is one method detectText that takes a filename of an image and will return the text associated with it.

Add the following to your application:

import { Ocr, TextDetections } from 'capacitor-ocr';

...

const data: TextDetections = await Ocr.detectText({ filename: '[get-filename-of-image-jpg]' });
for (let detection of data.textDetections) {
    console.log(detection.text);
}

The above code will convert the image file and console.log the text found in it.

Example with Camera

You can use the @capacitor/camera plugin to take a photo and convert it to text:

import { Camera, CameraResultType, CameraSource } from '@capacitor/camera';
import { Ocr, TextDetections } from 'capacitor-ocr';

...

const photo = await Camera.getPhoto({
   quality: 90,
   allowEditing: true,
   resultType: CameraResultType.Uri,
   source: CameraSource.Camera        
});

const data: TextDetections = await Ocr.detectText({ filename: photo.path });

for (let detection of data.textDetections) {
    console.log(detection.text);
}

A full sample application can be found here.

iOS Usage

No additional setup is required to use this plugin in a iOS Capacitor project.

Android Usage

Your project must include a google-services.json file stored in the Android project folder (usually android/app).

Create Firebase Project and App

  • Sign in to console.firebase.google.com
  • Click on Add Project and follow through the steps.
  • Click the Android icon to create an android app.
  • Enter the Package Name which must match your apps package name (You can find it in android/app/AndroidManifest.xml).
  • Click Register App
  • Download google-services.json and save into your project's android/app directory.

Add Firebase SDK

The sample project has this in place in its build.gradle (see here as a reference).

Note: Most starter Capacitor projects are preconfigured to load google-services.json.

API Reference

detectText(...)

detectText(options: DetectTextOptions) => Promise<TextDetections>

Detect text in an image

Param Type Description
options DetectTextOptions Options for text detection

Returns: Promise<TextDetections>


Interfaces

TextDetections

Prop Type
textDetections TextDetection[]

TextDetection

Prop Type
bottomLeft [number, number]
bottomRight [number, number]
topLeft [number, number]
topRight [number, number]
text string

DetectTextOptions

Prop Type
filename string
orientation ImageOrientation

Enums

ImageOrientation

Members Value
Up "UP"
Down "DOWN"
Left "LEFT"
Right "RIGHT"

Compatibility

Images are expected to be in portrait mode only, i.e. with text facing up. It will try to process even otherwise, but note that it might result in gibberish.

iOS and Android are supported. Web is not.

Feature ios android
ML Framework CoreML Vision Firebase MLKit
Text Detection with Still Images Yes Yes
Detects lines of text Yes Yes
Bounding Coordinates for Text Yes Yes
Image Orientation Yes (Up, Left, Right, Down) Yes (Up, Left, Right, Down)
Skewed Text Yes Unreliable
Rotated Text (<~ 45deg) Yes Yes (but with noise)
On-Device Yes Yes
SDK/ios Version ios 13.0 or newer Targets API level >= 16
Uses Gradle >= 4.1
com.android.tools.build:gradle >= v3.2.1
compileSdkVersion >= 28

License

Hippocratic License Version 2.0.

For more information, refer to LICENSE file

About

Text Detection Capacitor Plugin

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Swift 41.9%
  • Kotlin 19.6%
  • Java 14.0%
  • TypeScript 8.1%
  • Ruby 7.5%
  • Objective-C 5.2%
  • JavaScript 3.7%