GitHub - expertspec/profanity-predictor

Solution profanity-predictor is designed for the task of real-time profanity prediction based on the multimodal (audio and textual channels of the speech) analysis.

Description

The proposed pipeline allows for working with a sound stream in a standby fashion. It transforms the signal to MFCC to deal with the audio channel's information and process ASR to extend the set of features with previous word labels. The prediction model is the LSTM with attention layers.

Installation

Clone repository:

git clone https://github.com/expertspec/profanity-predictor.git

Install all dependencies from requirements.txt file:

pip install -r requirements.txt

How to Use

/profanity-predictor
    ├── assets  # Images for readme
    ├── data
    │   ├── banned_words.txt
    │   └── test_records
    ├───src         # Executive files
    │   ├───features        # Scripts for features extraction
    │   │   └───tools
    │   ├───models          # Models's architecture and tools for usage
    │   └───preprocessing   # Scripts for dataset preporation
    └───weights     # Folder for model's weights

It is possible to download test records for quick start.

Default weights for prediction model can be download here

Run inference for prediction on the samples from test records

$  python3 data_inference.py ./data/test_records --device cpu

It is also possible to specify arguments "--path_to_banned_words" and "--weights"

Run inference for working with speech stream

$  python3 stream_inference.py

Dataset

The dataset is available here

Article

Multimodal prediction of profanity based on speech analysis

Backlog

[x] Initial inference for test data
[x] Real-time implementation
[ ] Examples
[ ] Tests

Supported by

Funding research project No. 622279 "Development of a service for assessing the validity of expert opinion based on dynamic intelligent analysis of video content".

Citation

@software{expertspec,
    title = {profanity-predictor},
    author = {Smirnov, Ivan},
    year = {2023},
    url = {https://github.com/expertspec/profanity-predictor},
    version = {0.0.1}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Description

Installation

How to Use

Dataset

Article

Backlog

Supported by

Citation

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
assets		assets
data		data
src		src
weights		weights
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.rst		README.rst
data_inference.py		data_inference.py
requirements.txt		requirements.txt
stream_inference.py		stream_inference.py

License

expertspec/profanity-predictor

Folders and files

Latest commit

History

Repository files navigation

Description

Installation

How to Use

Dataset

Article

Backlog

Supported by

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages