Skip to content

Latest commit

 

History

History
976 lines (794 loc) · 53.4 KB

GovoritelBG-ss_an-idea-for-a-Bulgarian-language-speech-synthesis-software-app-program_multiplatform+WebUI.md

File metadata and controls

976 lines (794 loc) · 53.4 KB

📝 -> 📢 🔉 👄 Original idea for an open-source CLI/GUI/Web speech-synthesis app for computer pronunciation of texts in the contemporary Bulgarian language

Оригинална идея за програма от типа гласов/речев(и) синтезатор, за компютърно произношение на текстове, написани на съвременния български език, с лиценз на свободен софтуер с отворен код (и с платена версия с малко добавки).

Идея за GovoritelBG-ss (c) 2018-2019 sahwar (github.com/sahwar/, twitter.com/ve4ernik, Иван И. Курдов) [email protected] / [email protected] и други.

It is still rough around the edges as an idea but the core app idea&data-structures are here in this document (while also being in continous revision, further tweaking/details-finetuning & development).

Tags/Keywords/Ключови думи: speech synthesis engine, Bulgarian-language speech synthesis engine and speech-synthesizer app, гласов синтезатор, речев синтезатор, преобразуване на писмен текст в компютърно произнесена (изговорена) реч, гласово възпроизвеждане на текст на български, "От текст към говор" (TTS), текст към реч чрез компютърна програма, computer text-to-speech synthesis, програма свободен софтуер с отворен код, посричково произнасяне, дифони, IPA, МФА, extIPA, X-SAMPA, Z-SAMPA, GitHub, espeak, MBROLA, original research, syllabification algorithms, contemporary Bulgarian syllable patterns, experimental application software, assistive software for blind and sight-impaired people, social-care software for disabled people; ... tts/TTS/T2S (Text-to-speech, speech synthesis), Speech Recognition, Voice Activity Detection, Natural Language Processing (NLP; not to be confused with neuro-linguistic programming (NLP)!), voice/speech assistant.

Third-party data to use in GovoritelBG-ss: Using the new Bulgarian-language syllabification (syllable-separation) algorithm (better than the old one from TeX BG): http://mirror.ctan.org/language/hyph-utf8/tex/generic/hyph-utf8/patterns/tex/hyph-bg.tex https://t.co/m4rPDuWQVD?amp=1 https://www.ctan.org/tex-archive/language/hyph-utf8 https://github.com/hyphenation/tex-hyphen (and also the old TeX algorithm for Bulgarian-language syllabification!) and syllable-patterns for the contemporary Bulgarian language taken from my General Linguistics class at university and generously supplied by my lecturer Assistant Boryan Yanev (pasted below for convenience), etc.

Theoretical foundations of this Bulgarian-language TTS (text-to-speech) engine/app:

Approximate letter-to-IPA-phoneme correspondence table for the contemporary Bulgarian language (for Standard [literary] Bulgarian) (the total number of phonemes is considered to be 45–48; add them to this file: https://github.com/sahwar/Bulogos/blob/master/diphones-combinations-bg.txt ):

Vowel phonemes (theoretically considered to be 8 in total — /i/, /u/, /o/, /ɤ/, /ɛ/, /ɔ/, /ɐ/, /a/):

(Letter—phoneme) и /i/, (/ɪ/) у /u /; /o/ е /ɛ/ ъ /ɤ̞/, /ɤ/; /ɐ/; ( (/ʌ/), (/ə/) ) о /ɔ/, /o/ а /a/, /ɐ/; /ɑ/, /ʌ/, /ɤ̞/; (/ʌ/), (/ə/)

ю /ju/ (at syllable-start), /ʲu/; /jo/, /ʲo/ я /ja/ (at syllable-start), /ʲa/; /jɐ/, /ʲɐ/; /jɤ̞/, /ʲɤ̞/

( Ѫ (голям юс, голяма носовка) /он/, /ън (Old Bulgarian); /ɤ/, /ɤ̞/ (as a now-obsolete letter found in pre-1945 Bulgarian orthography in Bulgarian writings) )

Consonant phonemes (WARNING: depending on stress position and position within a word, the letters represent the voiceless IPA phoneme of the voiced—voiceless pair, even though the orthographic letter is the same):

м /m/ н /n/ мь /mʲ/ нь /ɲ/ п /p/ б /b/ т /t/ д /d/ к /k/ г /ɡ/ пь /pʲ/ бь /bʲ/ ть /tʲ/ дь /dʲ/ кь /kʲ/ гь /ɡʲ/ ц /t͡s/ дз /d͡z/ ч /t͡ʃ/ дж /d͡ʒ/ ць /t͡sʲ/ ф /f/ в /v/ с /s/ з /z/ ш /ʃ/ ж /ʒ/ х /x/ фь /fʲ/ вь /vʲ/ сь /sʲ/ зь /zʲ/ р /r/ рь/rʲ/ й /j/ (semi-vowel, considered a 'consonant', e.g. C in CVC... syllable pattern/model notation) л, /l/, /ɫ/ ль /ʎ/

Example approximate IPA transcriptions of some Bulgarian words and multiword expressions (MWE, phrases):

жена̀ /ʒɛˈnɑ/ вода̀ /voˈdɑ/ ста̀вам /ˈstɑvʌm/ бра̀ва /ˈbrɑvʌ/ мъжа̀ /mɐˈʒɤ̞/ чета̀ /ʧɛˈtɤ̞/ зѐле /ˈzɛlɛ/ пѐсен /ˈpɛsɛn/ син /ˈsin/ ѝстина /ˈistinɐ/ ко̀тка /ˈkɔtkɐ/ боза̀ /boˈza/ сту̀д /ˈstut/ уста̀ /oˈsta/ ъ̀гъл /ˈɤ̞gɐl/ ка̀мък /ˈkamɐk/ ю̀жен /ˈjuʒɛn/ дю̀ля /dʲˈulʲa/ ютѝя /joˈtija/ кюфтѐ /kʲoftˈɛ/ господа̀рю /gospoˈdarjo/ я̀то /ˈjato/ бя̀л /ˈbʲal/ ярѐм /jɐrˈɛm/ ба̀мя /ˈbamʲɐ/ деня̀т /dɛˈnjɤ̞t/ вървя̀ /vɐrˈvjɤ̞/ (To add more later...)

0. App name

$ GovoritelBG-ss (SS = SpeechSynthesis)

GlashatayBG-ss

1. General idea

(c) 2018-2019 sahwar (github.com/sahwar/, twitter.com/ve4ernik, Ivan I. Kurdov) [email protected], et al.

(THE MOST IMPORTANT PART)

(to add from my rough research-paper .txt files from my other PCs & my related tweets from my Twitter account...)

For now, see the short summary from here before I upload the full explanation of the idea from my .txt files: https://twitter.com/search?l=&q=speech+synthesis%20from%3Ave4ernik&src=typd https://twitter.com/search?l=&q=%D1%81%D1%80%D0%B8%D1%87%D0%BA%D0%B8%20from%3Ave4ernik&src=typd https://twitter.com/search?l=&q=%D1%81%D1%80%D0%B8%D1%87%D0%BA%D0%B8%20from%3Ave4ernik&src=typd

I've now added a more thorough explanation below (This IS NOT the same as in my .txt files, that is pasted in the other Roman-numeral sections after I.!!!).

I. 7 core speech-synthesis subsystems/'engines' included with govoritelbg-ss:

1) An engine which parses input text, applies a syllable-separation algorithm (several flavors to select from via --some-argument/flag-here option, one of them is this great syllable-seperation method ), then matches the separated syllables to a JSON-file linking to human-speaker-pronounced speech-segment-recordings of the most-common syllables of the contemporary Bulgarian language, then concatenates (merges) all the text's syllables' native-human-speaker-pronounced syllable-recordings pre-supplied-by-the-app audio-files via ffmpeg/libav and generates the output file with the pronunciation which is then played to speakers (DEFAULT engine!).

NOTE0: To support multiple languages & speaker-voice datasets, there will be more --arguments to select language and speaker-voice dataset before generating&playing the output audio-file, but the Bulgarian language is the default one to be used for the app. Also note that there will be several configuration files (JSON/.config/.conf/.cfg/YAML/custom-txt-based formats) like those of the NVDA screen-reader app bundled with the app, and with a custom/user copy of those for the user to fine-tune the app before running it if the user doesn't like the default settings&configs of the app.

2) Same as 1 above but instead of syllable concatenation, we use an MBROLA-based diphones human-speaker-voice audio-recordings files being concatenated (merged).

3) Same as method 2 above but with syllable concatenation (&whitespace/pauses-between-words/strings) applied before the concatenation of MBROLA(-like) diphones.

4) Same as method 1, but we use speaker-voice dataset(s) which are human-speech recordings of the IPA phonemes (X-SAMPA/Z-SAMPA/IPA/extIPA list of the phonemes, in a JSON file(s) listing all these phonemes and their recording audio-files), with the speaker-voice datasets being at least 3-5 supplied to choose from (a few others will be in the commercial/paid version of the app).

5) Some computer-generated worse-sounding speech-synthesis which is programmatic-only (WITHOUT using better-sounding concatenation of pre-recorded human-voice datasets of syllables/phonemes/whole words/etc.). See for ideas: the Toshko2 Bulgarian-speech-synthesis app.

6) A mixture of the above methods with the additional power of sorta-even-better-sounding generated results via the methods outlined in NOTE1 below.

7) (some other speech-synthesis engine/subsystem to be freely supplied/plugged-in/integrated into the ap as supplied by the end-user).

NOTE1: There will also be a --argument to use several dictionary datasets of pre-recorded human-speaker-voice-pronounced speech audio-file recordings of whole-words & whole-phrases for even better-sounding output audio-file generated by this app (a user can also change (replace) those files OR supply a new, user-generated (manually recorded&edited) dictionary of such speaker-voice datasets.

NOTE2: One of the config files will have NVDA-like JSON-based instructions on how to pronounce punctuation and basic post-generation speech-filters for e.g. speech-quality shifting for emphasis, questions, exclamations, quotations within the text, Markov-chain audio-filters, Fast Fourier Transform audio-filters, pitch, sound volume, sound tone, pronunciation speed (and related Audacity-like simple plug-able filters), etc.).

NOTE3: The end-user may produce his/her own language&specific human-speaker-voice audio-recodings speech-segments (&hundreds-of-files-or-more dictionary of whole-word&whole-phrase audio-file recordings) to use with the app via another --argument if they don't like the default&supplied human-speaker(s) dataset audio-files used in the concatenation for the generation of the final output speech-synthesis audio-file.

NOTE4: There will also be another --argument which enables the end-user to use several further JSON-config files and thus apply additional fine-tuning language-specific & source-text-file-specific audio-filters (and whole-word & whole-phrase human-recordings) based on regex matches and matches of language/dialect-specific specific substring/string language segments.

PROGRAMMING LANGUAGE(S) FOR WRITING THE APP: Rust, C/C++, Python3, Lua, Go, bash shell-scripting (for ffmpeg/libav use?), Perl+PCRE, Vala, FreePascal/Delphi/LazaursIDE, etc., even PHP is sorta OK-ish...

INPUT FILES: .txt and other text formats (maybe use pandoc and HTML5-parser plug-in for cleanup-to-plaintext, or maybe also have an option to change pronunciations based on those substring or string tags...).

OUTPUT FILES: .wav, .mp3, .ogg, .m4a, .flac

TO CONSIDER: All pre-recorded human-voice-speaker dataset recordings files SHOULD be in the same audio fileformat, with the SAME basic audio-file characteristics (e.g. 44kHZ, 20Hz-20kHz, 8-bit/16-bit/32-bit floating, lossless or lossy filetype/recording, with those of similar type being of approximately the same short audio-duration/length). All audio-files of each dataset should be those of the voice/speech of a single human (with basic bio-sex&bio-age&approx. geographical-dialect characteristics information) for optimal results!!!

SUMMARY: The main original idea is the concatenation (via open-source ffmpeg/libav) of bundled (supplied with the app) pre-recorded audio-files containing recordings of real speech segments of human-speaker(s)' real voice (in a normal/'flat' tone, maybe also add --argument for a sub-speaker-voice dataset of a different speech quality), with (optional/mandatory) syllable-segmentation-algorithm(s) parsing done beforehand, and with optional post-generating speech audio-filters (e.g. language-specific orthography-to-orthoepy differences, etc.) applied & outputted to a second generated resulting audio-file.

Characters-to-sounds per-language correspondence rulesets, NVDA/SpeechLab/etc.-like configs/settings, voice-qualities adjustable outputting audio settings, etc.

Syllables (syllable patterns (CVV, VCV, etc. (Bulgarian syllable patterns taken from my General Linguistics class at university and generously supplied by my lecturer Assistant Boryan Yanev) ) with all combinations of all corresponding language-specific phonemes - forming the syllable), diphones (~2 phones flowing together in connected speech), and (ext)IPA/X-SAMPA/Z-SAMPA phomenic/phonological phonemes (approximatly matched to lang-specific charset subset of Unicode UTF-8); plus whole-word and whole-phrase human pronunciation audio-files, and human pronunciation audio-files for punctuation characters (and their prosodic audio-filters effects?) and maybe the whole Unicode character set(?)...

AIMS/GOALS OF THIS APP: This is all in order to produce SOMEWHAT better-sounding speech-synthesis results than those of entirely programmatically-generated speech-synthesis engines which sound too robotic. The whole app $ govoritelbg-ss is about SIMPLE, QUICK & DIRTY speech synthesis of texts (mostly in the contemporary Bulgarian language with with ~45 phonemes in its orthoepy standardized contemporary variant which is parallel to the standard contemporary (semi-)formal written Bulgarian).

Making a speech-synthesis engine that emulates the whole human tract/speech organs is too cumbersome and complex, while speech-synthesis engines which are entirely programmatic-only sound TOO robotic, while we are aiming at a more natural-sounding result which is UNDERSTANDABLE to native-language human listeners (despite still being sorta/somewhat robotic/strung-together-sounds-sounding).

Other existing speech-synthesis engines (including for contemporary Bulgarian speech) use sorta similar techniques and algorithms, BUT NOT exactly as this one & are somewhat limited for our use (CLI&GUI-for-Linux-Windows-macOS & Web & Android) and ARE NOT open-source, that's why we're making this one!!! GovoritelBG-ss may find use by Google for its Google Translate website (for the Bulgarian speech-synthesis core(s)/feature).

WE ARE LOOKING FOR PROGRAMMERS TO CODE THIS APP with collaboration with the author of the idea for this speech-synthesis app!! Please apply now to this potentially very-handy&useful open-source software project!

II. (to add by pasting here from my twitter.com/ve4ernik tweets about this project...)

III. (to add by pasting here from my numerous original-research & general-ideas .txt files about this project...)

IV. (...)

V. (...)

...

2. Features

  • CLI version: supply a text input file, add arguments/flags, output the result to an .mp3/.ogg/.flac/.wav file, play the generated audio-file result; built-in manual: man govoritelbg-ss, govoritelbg-ss --help (in English AND Bulgarian), govoritelbg-ss --version (-v) (version information), govoritelbg-ss --update (-U) (check for newer version and update the app if one is available), etc. commandline arguments and config (sub)options.

  • GUI version: same as CLI but with GUI controls

  • Web-version: similar to the GUI version but simplified

  • Android version: similar to the desktop GUI version but also adding these additional features: (1) take photo (or supply image from phone memory or URL address) and process with the open-source tesseract-OCR engine, then play the OCR'd result as BG speech-synthesized output audio-file; (2) visual&audio-recorded (language-specific) warning for when the Android device's battery level is running low; (3) option to import your own speaker-pronunciation dataset files for use with the app;

3. Target platforms

  • GNU/Linux (CLI&GUI), Windows (CLI&GUI), macOS (CLI&GUI), HTML5 Web-interface (web-browsers, cross-platform), Android (GUI)

4. Target end-users

  • Blind or sight-impaired Bulgarians, the general public, tests for writers, editors, proofreaders/spellcheckers/copywriters, and reseachers; also as a toy for demonstrating speech-synthesis to children and the general public.

5. License

  • open-source: BSD-3 (modified) or MIT, commercial for a few bucks = additional speaker pronunciation packs (aside from the free&open-source bundled with the app)

6. Similar software (for inspiration / ideas / alternatives / collaboration)

It is compatible with standard text-to-speech interfaces on these platforms: SAPI5 on Windows, Speech Dispatcher on GNU/Linux and Android's text-to-speech APIs. It can also be used by the NVDA screen reader directly (the driver is provided by RHVoice itself).

Валидният набор от фонеми, които са съобразени с особеностите на българския
език е следния:
&*0
. ? , - : И Е Е/ А А/ Ъ Ъ/ О О/ У У/ Р Р/ Л Л/ Л//
М Н Й Ь Ю Я С Ш Ф Х З Ж В Ч Ц Щ Б Д Г П Т К
&*1

TTS: MaryTTS, eSpeak, gESpeaker, FestivalTTS, Merlin, Mimic, BalabolkaTTS STT (speech-to-text): CMU Sphinx, Julius, HTK, Kaldi SpeechLab2 (paid but cheap, for Google Android - available at the Google Play web-store) = SpeechLab1 - freeware, freely given to blind people, see:

https://www.slant.co/topics/9395/~open-source-speech-to-text-stt-engines

6.1. Post-2020 developments in Bulgarian-language speech-synthesis software

  1. Google Translate introduced initial support for basic Bulgarian-language speech-synthesis BUT, sadly, they probably didn't use any of the R&D by me ( github.com/sahwar/Bulogos/GovoritelBG-ss... or Toshko2 or gespeaker/MBROLA-Bulg-BG, etc. ... Here's a demo(nstration): https://translate.google.com/?sl=bg&tl=en&text=%D0%A2%D0%BE%D0%B2%D0%B0%20%D0%B5%20%D1%82%D0%B5%D1%81%D1%82%20%D0%B7%D0%B0%20%D0%B8%D0%B7%D0%B3%D0%BE%D0%B2%D0%B0%D1%80%D1%8F%D0%BD%D0%B5%20%D0%BD%D0%B0%20%D0%BD%D0%B0%D0%BF%D0%B8%D1%81%D0%B0%D0%BD%D0%B8%D1%8F%20%D1%82%D0%B5%D0%BA%D1%81%D1%82%20%D1%87%D1%80%D0%B5%D0%B7%20%D0%BD%D0%BE%D0%B2%D0%B8%D1%8F%20%D0%B1%D1%8A%D0%BB%D0%B3%D0%B0%D1%80%D0%BE%D0%B5%D0%B7%D0%B8%D1%87%D0%B5%D0%BD%20%D0%B5%D0%B7%D0%B8%D0%BA%D0%BE%D0%B2%20%D0%BF%D0%B0%D0%BA%D0%B5%D1%82%20%D0%B7%D0%B0%20%D0%BC%D0%B0%D1%88%D0%B8%D0%BD%D0%BD%D0%BE-%D0%BA%D0%BE%D0%BC%D0%BF%D1%8E%D1%82%D1%8A%D1%80%D0%BD%D0%BE-%D1%81%D0%BE%D1%84%D1%82%D1%83%D0%B5%D1%80%D0%BD%D0%BE%20%D1%81%D0%B8%D0%BD%D1%82%D0%B5%D0%B7%D0%B8%D1%80%D0%B0%D0%BD%D0%B5%20%D0%BD%D0%B0%20%D0%B3%D0%BB%D0%B0%D1%81%D0%BE%D0%B2%D0%B0%20%D1%80%D0%B5%D1%87%20%D0%BD%D0%B0%20%D0%B1%D1%8A%D0%BB%D0%B3%D0%B0%D1%80%D1%81%D0%BA%D0%B8%20%D0%B5%D0%B7%D0%B8%D0%BA%20%D1%87%D1%80%D0%B5%D0%B7%20%D0%93%D1%83%D0%B3%D1%8A%D0%BB%20%D0%9F%D1%80%D0%B5%D0%B2%D0%BE%D0%B4%D0%B0%D1%87%20(Google%20Translate%20-%20http%3A%2F%2Ftranslate.google.com%20)!%0A%D0%97%D0%B0%20%D1%81%D1%8A%D0%B6%D0%B0%D0%BB%D0%B5%D0%BD%D0%B8%D0%B5%2C%20%D0%BD%D0%B5%20%D0%B8%D0%B7%D0%BF%D0%BE%D0%BB%D0%B7%D0%B2%D0%B0%20%D0%B8%D0%B4%D0%B5%D0%B8%20%D0%B8%20%D0%BF%D1%80%D0%B8%D0%BD%D0%BE%D1%81%D0%B8%20%D0%BE%D1%82%20github.com%2Fsahwar%2FBulogos%2FGovoritelBG-ss.md%20%D0%B8%D0%BB%D0%B8%20%D0%BE%D1%82%20Toshko2%20BG%20TTS%20%D0%B8%D0%BB%D0%B8%20%D0%BE%D1%82%20azbuki.bg%20%D0%B8%D0%BB%D0%B8%20ibl.bas.bg%20%D0%B8%D0%BB%D0%B8%20%D0%BE%D1%82%20NVDA%20%D0%B8%D0%BB%D0%B8%20%D0%BE%D1%82%20gespeaker%2FMBROLA-Bulg-BG%20%D0%B8%D0%BB%D0%B8%20%D0%BE%D1%82%20bacl.bg%2FSpeechLab2%20(Google%20Android)%20%D0%B8%D0%BB%D0%B8%20%D0%BE%D1%82%20Balabolka%20%D0%B8%D0%BB%D0%B8%20%D0%BE%D1%82%20Orca%20screen%20reader%20(Linux)%20%D0%B8%D0%BB%D0%B8%20%D0%BE%D1%82%20FestivalTTS%20(Linux)...%20%D0%9D%D0%B0%D1%82%D0%B8%D1%81%D0%BD%D0%B5%D1%82%D0%B5%20%D0%B1%D1%83%D1%82%D0%BE%D0%BD%D0%B0%20%D1%81%20%D0%B2%D0%B8%D1%81%D0%BE%D0%BA%D0%BE%D0%B3%D0%BE%D0%B2%D0%BE%D1%80%D0%B8%D1%82%D0%B5%D0%BB%D1%8F%20%D0%BE%D1%82%D0%B4%D0%BE%D0%BB%D1%83%2C%20%D0%B7%D0%B0%20%D0%B4%D0%B0%20%D1%87%D1%83%D0%B5%D1%82%D0%B5%20%D1%82%D0%BE%D0%B7%D0%B8%20%D0%B1%D1%8A%D0%BB%D0%B3%D0%B0%D1%80%D0%BE%D0%B5%D0%B7%D0%B8%D1%87%D0%B5%D0%BD%20%D1%80%D0%B5%D1%87%D0%B5%D0%B2%D0%B8%20%D1%81%D0%B8%D0%BD%D1%82%D0%B5%D0%B7%20%D0%BE%D1%82%20%D0%93%D1%83%D0%B3%D1%8A%D0%BB%20%D0%9F%D1%80%D0%B5%D0%B2%D0%BE%D0%B4%D0%B0%D1%87.&op=translate
  2. I chatted with Aleksandar Kamburov ( http://kamburov.net - author of the famous app AEDict.exe - AEnglishDictionary, similar to gbgdict from http://bgoffice.sourceforge.net ) and he shared with me that a new and very promising project by a talented young Bulgarian programmer was making headlines in the field of Bulgarian-language speech synthesis:
https://github.com/AzBuki-ML/
АзБуки.ML
AzBuki.ML е иновативна machine learning платформа - съвкупност от компютърни програми за обработка на българския език и слово.
 Bulgaria
 https://azbuki-ml.com
public-data
Public
Custom-built Bulgarian language data sets, used by АзБуки.ML for sentiment analysis, text classification, summarisation and generation. Open-source & free to use in any ML project.
Custom-built Bulgarian language data sets, used by АзБуки.ML for sentiment analysis, text classification, summarisation and generation. Open-source & free to use in any ML project.

azbuki-ml.com/
Topics
data-science machine-learning bulgarian data-set
https://github.com/AzBuki-ML/public-data

https://www.bing.com/search?q=bulgarian+speech+synthesis+google+play&qs=n&form=QBRE&sp=-1&pq=bulgarian+speech+synthe+google+play&sc=0-35&sk=&cvid=34F5AA3372BD4894A6CA863E1E1A193E
https://play.google.com/store/apps/details?id=com.skycode.tts&hl=en_US&gl=US
https://www.appbrain.com/app/bulgarian-text-to-speech-tts/com.skycode.tts

atodorov.org
  • вижте уебсайта на българина, написал статията „Как спасих правилния графичен дизайн на знака за валутата евро“ - май там имаше статия за иновативен речеви синтез на български език чрез роботски модел на човешкия речеви апарат (т.е. роботски модел на главата и устата (устния тракт и трахеята и т.н.) плюс опит за софтуерно моделиране на същото...
  1. Mozilla Common Voice & tatoeba.org & librivox.org & archive.org & storytel.bg - тонове записани с български гласове е-книги и отделни изречени...

7. (G)UI ideas from Librera Reader (for Google Android)

Some ideas borrowed from:...

TTS Engine: 
Language+Voice+male/female[+age+dialect/accent/geographic-location+datetime-stamp-year] pack:
Speed:
Pitch:
Volume:
Expressive breaks: ?? ms (e.g. 350) {and/or when you supply a .txt file, you can place SPECIAL IPA&orthography-altering-the-pronunciation HTML/XML/SGML/JSON/HAML/CommonMark tags + |like|this|+more| special regex/regexp that supply the wordform paradigm (словообразувателната парадигма + неотговарящи на нея различаващи се отделни словоформи, примерно речетатив като „пека /пекЪ/ -> пеЧен, изпечени, запичам, изпичам“ и т.н.!) ~added 01-May-2020 by https://github.com/sahwar/Bulogos }
Reading ends in: xxx hh yyy min zzz sec
[?] Resume from last punctuation mark: '[]()...'
Read punctuation in full: '[]()...'
Read unknown characters as their Unicode UTF-8 hexadecimal point number+title: yes/no (language for pronunciation = ?)
Break at page turn / break after number of lines: ...
Pause TTS-reading on incoming calls, messages, SMSs, etc.
Pause TTS-reading on incoming new Android notifications (DANGEROUS!): ...
Enable remote bookmarks (Start/Stop button): ...
Record TTS from whole file: Start / Pause / Cancel
{ + exact-frequency-or-frequency-span-band speech-synthesizer sound-generator app! Frequencies below 7.9Hz(Earth's natural Schummann resonance)-20 Hz are HARMFUL to humans and above 20kHz could be deadly to humans; these are used by psychotronics weapons/devices like HAARP, GWEN towers, number(s) stations, military/police LRAD & V-MADs (there are YouTube videos about those actually existing!), 5G towers, possibly by Elon Musk's Starlink satellites (unless those satellites are just for beaming cheap/free Internet to remote areas, and for military/GoogleEarth/GoogleMaps/openstreetmap.org geo-spying/surveillance/observation/&MIA-rescuing-retrieval-missions, and for laser-cutting&destroying/crumbling/deflecting big asteroids which pose a threat to planet Earth (also by using atomic bombs & hydrogen bombs (even of newer designs!)! HOWEVER, if these satellites are used to do electronic electromagnetic harassment, heatwave-torturing, mind-control remote brain-scanning/earworm-putting-audio/video-playback-for-MKULTRA/Monarch-mind-control consciousness/brain/neural-system-harming or for laser-incineration of people or for destroying objects on Earth via lasers or bombs, then ve4ernik/sahwar does NOT support said Starlink satellites...}

(Other app names considered:

IzrechitelBG-ss, GlasitelkoBG, GlasnostBG-ss, GlasoveBG-ss, GlasilkaBG-ss, HortuvayBG-ss, QSDIzrechilkaBG_SpeechSynth (QSD='quick, simple, and dirty), IzrechilkaBG, ProiznositelBGSpeechSynth, Govoritel(che)BGSpeechSynth, IzgovoritelBG, HortuvayNaBG_SpeechSynth, fossBGSpeechSynth, IzgovoriNaBG_SpechSynth, IzrechilkoBG, KazhiBG, ProiznositelBG_SpeechSynth, BGSpeechSynth, BulgSynth, etc.)


https://www.google.com/inputtools/help/languages.html - Google Translate IME Input Tools + handwriting-recognition sketchpad-panel
https://www.google.com/inputtools/try/
https://www2.eecs.berkeley.edu/Research/Projects/CS/vision/shape/sc_digits.html ( http://archive.is/O8kL0 )
http://shapecatcher.com/index.html ( http://archive.is/Ndf1l )

https://translate.google.com/intl/en/about/languages/
https://support.google.com/translate/#googtrans/en/bg&topic=7011755
https://translate.google.com/intl/en/about/
https://translate.google.com/intl/en/about/forbusiness/
https://cloud.google.com/translate/
https://translate.google.com/?hl=bg&tab=TT&authuser=0
'Dattebayo!' (Naruto Shippuuden catchprase often used by Naruto; it was translated as '... Believe it!' in the official Naruto English-dub for English-language TV; first released as https://horriblesubs.info/ ( https://horriblesubs.info/images/b/covid19.png ) & as http://dattebayo.com/ - it later emerged as official English-subtitles via paid web-streaming platform http://crunchyroll.com ): https://translate.google.com/?hl=bg&tab=TT&authuser=0#view=home&op=translate&sl=bg&tl=en&text=%D0%9C%D1%80%D0%B0%D0%B7%D1%8F%20%D1%82%D0%BE%D0%B2%D0%B0%20%D1%88%D0%B8%D0%B1%D0%B0%D0%BD%D0%BE%20%D0%B3%D0%BE%D0%B2%D0%B5%D0%B4%D0%BE!%20%D0%94%D0%B5%D0%B9%D0%B1%D0%B0%20%D0%B8%20%D0%B6%D0%B8%D0%B2%D0%BE%D1%82%D0%B0%2C%20%D0%B4%D0%B5%D0%B9%D0%B1%D0%B0!


Download & use Google Translate

You can translate text, handwriting, photos, and speech in over 100 languages with the Google Translate app. You can also use Translate on the web.
Computer AndroidiPhone & iPad

To translate text, speech, and websites in more than 100 languages, go to Google Translate page.

Предоставено от Google ПреводачПреводач

http://bezmonitor.com/ - <[email protected]> (Viktor) (useful information for blind Bulgarians working with computers - полезна информация за слепи българи, работещи с компютри: http://bezmonitor.com/news.htm , http://bezmonitor.com/other.htm , http://bezmonitor.com/speechlab.htm)
Вижте още: NVDA, Festival TTS, CMU Sphinx TTS, gespeaker/espeak/espeak-ng+MBROLA, FreeTTS, Balabolka TTS, JAWS screen reader (екранен четец), Toshko2 TTS <[email protected]>, Google Translate speech-synthesis, )
* https://github.com/zzw922cn/awesome-speech-recognition-speech-synthesis-papers
* https://github.com/topic/tts

https://gist.github.com/erogol/97516ad65b44dbddb8cd694953187c5b
Hands-on example for TTS https://github.com/mozilla/TTS & https://github.com/common-voice/common-voice (TTS datasets of human-voices recordings...) & http://tatoeba.org & http://dumps.wikimedia.org & http://dicts.info & http://dict.cc & http://rechnik.info + http://bgjargon.com & http://urbandictionary.org & more... + "БЪРБОРИНО" + gespeaker+MBROLA-for-Bulgarian + toshko2 TTS app + CMU Sphinx/Festival ; + notepad++ speak/TTS+DSpellchecker plugin; + google chrome: bgspeech2

This notebook trains Tacotron model on LJSpeech dataset.
https://gist.github.com/sahwar/8eae919103915aed718d03af75b0d1e7

https://archive.is/62c.hit.bg
https://yovko.net/initsiativa-shlokavitsa/

http://archive.is/BaQTS#selection-787.28-799.60
https://archive.is/o/BaQTS/zver.fsa-bg.org/pipermail/dict/2011-March/005151.html
---
регистрирах следния пощенски списък:
[email protected]
Можете да се запишете като изпратите писмо на:
[email protected] със subject ‘subscribe’
или от страницата на списъка:
http://www.freelists.org/list/espeak-bg
Всеки на когото му допада този проект и иска да се ангажира в развитието му е добре дошъл.
Списъка има и публичен архив чрез който всеки може да следи как се движат нещата.
Новите постове може да се следят и чрез RSS.

https://archive.is/o/BaQTS/fsa-bg.org/project/gtp/wiki/ToDo
http://archive.is/BaQTS
http://www.gatchev.info/blog/?p=1163
http://gatchev.info/articles/articles_bg.html

http://www.gatchev.info/blog/
http://www.gatchev.info/blog/?p=2302
http://www.gatchev.info/blog/?page_id=409
http://twenkid.com - toshko2









За TTS не съм сигурен, че има хляб вече.

Tosh (Арнаудов) от Пловдив правеше това:
https://github.com/Twenkid/Toshko_2
 
Има и един младеж, Радостин Чолаков, който е много активен по медиите:
https://azbuki-ml.com/speech
etc.

Mozilla Voice & http://tatoeba.org
https://github.com/common-voice/common-voice

http://librivox.org
http://bezmonitor.com
https://github.com/common-voice/common-voice
What is Common Voice project?
http://azbuki-ml.com (!!!) - пробвайте демото
http://www.numenta.org


Common Voice is Mozilla's initiative to help teach machines how real people speak.
Voice is natural, voice is human. That’s why we’re excited about creating usable voice technology for our machines. But to create voice systems, developers need an extremely large amount of voice data.
Most of the data used by large companies isn’t available to the majority of people. We think that stifles innovation. So we’ve launched Common Voice, a project to help make voice recognition open and accessible to everyone.
Now you can donate your voice to help us build an open-source voice database that anyone can use to make innovative apps for devices and the web. Read a sentence to help machines learn how real people speak. Check the work of other contributors to improve the quality. It’s that simple!


You can find more details on the official website https://voice.mozilla.org/

+ the CommonVoice app:
https://f-droid.org/en/packages/org.commonvoice.saverio/

9. Spellchecker software for the Bulgarian language

  • DSpellchecker plugin for Notepad++ (may also be adapted for use with NeoVim, Spacemacs, jEdit, notepad2 (notepa2-mod fork, not the original), notepad3, notepad2e, GitHub Atom text-editor , etc.
  • those F(L)OSS spellcheckers used in Mozilla Firefox, LibreOffice, Google Android OS, Google Chrome, etc: aspell, espell, hunspell (run via the Enchant spellchecking wrapper library!) - see: http://bgoffice.sourceforge.net
  • http://ibl.bas.bg - WinEst, MacEst - by BAS' IBL + ibl's freely-downloadable contemporary-Bulgarian-language dictionary datasets + OCR'd .pdf files of http://dv.government.bg and http://lex.bg .html/.pdf legal documents, etc. + webscraped versions of http://pcmania.bg , http://weblog.gatchev.info , & http://actualno.com & http://dnes.bg & paid-subscription libre contemporary Bulgarian-language citizen-journalism/gonze-journalism/underground-anonymous-activists' journalism websites...
  • idispellchecker + dictionary (you can use google.com to find older freeware versions of that software; newer versions are commercial paid but a free trial/demo version can also be downloaded) --- this is currently the best spellchecking app for Bulgarian, when used in combination with ОПРБЕ2012.pdf, ОПРБЕ2002.pdf, and http://kaksepishe.com & some further additions&corrections from (the yet-unfinished as of January 2022) Bulogos' BLSG style-guide and http://dictcorrect.com (but sahwar doesn't always agree with dictcorrect.com !) & with online Internet dictionary databases (http://onelook.com , http://bg.wiktionary.org , http://dicts.info/uddl/ , http://slovored.com , http://rechnik.info , http://bgjargon.com , etc.)
  • some other commercial Bulgarian-language spellchecking apps, etc. ...
  • etc.