Hebrew Grapheme-to-Phoneme Conversion
for Real-Time
Text-to-Speech
Real-time text-to-speech (TTS) for Modern Hebrew is challenging due to its complex writing system and underspecified phonetic features such as stress.
To address this, we present Phonikud, a lightweight, open-source grapheme-to-phoneme system that produces fully-specified IPA transcriptions with minimal latency.
We also introduce ILSpeech, a new Hebrew speech dataset with expert IPA annotations, designed for both benchmarking and training.
Our results demonstrate that Phonikud improves phoneme prediction accuracy and enables fast, effective Hebrew TTS models.
Works with real-time TTS like Piper using standard IPA phonemes.
Runs locally - can be used on Raspberry Pi, Home Assistant, or other edge devices for private, efficient operation.
Fine-tunes the TTS models with as little as 2 hours of data by leveraging models from other languages.
Handles Hebrew stress and vocal shva missed by existing methods.
Can be used on screen readers with low delay, even offline or remote.
Published a studio-quality Hebrew speech dataset with ~2 hours of audio and hand-annotated IPA phonemes.
Released Phonikud weights, Hebrew TTS models, and full training code.
You can edit phonemes directly or let the G2P system handle it automatically, giving control over stress and vowel sounds.
See how Phonikud transforms Hebrew text through each stage.
Comparative evaluation of Phonikud against existing Hebrew TTS approaches
Text Sample |
ElevenLabs
Eleven v3
|
Google
Gemini v2.5
|
RoboShaul
1st place
|
Phonikud (Ours)
Ours v1 (alpha)
|
---|---|---|---|---|
ืืื ืฆืคื ืืกึซืจื ืืจืื ืืื ืฉืฆึซืคื ืืึซืื ๐ธ | ||||
ืืื ืจืฆื ืืช ืื ืื ืืื ืืื ืจึซืฆื ืืืจ ืืืงืึซืืื ืืืชื ๐โโ๏ธ | ||||
ืืื ืชืจื ืืืืื ืืฉ ืืืจึซืงืก ืขื ืชึซืจื ๐ฅฌ |
@misc{kolani2025phonikud,
title={Phonikud: Hebrew Grapheme-to-Phoneme Conversion for Real-Time Text-to-Speech},
author={Yakov Kolani and Maxim Melichov and Cobi Calev and Morris Alper},
year={2025},
eprint={2506.12311},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2506.12311},
}