The MILA Morphological Analysis Tool takes as input undotted Hebrew text (formatted either as plain text or as tokenized XML following MILA's standards). The Analyzer then returns, for each token, all the possible morphological analyses of the token, reflecting part of speech, transliteration, gender, number, definiteness, and possessive suffix.
Enter up to 100 tokens of undotted Hebrew text:
XML output follow MILA's XML standards for corpora.
Password-protected. Please register to access (free for non-commercial use).
Documentation (in Hebrew)
PDF file, 162 KB.
- Current implementation developed by Dalia Bojan.
- Grammar upon which the analyzer is based developed by Shlomo Yona and Shuly Wintner.
- Maintained by Yossi Jacob, MILA Research Engineer (contact).
For non-commercial research purposes, this tool is licensed under the GNU General Public License (GPL). Any publications resulting from the use of this tool should refer to it as "The MILA Hebew Morphological Analysis Tool" and cite:
Shlomo Yona and Shuly Wintner. "A Finite-State Morphological Grammar of Hebrew." Natural Language Engineering 14(2):173-190, April 2008. Language Resources and Evaluation 42(1):75-98, March 2008. [BibTeX]