Pashto Linguistic Tools

په دے ډبې کښې تاسو خپل ليکل ټائپ کولے شۍ۔ دا هم کولے شۍ چه چرته نه غونډ ټېکسټ کاپي پېسټ کړۍ۔

Development Work: This software on this page is developed jointly by the Center of Computational Linguistics (CoCL), FAST National University of Computer & Emerging Sciences (NUCES), Peshawar, and the Pashto Academy (PA), University of Peshawar. Development work is undertaken by Dr. Taimoor Khan and Dr. Omar Usman Khan from CoCL, whereas data curation is by Dr. Nasrullah Wazir from PA. The tools here are in continual development, and made available for purpose of field testing and community feedback.

How to Help: We are in need of additional words for our corpus. If you would like to donate pashto text collections available with you or your organization (with more than 10,000 words), please contact us.

Note: (1) This is not a permanent URL. (2) These tool does not work on Internet Explorer.

Pashto Spell Checker پښتو هجايي غلطۍ نيونکی (v1.0 18/April/2021)

About Tool: This is a probabalistic tool that checks for isolated non-word errors within some Pashto text. The check is performed against a corpus [a1] of 1.01 million words (.125 million unique words), while corrections are suggested against character shufflings yielding the highest probabilities.

How to Use: Just type your text (or copy paste your text CTRL+SHIFT+V) in the given text-box (highlighted in pink). Click on the Check Spelling button, and suitable corrections will be identified below the text. The first word has the highest probability, whereas the last word has the least probability. Color fades to white for weak probability words. The tool does not check for spelling as you type.

[a1] Pashto Pashto Dictionary, Pashto Academy, University of Peshawar, Available Online

Pashto Syllable Checker (v1.0 21/April/2021)

About Tool: This tool identifies syllable structure (C)(C)(C)V(C)(C) defined in [b1, b2], where parenthesis denotes an optional inclusion, C are consonantal phonemes, V are vowels, (C)(C)(C) represent the Onset, and (C)(C) represents the coda.

How to Use: Just type your text (or copy paste your text CTRL+SHIFT+V) in the given text-box (highlighted in pink). Click on the Syllable Analysis button. Segmented words of the text will appear containing the number of syllables, their syllable clusters, and the actual syllables. The last line shows the total number of syllables, followed by their text and cluster based histograms.

[b1] Tegey, H. and Robson, B., A Reference Grammar of Pashto, Dept. of Education, Center for Applied Linguistics, 1996
[b2] Khan, M. K., Pashto Phonology: Relationship between Syllable Structure and Word Order, PhD Thesis, AJK University, 2012