Perso-Arabic script
From Wikipedia, the free encyclopedia
| Persian alphabet | |||||
|---|---|---|---|---|---|
| ﺍ ﺏ پ ﺕ ﺙ ﺝ چ | |||||
| ﺡ ﺥ ﺩ ﺫ ﺭ ﺯ ژ | |||||
| ﺱ ﺵ ﺹ ﺽ ﻁ ﻅ | |||||
| ﻉ ﻍ ﻑ ﻕ ک گ | |||||
| ﻝ ﻡ ﻥ ﻭ هـ ی | |||||
The Perso-Arabic script is a writing system based on the Arabic alphabet. Originally used exclusively for the Arabic language, the Arabic script was adapted to write Persian by adding four letters: پ [p], چ [ʧ], ژ [ʒ], and گ [g]. Many languages that use the Perso-Arabic script add further letters. Besides the Persian alphabet itself, the script has been adapted for the Urdu alphabet, the Kurdish Sorani alphabet, the Baloch alphabet, the Punjabi Shahmukhi script, Azeri, the Tajik-Persian alphabet and several others.
To represent non-Arabic sounds, new letters were created by adding dots, lines, and other shapes to existing letters. For example, the retroflex sounds of Urdu are written by adding a small ط above their non-retroflex counterparts: د [d̪] becomes ڈ [ɖ]. The voiceless retroflex fricative [ʂ] of Pashto is written by adding a dot above and below the letter س [s], giving ښ. The close central rounded vowel [ʉ] of Kurdish is written by doubling و [u], giving وو.
The Perso-Arabic script is always written cursively: most letters in a word connect to their neighbours. Computers implement this as well; when Perso-Arabic text is typed, the letters are joined automatically. Writing with unconnected letters is not widely accepted.
There are many Arabic-derived alphabets which were not influenced by the Perso-Arabic script, including Jawi (used for Malay), Sorabe (Malagasy), and many alphabets used in Northern Africa. These alphabets used other innovations for writing such common sounds as [p] and [g], instead of the Perso-Arabic letters پ and گ, although the Jawi script does use the same symbol for [ʧ] ( چ ).
A characteristic feature of this script, possibly tracing back to Ancient Egyptian hieroglyphs, is that vowels are underrepresented. For example, in Classical Arabic, of the six vowels, the three short ones are normally omitted entirely (except in the Qur'an), while the three long ones are represented ambiguously by certain consonants. Of the many languages using adaptations of this script, only Kashmiri and Kurdish regularly indicate all vowels.
Letters
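Below are the 32 letters of Persian. An asterisk (*) marks the initial and medial cells of the seven letters that do not connect to a following letter (see Exceptions).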
| Name | Transliteration | IPA | Final | Medial | Initial | Isolated |
|---|---|---|---|---|---|---|
| alef | ā / aa / ʼ / ʾ / ’ | various, including [ɒ] | ﺎ | ﺎ * | آ / ا * | ﺍ |
| be | b | [b] | ﺐ | ﺒ | ﺑ | ﺏ |
| pe | p | [p] | ﭗ | ﭙ | ﭘ | پ |
| te | t | [t] | ﺖ | ﺘ | ﺗ | ﺕ |
| se | s | [s] | ﺚ | ﺜ | ﺛ | ﺙ |
| jim | j | [ʤ] | ﺞ | ﺠ | ﺟ | ﺝ |
| che | c / č / ch | [ʧ] | ﭻ | ﭽ | ﭼ | ﭺ |
| he | h | [h] | ﺢ | ﺤ | ﺣ | ﺡ |
| khe | x / kh | [x] | ﺦ | ﺨ | ﺧ | ﺥ |
| dāl | d | [d] | ﺪ | ﺪ * | ﺩ * | ﺩ |
| zāl | z | [z] | ﺬ | ﺬ * | ﺫ * | ﺫ |
| re | r | [ɾ] | ﺮ | ﺮ * | ﺭ * | ﺭ |
| ze | z | [z] | ﺰ | ﺰ * | ﺯ * | ﺯ |
| zhe | ž / zh | [ʒ] | ﮋ | ﮋ * | ژ * | ژ |
| sin | s | [s] | ﺲ | ﺴ | ﺳ | ﺱ |
| šin | š / sh | [ʃ] | ﺶ | ﺸ | ﺷ | ﺵ |
| sād | s | [s] | ﺺ | ﺼ | ﺻ | ﺹ |
| zād | z | [z] | ﺾ | ﻀ | ﺿ | ﺽ |
| tā | t | [t] | ﻂ | ﻄ | ﻃ | ﻁ |
| zā | z | [z] | ﻆ | ﻈ | ﻇ | ﻅ |
| eyn | ʻ / ‘ | [ʔ] | ﻊ | ﻌ | ﻋ | ﻉ |
| qeyn | q / gh | [ɣ] / [ɢ] | ﻎ | ﻐ | ﻏ | ﻍ |
| fe | f | [f] | ﻒ | ﻔ | ﻓ | ﻑ |
| qāf | q / gh | [q] / [ɢ] | ﻖ | ﻘ | ﻗ | ﻕ |
| kāf | k | [k] | ﮏ | ﮑ | ﮐ | ک |
| gāf | g | [g] | ﮓ | ﮕ | ﮔ | گ |
| lām | l | [l] | ﻞ | ﻠ | ﻟ | ﻝ |
| mim | m | [m] | ﻢ | ﻤ | ﻣ | ﻡ |
| nun | n | [n] | ﻦ | ﻨ | ﻧ | ﻥ |
| vāv | v / u / ow | [v] / [u] | ﻮ | ﻮ * | و * | و |
| he | h | [h] | ﻪ | ﻬ | ﻫ | ﻩ |
| ye | y / i | [j] / [i] | ﯽ | ﻴ | ﻳ | ﻯ |
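The positional shapes in the table above appear to be printed with Unicode's Arabic presentation-form characters, one code point per shape. Ordinary text normally stores only the base letter and leaves the choice of shape to the rendering software. As a minimal sketch in Python (the helper name `to_base_letters` is illustrative and not part of any library), compatibility normalization with the standard `unicodedata` module folds the four shapes of be back to a single base letter:

```python
import unicodedata

def to_base_letters(text):
    """Fold Arabic presentation forms back to base letters via NFKC."""
    return unicodedata.normalize("NFKC", text)

# The four shapes of "be": isolated, final, initial, medial
BE_SHAPES = ["\uFE8F", "\uFE90", "\uFE91", "\uFE92"]

for shape in BE_SHAPES:
    base = to_base_letters(shape)
    # Print the official character name of each shape and of its base letter
    print(unicodedata.name(shape), "->", unicodedata.name(base))
```

Each of the four shapes maps to the same base code point, U+0628, which is what a keyboard would normally produce.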
Exceptions
Seven letters of the Persian alphabet (ا, د, ذ, ر, ز, ژ, و) do not connect to the following letter, unlike the rest of the alphabet. These letters have no distinct initial or medial forms; the isolated and final forms are used instead, because they do not allow a connection on the left-hand side to the next letter in the word. For example, when the letter ا alef begins a word such as اینجا "injā" (here), its isolated form serves as the initial form. Likewise, in اِمروز "emruz" (today) the letter ر re takes the final form and the letter و vāv takes the isolated form, even though both are in the middle of the word.
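The joining rule described above can be sketched in a few lines of Python. This is a simplified illustration, not a full shaping engine: the names `NON_CONNECTING` and `positional_forms` are made up for this example, and the code assumes the input contains only plain letters of the Persian alphabet plus optional vowel diacritics (no hamze, ligatures, or zero-width characters).

```python
import unicodedata

# The seven letters that never join to the following letter:
# alef, dal, zal, re, ze, zhe, vav
NON_CONNECTING = set("\u0627\u062F\u0630\u0631\u0632\u0698\u0648")

def positional_forms(word):
    """Return (letter, form) pairs for a word made of plain Persian letters."""
    # Drop combining marks such as the short-vowel diacritics.
    letters = [ch for ch in word if not unicodedata.combining(ch)]
    result = []
    for i, ch in enumerate(letters):
        # A letter joins backwards only if the previous letter connects forward.
        joins_prev = i > 0 and letters[i - 1] not in NON_CONNECTING
        # A letter joins forwards only if it is not one of the seven exceptions.
        joins_next = i < len(letters) - 1 and ch not in NON_CONNECTING
        if joins_prev and joins_next:
            form = "medial"
        elif joins_prev:
            form = "final"
        elif joins_next:
            form = "initial"
        else:
            form = "isolated"
        result.append((ch, form))
    return result

# "emruz" (today): alef, mim, re, vav, ze
for letter, form in positional_forms("\u0627\u0645\u0631\u0648\u0632"):
    print(unicodedata.name(letter), form)
```

For "emruz" the sketch reports re as final and vāv as isolated, matching the example in the text; the isolated shape is the one the table lists in the initial column for these seven letters.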
Other characters
The following are not actual letters, but rather different orthographic shapes for letters and, in the case of lām alef, a ligature. As for ء hamze, it has only a single shape, since it is never joined to a preceding or following letter. However, it is sometimes 'seated' on a vāv, ye or alef, and in that case the seat behaves like an ordinary vāv, ye or alef respectively. Technically, hamze is not a letter but a diacritic.
| Name | Transliteration | IPA | Final | Medial | Initial | Isolated |
|---|---|---|---|---|---|---|
| alef madde | ā | [ɒ] | ﺂ | — | — | ﺁ |
| he ye | -eye or -eyeh | [eje] | ﮥ | — | — | ۀ |
| lām alef | lā | [lɒ] | ﻼ | — | — | ﻻ |
Although the alphabets may seem similar at first glance, the languages that use them differ in many ways. For example, similar words are written differently in Persian and Arabic, because the two languages use the script differently.
The Persian alphabet adds four letters to the Arabic alphabet, for the sounds [p], [ʧ] (ch as in chair), [ʒ] (zh, like the s in measure) and [g]:
| Sound | Shape | Name |
|---|---|---|
| [p] | پ | pe |
| [ʧ] (ch) | چ | che |
| [ʒ] (zh) | ژ | zhe |
| [g] | گ | gaf |
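The official Unicode character names of these four letters differ slightly from the Persian names given in the table. A short Python sketch using the standard `unicodedata` module can look them up; the dictionary `ADDED_LETTERS` and the function `unicode_names` are illustrative names chosen for this example.

```python
import unicodedata

# The four letters Persian adds to the Arabic alphabet,
# keyed by the names used in the table above
ADDED_LETTERS = {
    "pe": "\u067E",
    "che": "\u0686",
    "zhe": "\u0698",
    "gaf": "\u06AF",
}

def unicode_names(letters):
    """Map each Persian letter name to its official Unicode character name."""
    return {name: unicodedata.name(ch) for name, ch in letters.items()}

for persian_name, official in unicode_names(ADDED_LETTERS).items():
    print(persian_name, "->", official)
```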
Changes from the Arabic writing system
The following is a list of differences between the Arabic writing system and the Persian writing system:
- A hamza (ء) is not written above an alif (ا) to denote a zabar or pesh, or below it to denote a zer.
- A hamza is not typically written in the final kaaf (ک). However, a hamza is sometimes added to a final kaaf in Persian for stylistic reasons.
- A hamza is not typically written in Persian to separate two vowels, as it is in Arabic. For example, the Arabic word aljazaaer (Algeria) is written الجزائر. Persian drops this convention unless the word originates from Arabic; for example, the word chaai (tea) is written چای. In Persian grammar, words ending in yeh and words ending in hamza-yeh have different grammatical meanings: کتابهای means "the books of," whereas کتابهائی means "some books."
- The Arabic letter taa marbuuta (ة) is sometimes changed to a taa (ت) because taa marbuuta is a grammatical construct in Arabic denoting femininity. Since Persian grammar lacks gender constructs, the taa marbuuta is not necessary and is only kept to maintain fidelity to the original Arabic spelling.
- The two dots are dropped from the final yeh (ی). Arabic distinguishes the final yeh with two dots from the final yeh without them, which is called alif maqsura and is pronounced like an alif. Because Persian drops the two dots, the alif maqsura cannot be told apart from the ordinary final yeh by spelling alone. For example, the name Musa (Moses) is written موسی; the spelling does not show whether the final letter is a Persian yeh or an alif maqsura. In this case, it is an alif maqsura.
- The letters peh (پ) and gaaf (گ) are added because Semitic languages like Arabic and Hebrew do not have separate letters for P and F or for G and J, and the phonetics of Persian require this separation. The letters cheh (چ) and zheh/jheh (ژ) are added because Arabic lacks these sounds, which are found in Persian.
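In Unicode, the Arabic-style and Persian-style shapes of kaaf and yeh are encoded as separate code points, so text typed on an Arabic keyboard can differ from text typed on a Persian one even when the words are the same. The following Python sketch folds the Arabic code points into their Persian counterparts. It rests on assumptions: the mapping table `ARABIC_TO_PERSIAN` and the function `to_persian_letters` are illustrative, the choice to fold alif maqsura into the Persian yeh simply mirrors the point above that Persian spelling does not distinguish them, and a real application may need a different or larger table.

```python
# Arabic code points and the Persian-style letters assumed to replace them
ARABIC_TO_PERSIAN = str.maketrans({
    "\u064A": "\u06CC",  # ARABIC LETTER YEH          -> ARABIC LETTER FARSI YEH
    "\u0649": "\u06CC",  # ARABIC LETTER ALEF MAKSURA -> ARABIC LETTER FARSI YEH
    "\u0643": "\u06A9",  # ARABIC LETTER KAF          -> ARABIC LETTER KEHEH
})

def to_persian_letters(text):
    """Replace Arabic-style yeh, alif maqsura and kaaf with Persian-style letters."""
    return text.translate(ARABIC_TO_PERSIAN)

# "Musa" spelled with an Arabic alif maqsura (mim, vav, sin, alif maqsura)
musa_arabic = "\u0645\u0648\u0633\u0649"
musa_persian = to_persian_letters(musa_arabic)
print([hex(ord(ch)) for ch in musa_persian])
```

After the conversion the final letter is the dotless Persian yeh, so the distinction between yeh and alif maqsura is no longer visible in the code points either.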
Word boundaries
Typically words are separated from each other by a space. Certain morphemes (such as the plural ending '-hā') are written without a space but separated from the previous word with a zero-width non-joiner.
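As a small illustration of the zero-width non-joiner (U+200C), the Python sketch below attaches the plural ending '-hā' to a noun. The function name `add_plural_ha` is made up for this example, and the sketch ignores any spelling rules beyond the one described above.

```python
ZWNJ = "\u200C"  # ZERO WIDTH NON-JOINER

def add_plural_ha(stem):
    """Attach the plural ending -hā with a ZWNJ instead of a space."""
    return stem + ZWNJ + "\u0647\u0627"  # he + alef

ketab = "\u06A9\u062A\u0627\u0628"  # "ketāb" (book)
ketab_ha = add_plural_ha(ketab)

# Show the code points, including the invisible U+200C
print([hex(ord(ch)) for ch in ketab_ha])

# The ZWNJ is not whitespace, so the result is still a single token
print(len(ketab_ha.split()))
```

Because the non-joiner is not a space, software that splits text on whitespace keeps the stem and its ending together as one word, while the renderer still breaks the cursive connection between them.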
See also
- Arabic alphabet
- Scripts used for Persian
- Persian phonology
- Ottoman Turkish language
- History of the Arabic alphabet
- List of languages by writing system (Arabic script)
- Urdu alphabet
- Nastaʿlīq script
- Shahmukhi
- Ajami script

