Wiktionary:Persian transliteration/Iranian
Appearance
This page focuses on Iranian Persian, for transliteration rules concerning Classical Persian or Dari Persian see Persian transliteration/Classical. For transliteration rules concerning Tajik, see Wiktionary:Tajik transliteration.
Persian transliteration
[edit]Persian | Wiktionary | IPA | Others (dispreferred) |
---|---|---|---|
ا (word-initial) | a, o, e | (ʔ)æ, (ʔ)o, (ʔ)e | |
ا (other positions[1]) | â | ɒː | ā |
آ | â (word-initial) -â (other positions[1]) |
(ʔ)ɒː (word-initial) ʔɒː (other positions) |
ʼā |
ب | b | b | |
پ | p | p | |
ت | t | t | |
ث | s | s | th, s̱, ṯ, s̄ |
ج | j | dʒ | ǧ |
چ | č | tʃ | ch, c |
ح | h | h | ḥ, ḩ |
خ | x | x | kh, k͟h, ḫ, ḵ |
د | d | d | |
ذ | z | z | dh, d͟h, ẕ, ḏ |
ر | r | r | |
ز | z | z | |
ژ | ž | ʒ | zh, z͟h |
س | s | s | |
ش | š | ʃ | sh, s͟h |
ص | s | s | ṣ |
ض | z | z | ḍ, ż, ẕ |
ط | t | t | ṭ, ţ |
ظ | z | z | ẓ, z̧ |
ع | '
∅ (word initial[2]) |
ʔ, ː | ʻ, ʿ |
غ | ġ | ɣ, ɢ | q, gh, g͟h, ğ |
ف | f | f | |
ق | ġ | ɣ, ɢ | q, gh, g͟h, ğ, ḳ |
ک | k | k | |
گ | g | g | |
ل | l | l | |
م | m | m | |
ن | n | n | |
و (consonant) | v | v | w |
و (long vowel) | u | uː, oː | ô, ū, ō |
و (diphthong[3]) | ow | ow | au, aw |
خوا (e.g. خواندن etc.) | xâ | xɒː | xwā-, khwā- |
خوی (e.g. خوید etc.) | xi | xiː | |
ه (consonant) | h (may appear in final position after a vowel, e.g. ده (dah)) | h | |
ـه (word-final vowel) | e | e, æ | eh, a, ah |
ی (consonant) | y | j | j |
ی (long vowel) | i | iː, eː | ê, ī, ē |
ی (diphthong[4]) | ey | ej | ai, ay |
یٰ (always word-final) | â | ɒː | ā, á |
- ↑ 1.0 1.1 In a compound such as سَرْاَنْجام (sar-anjâm) or سَرْآغاز (sar-âġâz) it is treated as a hyphen and a short vowel or â follows it. (Note that colloquially it is assimilated into the consonant before it e.g. saranjâm and sarâġâz)
- ^ Initial ع is not transliterated as it is indistinguishable from an equivalent alef with the same vowel. [1] – unless it is a dialect where the two are distinguished, i.e. Nishapuri, Kulabi and Arabized forms in Khuzestan; e.g. 'asb as a dialetcal form of اَسْب (asb)
- ^ Unless bearing a shadda, in which case it is treated as consonant, e.g. اوّل (avval), not *owval
- ^ Unless bearing a shadda, in which case it is treated as consonant, e.g. ایّوب (ayyub), not *eyyub
Diphthongs
[edit]Romanization | IPA | Final | Medial | Initial | Regular? |
---|---|---|---|---|---|
ey | /ej/ | ـِىْ | ـِیْـ | اِیْـ | Yes |
ay | /æj/ | ـَىْ | ـَیْـ | اَیْـ | Only borrowings |
oy | /oːj/ | ـُىْ | ـُیْـ | اُیْـ | Only borrowings |
ây | /ɒːj/ | ـاىْ | ـایْـ | آیْـ | Yes |
aw | /æw/ | ـَوْ | اَوْ | No | |
âw | /ɒːw/ | ـاوْ | آوْ | Only borrowings | |
uy | /uːj/ | ـوىْ | ـویْـ | اویْـ | Yes |
- In most cases, Classical Persian ay and aw have turned into Iranian Persian ey and ow respectively.
Other symbols or combinations
[edit]- ـاً (-an) (always word-final) – an (The position of [fathatan] is after the alef, not before, as is the current practice with Arabic)
- When a word ending with آ (â), ـا (â), ء (') or other variations gets the tanvin, it is written as either ءً ('an) or ـئاً (-'an) and is transliterated as – 'an in both cases.
- اِبْتِداءً ― ebtedâ'an ― in the beginning
- اِبْتِدائاً ― ebtedâ'an ― in the beginning
- ـٌ (-on) (always word-final, only in compound terms related to grammar and math, usually not written) – on
- مُضافٌاِلَیْه ― mozâfon-elayh ― genitive noun
- ـٍ (-en) (always word-final, only in unadapted borrowings, rare) – en
- بِعِبارَةٍاُخْریٰ ― be'ebâraten-oxrâ ― in other words
- All forms of hamze, including ء, ؤ, ئ, and أ are transliterated as ' (dispreferred: ʻ, ʿ) – unless the hamze is a substitute for a native glottal stop, where it is transliterated as a hyphen. (see below)
- The (ezâfe) vowel is transcribed differently depending on context:
- ـِ (-e) (ezâfe) (always word-final, after a consonant, mostly unmarked in regular writing) – -e
- ـیِ (--ye) (ezâfe) (after long vowels ا (â) or و (u), sometimes marked in regular writing) – -ye
- آفْریقایِ جُنوبی ― âfriġâ-ye jonubi ― South Africa
- ـیِّ (i-ye) (ezâfe) (always word-final with ی (i), mostly unmarked in regular writing) – i-ye
- جُمْهوریِ تاجیکِسْتان ― jomhuri-ye tâjikestân ― Republic of Tajikistan
- ـهٔ (-h-ye) (U+0647 U+0654), sometimes written as ـهی (-h-i) (always word-final) – e-ye. (Articles don't contain the hamze above "he", it is considered a diacritic and only used in the display using
|head=
. Templates link to words without the hamze.)
- indefinite article ـی (-i):
- The adjectival and abstract noun version of ـی (-i) follow the above rules without the hyphen, unless there is no consonants to which a hyphen is placed between the two vowels. – Note that if ـی (-i) is used to form an abstract noun, it gets converted to ـِگی (-egi) if the word ends with an ـه (-e)
- نه (“no, not”) – na (an exception)
- ـّ (tashdid) – geminate consonant (Arabic shadda)
- ـَ (-a) (fathe/zor) – a (Arabic fatha)
- ـِ (-e) (kasre/zir) – e (in modern Iranian, to check cases where it's "i") (Arabic kasra)
- ـُ (-o) (zamme/pish) – o (in modern Iranian, to check cases where it's "u") (Arabic damma). Used after consonants to make a short "o" sound. If used before و (o) produces a diphthong "ow":
- نُوْروز (nowruz)
- ـ۟ (sukūn/sokun) - marks absence of a vowel. Rarely used in popular Persian vocalisations, especially on final consonants. It may be necessary to use consistently in strict vocalisations to avoid any misreadings, allow automation and signalling that a word IS vocalised.
- ـه (-h) (in the word-final position after consonants for a large number of words) - e (no hyphen) (note with ezâfe the preferred spelling is ـهٔ (-h-ye))
- هَفْتِه ― hafte ― week
- ـه (-h) - when used as a colloquial copula in the 3rd person singular (he/she/it is) - -e (with a hyphen)
- تِهْرون پایْتَخْتِ ایرونِه. (colloquial) ― tehrun pâytaxt-e irun-e. ― Tehran is the capital of Iran.
- ZWNJ – - (hyphen)
- The use of hyphens for etymological reasons - suffixes, compound words, etc. when no ZWNJ is used is to be discussed. E.g. currently plural form suffix ـها (-hâ) is transliterated as "-hâ" regardless if ZWNJ is present or not. (Apart from cases described above and ZWNJ, the use of hyphen is otherwise dispreferred. A space is transliterated as a space and the absence of space or ZWNJ is transliterated as nothing.)
- Below are transliteration examples to contrast the use of ZWNJ on connecting letters, space, nothing and non-connecting letters:
- ZWNJ on joining letters کتابها (ketâb-hâ) (plural of کتاب (ketâb))
- Space کتاب ها (ketâb hâ)
- Nothing (joining letters are connected) کتابها (ketâbhâ)
- Non-joining letters (no ZWNJ is used) اتوها (otuhâ) (plural of اتو (otu))
- ـیـ (-y-): (V)y(V) (for Arabic borrowings, see below)
Arabic loanwords
[edit]- ـة (-h) (always word-final) – at (rare, only in unadapted borrowings from Arabic, normally adapted into Persian as ـت (at) or ـه (e))
- الـ (note the hyphen after "l" or the first of the assimilated letter is for etymological reasons):
- al- (normally):
- اَلْیُوم (rare) ― al-yowm ― today
- "l" can change to the following consonant if it's a "sun letter", and "a" can change to "o" (Arabic "u") in ezâfe, e.g.
- With ZWNJ: -(C)-(C)
- فارِغُالتَّحْصیل (fâreġo-t-tahsil)
- With space: (C)-(C)
- فارِغٌ التَّحْصیل (fâreġo t-tahsil)
- With ZWNJ: -(C)-(C)
- moon letters (Note that the alef remains silent (اَلِفِ وَصْل (alef-e vasl))):
- With ZWNJ: -l-
- بِیْنُالْمِلَلی (beyno-l-melali)
- With space: l-
- بِیْنُ الْمِلَلی (beyno l-melali)
- Most of the time the vowel of the last first word is "o" following Arabic grammar rules, but variations do exists:
- Suffixes: – e
- بِالْکُلّ (be-l-koll)
- sometimes: – a
- بِیْنَالْمِلَلی (beyna-l-melali)
- بِیْنَ الْمِلَلی (beyna l-melali)
- Suffixes: – e
- al- (normally):
- ـیّت, ـیّات, and ـیّه: iyat, iyât, and iye (for native ـیـ (-y-), see above)
- اَهَمّیَّت (ahammiyat)
- تَقْویَّت (taġviyat)
- خُصوصیّات (xosusiyât)
- نَشْریّه (našriye)