Sindhi transliteration

Sindhi is a language broadly spoken by the people of the historical Sindh region in the Indo subcontinent. Modern Sindhi is written in an extended Perso-Arabic script in Sindh province of Pakistan[1] and (formally) in extended-Devanagari by Sindhis in partitioned India.[2] Historically, Sindhi was written in various forms of Landa scripts and various other Indic scripts.[3]

Sindhi Transliteration is essential to convert between Arabic and Devanagari so that speakers of both the countries can read the text of each other.[4] In modern day, Sindhi script colloquially just refers to the Perso-Arabic script since majority of Sindhis are from Pakistan. It is also important to note that the Sindhi script is not same as the Urdu-Shahmukhi script,[5] hence one cannot use script conversions like Hindi-Urdu Transliteration.

Technically, a direct one-to-one mapping or rule-based script conversion is not possible between Pakistani and Indian Sindhi, majorly since Devanagari is an abugida script and Arabic-Sindhi is an abjad script, and also other constraints like multiple similar characters from Perso-Arabic which map onto a single character in Devanagari.[6] Hence it is preferred to use dictionary-based or machine learning-based transliteration between the Sindhi scripts.[7] For colloquial usage in the digital space where writing Sindhi in Latin script is prevalent, Romanisation of Sindhi is used.[8]

In addition to Sindhi, there have been attempts to design Indo-Pakistani transliteration systems for digraphic languages like Punjabi (written in Gurmukhi in East Punjab and Shahmukhi in West Punjab), Saraiki (written in an extended-Shahmukhi script in Saraikistan and unofficially in Sindhi-Devanagari script in India) and Kashmiri (written in extended Perso-Arabic by Kashmiri Muslims and extended-Devanagari by Kashmiri Hindus).[9][10] [11]

Consonants

edit

The following table provides an approximate one-to-one mapping for modern Sindhi consonants,[12] especially for computational purposes (lossless script conversion). Note that this direct script conversion will not yield correct spellings, but rather a readable text for both the readers.[13]

Sindhi Consonants
Sindhi Abjad Urdu-Shahmukhi[14] Roman Devanagari[15]
ڪ ک k
ک کھ kh
ق ق q क़
خ خ k͟h ख़
گ گ g
غ غ g͟h ग़
ڳ ڳ
گھ گھ gh
چ چ c
ڇ چھ ch
ج ج j
جھ جھ jh
ڄ ڄ
ز ز z ज़
ذ ذ z ज़
ض ض z ज़
ظ ظ z ज़
ژ ژ zh झ़
ٽ ٹ
ٺ ٹھ ṭh
ڏ ݙ
ڊ ڈ
ڍ ڈھ ḍh
ڙ ڑ ड़
ڙھ ڑھ ṛh ढ़
ت ت t
ٿ تھ th
ط ط t
د د d
ڌ دھ dh
ن ن n
ڻ ݨ
ڱ ن٘
ڃ ݩ ñ
پ پ p
ڦ پھ ph
ف ف f फ़
ب ب b
ٻ ٻ ॿ
ڀ بھ bh
م م m
ي ی y
ر ر r
ل ل l
ݪ لؕ
و و v
ش ش sh
ݜ سؕ ṣh
س س s
ص ص s
ث ث s
ه ہ h
ح ح h
ۃ ۃ h
ع ع ʿ ʿ

Single-letter Word Ligatures

edit
Sindhi Urdu Roman Devanagari
۽ اَیں a͠i ऐं
۾ میں mẽ में

Numerals

edit
Usage Numeral System
Pakistan Arabic numerals ۰ ۱ ۲ ۳ ۴ ۵ ٦ ۷ ۸ ۹
International Hindu-Arabic 0 1 2 3 4 5 6 7 8 9
India Modern Devanagari

Punctuations & Symbols

edit
Script Period Question Mark Comma Semi-colon Slash Percent End of verse
Perso-Arabic ۔ ؟ ، ؛ ؍ ٪ ۝
Modern Devanagari ? , ; / %

Web Application for Sindhi Translitration

edit

This Program is about auto Translitration of Sindhi script into asian Languages . with Artificial Intelligence features, actually transliteration programs may be on bases like letter based transliteration in which multiple alternates of any letter in second languages die and one remains..... some are word base transliterations in which as more as half million words should be first vowel-led and then would be written an other language, this is also time taken work. now this is A I time so this program called sindhi transliteration is one click solution to all multiple alternate letters.... for example if someone writes <khokhar khoraka khae tho> the program wil transliterate it accurately as کوکر خوراڪ کائي ٿو. Sindhi has three vowels which also are used as consonants like ا، و، ي. For example alif has two sounds like in ada ادا has two sounds one is of alif and an other is vowel of aa. Also, if someone writes سنڌ without zer zabar and peshu the program will convert it with vowels like <sindh>.

Sindhi Trans[16]literation سنڌي ٽرانسلٽريشن

References

edit
  1. ^ "Sindhi gains national status". The Express Tribune. 2016-10-03. Retrieved 2021-07-02.
  2. ^ "Languages Included in the Eighth Schedule of the Indian Constution [sic]". Department of Official Language, Ministry of Home Affairs. Retrieved 2018-04-09.
  3. ^ Lakhani, Rakesh (2013-06-06). "Forgotten Sindhi Script – Waranki". Indigenous Sindhis. Retrieved 2021-07-02.
  4. ^ "Transcending barriers: Software to break down the wall within the Sindhi language". The Express Tribune. 2014-09-18. Retrieved 2021-07-02.
  5. ^ "The Imposition Of Urdu". NAWAIWAQT GROUP OF NEWSPAPERS. September 10, 2015. Retrieved 12 September 2015.
  6. ^ Leghari, Mehwish; U Rahman, Mutee (2015-10-30). "Towards Transliteration between Sindhi Scripts Using Roman Script". Linguistics and Literature Review. 1 (2): 95–104. doi:10.32350/llr.12.03. SSRN 3820433.
  7. ^ Lehal, Gurpreet Singh; Saini, Tejinder Singh (December 2014). "Sangam: A Perso-Arabic to Indic Script Machine Transliteration Model": 232–239. {{cite journal}}: Cite journal requires |journal= (help)
  8. ^ Sodhar, Irum Naz; Jalbani, Akhtar Hussain; Channa, Muhammad Ibrahim; Hakro, Dil Nawaz (2021-04-01). "Romanized Sindhi Rules for Text Communication". Mehran University Research Journal of Engineering and Technology. 40 (2): 298–304. Bibcode:2021MURJE..40..298S. doi:10.22581/muet1982.2102.04. ISSN 2413-7219.
  9. ^ "Perso-Arabic To Indic Script Transliteration". sangam.learnpunjabi.org. Retrieved 2021-04-07.
  10. ^ "Saraiki - Devanagari Machine Transliteration System - SDMTS". www.sanlp.org. Retrieved 2021-08-09.
  11. ^ Lawaye, Aadil; Kak, Aadil; Mehdi, Nali (January 2010). "Building a Cross Script Kashmiri Converter: Issues and Solutions". Proceedings of Oriental COCOSDA.
  12. ^ NC, Gokul (2021-05-07), GokulNC/Indic-PersoArabic-Script-Converter, retrieved 2021-05-28
  13. ^ Pedersen, Thomas T. "Sindhi: Transliteration of Non-Roman Scripts" (PDF). Transliteration of Non-Roman Scripts.
  14. ^ Carefully note the differences. List also includes 4 implosive consonants from Saraiki alphabet (ڳ, ڄ, ݙ, ٻ)
  15. ^ All the consonants with nuqta sign (dots below) denote adaptation for PersoArabic consonants
  16. ^ "Sindhi Translitration سنڌي ٽرانسلٽريشن، Sindhi Script Converter سنڌي لپي مٽائيندڙ". fahad-memon.com/sindhi_transliteration. Retrieved 2022-04-18.