-
Notifications
You must be signed in to change notification settings - Fork 51
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support for converting Uyghur Latin Script (tex) to Uyghur Arabic (PDF) #482
Comments
We do support multiple scripts (same in in- and output) via the Wikipedia tells me that three scripts are common in different regions: Arabic, Latin, and Cyrillic. Given this, a |
Note, though, that the ArabXeTeX package supports transliteration also for Uyghur (via TECKit fontmappings). There is also ArabLuaTeX, but this doesn't yet support Uyghur yet. I am not sure polyglossia and ArabXeTeX can be used together. |
@jspitz is there any tutorial for |
No, just look at other languages that provide this option (e.g., Kurdish) |
If you want to input in one script and obtain output in another script you can use the mapping mechanism of XeTeX.
You need to create a map file in TECkit syntax, as in https://scripts.sil.org/cms/scripts/page.php?site_id=nrsi&id=teckit
and then convert it to tcp format using the teckit_compile tool you can download from SIL
(at https://scripts.sil.org/cms/scripts/page.php?site_id=nrsi&id=teckitdownloads). In fact TECkit takes over (without
acknowledging it) ideas that were previously implemented in Omega through Omega Translation Processes…
But beware of the fact that this mapping operation is very simple (it consists of rules "send this string to that string") so your
Latin input must contain some additional information you don't use normally, for example a character for the hamza
in the Arabic script. The system is (probably) not intelligent enough to say "these are two vowels, let me put a hamza
inbetween" (I say "probably" because I haven't yet tested it on Uyghur).
The other inconvenient is that you will need to use macros whenever you switch from one script to another, since
for both Uyghur and English (or Russian) you will use the same input script.
Yannis
Le 22 mars 2021 à 01:59, Osman Tursun ***@***.***> a écrit :
So, I wonder if the polyglossia package supports Uyghur written with Latin characters.
<http://www.imt-atlantique.fr/> Yannis HARALAMBOUS
Professor
Computer Science Department
UMR CNRS 6285 Lab-STICC
<http://perso.telecom-bretagne.eu/yannisharalambous/> <https://twitter.com/y_haralambous> <https://www.linkedin.com/in/yannis-haralambous-5529073?trk=hp-identity-name>Technopôle Brest-Iroise CS 83818
29238 Brest Cedex 3, France
Une école de l'IMT <http://www.imt.fr/>
I have no more answers (John Cage)
|
dear Osman,
here is support for converting Latin to Arabic script for the Uyghur language. I have used the ULY correspondence
as described in Wikipedia <https://en.wikipedia.org/wiki/Uyghur_alphabets>. Remains the problem of the digraphs
zh, sh and ng: how to distinguish them from standalone letters (there is no problem for ch since there is no c in the
transliteration scheme). I also used both ë and é, as in the picture on the same page there is an é instead of an ë.
I'm sending you the TeX file, the MAP file, the TEC file and the resulting PDF (the input for Latin and Arabic is strictly
the same Latin-alphabet string).
If you can send me a longer text in Latin alphabet I can typeset it and we can start debugging.
You need XeTeX to run this file.
Cheers,
Yannis
<http://www.imt-atlantique.fr/> Yannis HARALAMBOUS
Professor
Computer Science Department
UMR CNRS 6285 Lab-STICC
<http://perso.telecom-bretagne.eu/yannisharalambous/> <https://twitter.com/y_haralambous> <https://www.linkedin.com/in/yannis-haralambous-5529073?trk=hp-identity-name>Technopôle Brest-Iroise CS 83818
29238 Brest Cedex 3, France
Une école de l'IMT <http://www.imt.fr/>
Und nach einer kleinen Stille fügte Sie hinzu:
Jeder Weg, der dorthin führt, war am Ende der richtige. (Michael Ende)
|
@yannis1962 We cannot see the attachments here? Are they supposed to be included in polyglossia or used separately? Also, how hard would it be to support transliteration for Serbian Cyrillic? (I can provide bidirectional Unicode mappings.) |
Le 22 mars 2021 à 23:53, Ivan Kokan ***@***.***> a écrit :
dear Osman, here is support for converting Latin to Arabic script for the Uyghur language. I have used the ULY correspondence as described in Wikipedia https://en.wikipedia.org/wiki/Uyghur_alphabets <https://en.wikipedia.org/wiki/Uyghur_alphabets>. Remains the problem of the digraphs zh, sh and ng: how to distinguish them from standalone letters (there is no problem for ch since there is no c in the transliteration scheme). I also used both ë and é, as in the picture on the same page there is an é instead of an ë. I'm sending you the TeX file, the MAP file, the TEC file and the resulting PDF (the input for Latin and Arabic is strictly the same Latin-alphabet string). If you can send me a longer text in Latin alphabet I can typeset it and we can start debugging. You need XeTeX to run this file. Cheers, Yannis http://www.imt-atlantique.fr/ <http://www.imt-atlantique.fr/> Yannis HARALAMBOUS Professor Computer Science Department UMR CNRS 6285 Lab-STICC http://perso.telecom-bretagne.eu/yannisharalambous/ <http://perso.telecom-bretagne.eu/yannisharalambous/> https://twitter.com/y_haralambous <https://twitter.com/y_haralambous> <https://www.linkedin.com/in/yannis-haralambous-5529073?trk=hp-identity-name>Technopôle <https://www.linkedin.com/in/yannis-haralambous-5529073?trk=hp-identity-name%3ETechnop%C3%B4le> Brest-Iroise CS 83818 29238 Brest Cedex 3, France Une école de l'IMT http://www.imt.fr/ <http://www.imt.fr/> Und nach einer kleinen Stille fügte Sie hinzu: Jeder Weg, der dorthin führt, war am Ende der richtige. (Michael Ende)
@yannis1962 <https://github.com/yannis1962> We cannot see the attachments here? Are they supposed to be included in polyglossia or used separately?
Sorry, the list server must have removed the attachments. you can fetch them from here: https://we.tl/t-bBPjBZghGB
(only for a week or so).
They are only test files to show Osman how easy it is to create transliterations.
Also, how hard would it be to support transliteration for Serbian Cyrillic? (I can provide bidirectional Unicode mappings.)
If you have bidirectional Unicode mappings, then it is trivial.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub <#482 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAFXC7JM7SM7JW4XT3KNZA3TE7C5VANCNFSM4ZSEW6UQ>.
<http://www.imt-atlantique.fr/> Yannis HARALAMBOUS
Professor
Computer Science Department
UMR CNRS 6285 Lab-STICC
<http://perso.telecom-bretagne.eu/yannisharalambous/> <https://twitter.com/y_haralambous> <https://www.linkedin.com/in/yannis-haralambous-5529073?trk=hp-identity-name>Technopôle Brest-Iroise CS 83818
29238 Brest Cedex 3, France
Une école de l'IMT <http://www.imt.fr/>
Der ganze Mensch bringt sich dar in der Art, wie er seine Worte darbringt. (Rudolf Koch)
|
@yannis1962 Thanks for your help. Overall, it is working. However, there is a bug related to " ' " (hamza). Is it because it is a one-to-one mapping ? For example, "wijdan'gha" -> "ۋىجدانئغا" is not correct. It should be "ۋىجدانغا". |
Maybe you can also compare your efforts with the mappings provided by ArabXeTeX. And then, we also need a LuaTeX implementation if we want to provide this. |
Hi polyglossia team, thanks for your support for the Uyghur language. I have started to edit the Uyghur latex file in the latex editor. However, I find it is a little bit hard to edit documents with Uyghur Arabic script. So, I wonder if the polyglossia package supports Uyghur written with Latin characters. Let me explain it more. There are two main scripts for the Uyghur language now. One is Uyghur Arabic, another is Uyghur Latin. It is easier to edit the source file with Uyghur Latin. However, Uyghur Arabic is used in our daily life. If Polygossia renders the Latin alphabet to Arabic when generating the pdf, it would be very helpful. Is it possible? I can provide the python code for converting Uyghur Latin to Arabic.
The text was updated successfully, but these errors were encountered: