Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for converting Uyghur Latin Script (tex) to Uyghur Arabic (PDF) #482

Open
neouyghur opened this issue Mar 22, 2021 · 10 comments
Open

Comments

@neouyghur
Copy link
Contributor

Hi polyglossia team, thanks for your support for the Uyghur language. I have started to edit the Uyghur latex file in the latex editor. However, I find it is a little bit hard to edit documents with Uyghur Arabic script. So, I wonder if the polyglossia package supports Uyghur written with Latin characters. Let me explain it more. There are two main scripts for the Uyghur language now. One is Uyghur Arabic, another is Uyghur Latin. It is easier to edit the source file with Uyghur Latin. However, Uyghur Arabic is used in our daily life. If Polygossia renders the Latin alphabet to Arabic when generating the pdf, it would be very helpful. Is it possible? I can provide the python code for converting Uyghur Latin to Arabic.

@jspitz
Copy link
Collaborator

jspitz commented Mar 22, 2021

We do support multiple scripts (same in in- and output) via the script option. We do not have a case yet where we support transliteration, though.

Wikipedia tells me that three scripts are common in different regions: Arabic, Latin, and Cyrillic. Given this, a script option would make sense.

@jspitz
Copy link
Collaborator

jspitz commented Mar 22, 2021

Note, though, that the ArabXeTeX package supports transliteration also for Uyghur (via TECKit fontmappings). There is also ArabLuaTeX, but this doesn't yet support Uyghur yet.

I am not sure polyglossia and ArabXeTeX can be used together.

@neouyghur
Copy link
Contributor Author

@jspitz is there any tutorial for script option?

@jspitz
Copy link
Collaborator

jspitz commented Mar 22, 2021

No, just look at other languages that provide this option (e.g., Kurdish)

@yannis1962
Copy link

yannis1962 commented Mar 22, 2021 via email

@yannis1962
Copy link

yannis1962 commented Mar 22, 2021 via email

@ivankokan
Copy link
Contributor

dear Osman, here is support for converting Latin to Arabic script for the Uyghur language. I have used the ULY correspondence as described in Wikipedia https://en.wikipedia.org/wiki/Uyghur_alphabets. Remains the problem of the digraphs zh, sh and ng: how to distinguish them from standalone letters (there is no problem for ch since there is no c in the transliteration scheme). I also used both ë and é, as in the picture on the same page there is an é instead of an ë. I'm sending you the TeX file, the MAP file, the TEC file and the resulting PDF (the input for Latin and Arabic is strictly the same Latin-alphabet string). If you can send me a longer text in Latin alphabet I can typeset it and we can start debugging. You need XeTeX to run this file. Cheers, Yannis http://www.imt-atlantique.fr/ Yannis HARALAMBOUS Professor Computer Science Department UMR CNRS 6285 Lab-STICC http://perso.telecom-bretagne.eu/yannisharalambous/ https://twitter.com/y_haralambous <https://www.linkedin.com/in/yannis-haralambous-5529073?trk=hp-identity-name>Technopôle Brest-Iroise CS 83818 29238 Brest Cedex 3, France Une école de l'IMT http://www.imt.fr/ Und nach einer kleinen Stille fügte Sie hinzu: Jeder Weg, der dorthin führt, war am Ende der richtige. (Michael Ende)

@yannis1962 We cannot see the attachments here? Are they supposed to be included in polyglossia or used separately?

Also, how hard would it be to support transliteration for Serbian Cyrillic? (I can provide bidirectional Unicode mappings.)

@yannis1962
Copy link

yannis1962 commented Mar 22, 2021 via email

@neouyghur neouyghur reopened this Mar 22, 2021
@neouyghur
Copy link
Contributor Author

@yannis1962 Thanks for your help. Overall, it is working. However, there is a bug related to " ' " (hamza). Is it because it is a one-to-one mapping ? For example, "wijdan'gha" -> "ۋىجدانئ‍غا" is not correct. It should be "ۋىجدانغا".

@jspitz
Copy link
Collaborator

jspitz commented Mar 23, 2021

Maybe you can also compare your efforts with the mappings provided by ArabXeTeX. And then, we also need a LuaTeX implementation if we want to provide this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants