Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ODT with dozens of (identical) inline styles breaks translation #23

Open
xavivars opened this issue Dec 29, 2022 · 1 comment
Open

ODT with dozens of (identical) inline styles breaks translation #23

xavivars opened this issue Dec 29, 2022 · 1 comment
Labels
enhancement New feature or request

Comments

@xavivars
Copy link

xavivars commented Dec 29, 2022

Here you have an example where translations from it are pretty much useless.

Doc-original.odt

(File has been slightly modified to alter the personal details)

Looking at the content of the first paragraph, you find something like this:

<text:p text:style-name="P45"><text:span text:style-name="T47">R</text:span><text:span text:style-name="T47">esolución </text:span><text:span text:style-name="T32">de la Dirección General de</text:span><text:span text:style-name="T32"> C</text:span><text:span text:style-name="T32">alidad </text:span><text:span text:style-name="T32">y Educación Ambie</text:span><text:span text:style-name="T32">ntal por la que se procede </text:span><text:span text:style-name="T32">a</text:span><text:span text:style-name="T32">l cambio de </text:span><text:span text:style-name="T32">titularidad</text:span><text:span text:style-name="T143"> de </text:span><text:span text:style-name="T26">la</text:span><text:span text:style-name="T26"> autorización ambiental integrada </text:span><text:span text:style-name="T26">otorgada a la empresa</text:span><text:span text:style-name="T32"> </text:span><text:span text:style-name="T32">XXXXXXXX XXXXX XXXXXX</text:span><text:span text:style-name="T32">,</text:span><text:span text:style-name="T32"> </text:span><text:span text:style-name="T32">con </text:span><text:span text:style-name="T32">número de </text:span><text:span text:style-name="T32">N</text:span><text:span text:style-name="T32">IF </text:span><text:span text:style-name="T32">P6345435F</text:span><text:span text:style-name="T32"> </text:span><text:span text:style-name="T32">para un</text:span><text:span text:style-name="T32">a </text:span><text:span text:style-name="T32">planta de tratamiento de resi</text:span><text:span text:style-name="T32">duos urbanos</text:span><text:span text:style-name="T29">, </text:span><text:span text:style-name="T29">con </text:span><text:span text:style-name="T29">NIMA </text:span><text:span text:style-name="T29">1111111111</text:span><text:span text:style-name="T29"> </text:span><text:span text:style-name="T29">y <text:s/></text:span><text:span text:style-name="T29">número </text:span><text:span text:style-name="T29">de RICV</text:span><text:span text:style-name="T29"> </text:span><text:span text:style-name="T29">111</text:span><text:span text:style-name="T29">/AAI/CV </text:span><text:span text:style-name="T32">ubicad</text:span><text:span text:style-name="T32">a</text:span><text:span text:style-name="T32"> en </text:span><text:span text:style-name="T32">el </text:span><text:span text:style-name="T32">Camí s/n, Partida El Plà, del término municipal de (Valencia)</text:span><text:span text:style-name="T29">, </text:span><text:span text:style-name="T29">a favor de la mercantil </text:span><text:span text:style-name="T29">XXXXXXXX XXXXX XXXXXX, SL</text:span><text:span text:style-name="T29">, con número de NIF </text:span><text:span text:style-name="T29">B1234567</text:span><text:span text:style-name="T29">.</text:span></text:p>

But on the original file, before I obfuscated the details (names, addresses, etc), style IDs were slightly different, because they also included a officeooo:rsid="01c8d657" attribute in their definition

@TinoDidriksen
Copy link
Owner

Should be a lot better now. Need to further determine which style attributes that can be omitted and thus allow for more merging.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants