Translation for 3.0

From Apache OpenOffice Wiki
Jump to: navigation, search

This page summarizes useful information on the localization work needed for the OpenOffice.org 3.0 version.

Translation Schedule


Total Estimated Translation Volume

GUI approx. 80,000 words
Help approx. 425,000 words

New/Changed Estimated Translation Volume since 2.4

GUI approx. 4,000 words
Help approx. 35,000 words

en_US Source Strings

For every milestone on the DEV300 codeline the current en-US source strings are extracted and uploaded to:

Diffs of all new and changed strings from each milestone can be found at:

Removed strings are not part of those diffs.

How to deliver translated files

  • provide sdf files that contain translated strings only (please remove non translated strings from the sdf file)
  • provide a GSI / SDF file containing both the translated strings and the corresponding en-US source strings. Remove untranslated strings from the sdf file. Please note that the en-US strings have to be the same milestone like your translation.
  • please make sure that the GSI / SDF file format is not violated (format errors like wrong amount of tabs, shifted columns, ... ) by using "gsicheck". Please use the latest version at: http://ooo.services.openoffice.org/gsicheck/ usage: gsicheck -c myfile.sdf. In case of errors please use the log file to fix them.
  • file an issuezilla bug to "vg@openoffice.org", assign the issue to "vg@openoffice.org", Target milestone to "OOo 3.0" , Component "l10n" , Subcomponent "code" , Issue type "ENHANCEMENT". Please don't attach your file directly to the issue, but provide an URL / link pointing to your file. Please do attach only if you don't have any other web space available (http://qa.openoffice.org/issue_handling/project_issues.html)

Release Map

Release map with translation deadlines specific for the 3.0 release can be found at: http://wiki.services.openoffice.org/wiki/OOoRelease30.


Translation Schedule for Pootle Users

  • January 30th - Translation start date
  • February 19th - Translators upload all complete translated files to Pootle and stop translating
  • February 20th - Pootle admistrators update Pootle with new content
  • February 21st - Translators can start translating again
  • March 12th - Translators upload all complete translated files and stop translating
  • March 13th - Pootle admistrators update Pootle with new content
  • March 14th - Translators can start translating again
  • April 3rd - Final translation delivery date for Beta
  • April 17th - <s>POOTLE DOWNTIME for Beta Readme and smaller updates
  • April 18th - OOo Readme via issue: http://www.openoffice.org/issues/show_bug.cgi?id=88345
  • April 22nd - <s>POOTLE UPLOAD for Beta Readme and smaller updates Translation delivery for Beta
  • May 11th - Translators upload all complete translated files to Pootle and stop translating
  • May 12th - Pootle admistrators update Pootle with new content
  • May 13th - Translators can start translating again
  • June 4th at 10:00 am CEST - Translators upload all complete translated files and stop translating
  • June 5th - Pootle admistrators update Pootle with new content (en_US sdf m16)
  • June 6th - Translators can start translating again. Latest translations (approx. 5,000 words)
  • June 27th - Translation delivery date for OOo 3.0 final
  • July 4th - last delivery date for translation and language fixes


New/Changed Translation Volume - 1st handover

GUI approx. 200 words
Help approx. 18,000 words

New/Changed Translation Volume - 2nd handover

Help approx. 5,800 words

New/Changed Translation Volume - 3rd handover

GUI approx. 1,000
Help approx. 5,000

Translating with Pootle

  • http://www.sunvirtuallab.com:32300/
  • Pootle administrators make sure Pootle content is updated according to translation schedule
  • L10n lead give the Go to start translation on Pootle (announcement to the dev@l10n list)
  • Native Language leads make translation assignments following the Pootle User's Guide
  • Translation teams work with Language team to make sure the translation is reviewed
  • Native Language/translation leads make sure translation is complete within the deadlines
  • Native Language/translation leads communicate translation completion to l10n lead
  • Pootle administrator downloads the translated files and provide them to release engineering

It is recommended to use Pootle to manage the translation process and assignments. For translation it is recommended to download the files and translate them with a translation editor which support translation memory functionality. Linguistic review can be then performed right after translation. It is recommended to use the online translation editor for corrections and for small translation volume only.

Teams translating with Pootle

More and more teams are using Pootle on the sunvirtuallab to translate. Here the complete list.


Tips and Tools

Team

build a team of translators and 1-2 reviewers to work on the project. I recommend to keep the number of the reviewers to 1-2 people, since the more translators and reviewers working on the project, the less you can ensure quality and consistency.

Glossary

SunGloss (https://g11nportal.sun.com/sungloss) is the repository of the latest terminology used in the StarOffice/OpenOffice.org product. You can request access to consult the tool. We recommend to use SunGloss instead of a spreadsheet because it shows all entries (including deleted entries that were used but not anymore). In this way it's possible for all to look at the evolution of a translation, and discuss about it when it's needed, especially during the review process.

Translation Memories

Latest TMX files can be found at: http://ooo.services.openoffice.org/pub/OpenOffice.org/cws/upload/localization/tmx24/

TMX can also be created as follows:

  • download po files from Pootle
  • run po2tmx

For more information please refer to below page for how to create TM: http://wiki.services.openoffice.org/wiki/Pootle_Glossary_Guide#Translation_Memory_in_Pootle

SDF to PO Convertion Tools

The .sdf file can be converted to .po files using the oo2po tool and converted back to .sdf format using the po2oo tool from the Translate Toolkit package: http://translate.sourceforge.net/

Documentation with examples for conversion and backconversion can be found at: http://translate.sourceforge.net/wiki/toolkit/oo2po

PO files provided on the Pootle deployment for OpenOffice.org handle duplicate messages using KDE style (oo2po --duplicates=msgid_comment). For more information see here : http://translate.sourceforge.net/wiki/toolkit/duplicates_duplicatestyle

It is important that if you are uploading and merging batches of PO content to the Pootle server that your content conforms to this style, otherwise it may not merge correctly with the content already present on the server.

Gsicheck Tool

gsicheck to make sure the translated .sdf files are not corrupted can be found at: http://ooo.services.openoffice.org/gsicheck/

Translation Editors

The most used translation editors supporting the gettext .po file format are:

PoEdit, Kbabel, WordForge and Virtaal run on both Linux and Windows. Kbabel has long been the most popular PO editor for these platforms, but gTranslator and PoEdit have planned improvements which may make them more competitive, and WordForge is a new editor which is rapidly becoming popular.

gTranslator runs on Linux and some BSD platforms.

LocFactoryEditor runs only on Mac OSX. It handles XLIFF natively, and supports Apple formats, gettext formats, SVN submission and submission by email to projects like the TP (TP Robot) and Debian (Debian BTS). It also converts between PO compendia and TMX.

Please make sure you are running the latest version of gettext, to benefit from its new features, like contextual handling and comparison with previous original strings.

Don't forget that spellcheckers like Aspell have a wide range of dictionaries for well over 70 languages. Spellcheckers not only check your spelling: they are great for catching typos. ;)

Please add further information on these and other tools that may help other translators to perform their job.

Coverage

The following figures are calculated by accumulating entries in the files named localize.sdf released as contrib/rc/3.0.0rc1/OOo_3.0.0rc1_src_l10n.tar.bz2.
Unused entries are excluded based on the following assumptions:

The number of entries to be translated

OpenOffice.org 3.0.0rc1 has, at told, 80,483 entries to be translated.

UI Help Total
27,640 52,843 80,483

UI denotes User Interface; Help denotes On-line Help.

Coverage of each language

The following table shows the coverage of translation in OpenOffice.org 3.0.0rc1 (OOO300_m5).

UI
Coverage From 0 % 10 % 20 % 30 % 40 % 50 % 60 % 70 % 80 % 90 %
To 10 % 20 % 30 % 40 % 50 % 60 % 70 % 80 % 90 % 100 %
The number of languages 24 1 0 1 0 1 2 5 21 53
Cumulative 108 84 83 83 82 82 81 79 74 53
Language Coverage
af 90 % af
ar 96 % ar
as 1 % as
as-IN 79 % as-IN
be-BY 97 % be-BY
bg 100 % bg
bn 14 % bn
bn-BD 0 % bn-BD
bn-IN 0 % bn-IN
br 98 % br
bs 88 % bs
ca 100 % ca
cs 100 % cs
cy 92 % cy
cz 1 % cz
da 100 % da
de 100 % de
dz 89 % dz
el 97 % el
en-GB 96 % en-GB
en-ZA 89 % en-ZA
eo 96 % eo
es 100 % es
et 100 % et
eu 6 % eu
fa 64 % fa
fi 89 % fi
fr 100 % fr
ga 91 % ga
gl 100 % gl
gl-ES 1 % gl-ES
gu 94 % gu
gu-IN 88 % gu-IN
he 91 % he
hi-IN 89 % hi-IN
hr 96 % hr
hu 100 % hu
it 100 % it
ja 100 % ja
ka 92 % ka
kk 60 % kk
km 99 % km
kn 2 % kn
kn-IN 0 % kn-IN
ko 100 % ko
ku 81 % ku
ln 1 % ln
lo 0 % lo
lt 100 % lt
lv 0 % lv
mk 100 % mk
ml 1 % ml
ml-IN 75 % ml-IN
mn 95 % mn
mr 1 % mr
mr-IN 76 % mr-IN
ms 0 % ms
my 72 % my
nb 100 % nb
ne 95 % ne
nl 100 % nl
nn 100 % nn
no 0 % no
nr 89 % nr
ns 89 % ns
oc 2 % oc
or 1 % or
or-IN 87 % or-IN
pa-IN 97 % pa-IN
pl 100 % pl
pt 100 % pt
pt-BR 100 % pt-BR
ru 100 % ru
rw 96 % rw
sh 99 % sh
sh-YU 0 % sh-YU
sk 93 % sk
sl 100 % sl
sr 99 % sr
sr-CS 0 % sr-CS
ss 89 % ss
st 89 % st
sv 100 % sv
sw 0 % sw
sw-TZ 89 % sw-TZ
ta 93 % ta
ta-IN 88 % ta-IN
te 1 % te
te-IN 87 % te-IN
tg 92 % tg
th 88 % th
ti 1 % ti
ti-ER 35 % ti-ER
tlh 0 % tlh
tn 58 % tn
tr 100 % tr
ts 89 % ts
uk 99 % uk
ur 1 % ur
ur-IN 81 % ur-IN
uz 75 % uz
ve 88 % ve
vi 100 % vi
xh 89 % xh
zh-CN 100 % zh-CN
zh-TW 100 % zh-TW
zu 89 % zu
en-US 100 % en-US
Help
Coverage From 0 % 10 % 20 % 30 % 40 % 50 % 60 % 70 % 80 % 90 %
To 10 % 20 % 30 % 40 % 50 % 60 % 70 % 80 % 90 % 100 %
The number of languages 47 2 2 2 0 0 3 3 29 20
Cumulative 108 61 59 57 55 55 55 52 49 20
Language Coverage
af 0 % af
ar 36 % ar
as 0 % as
as-IN 0 % as-IN
be-BY 0 % be-BY
bg 89 % bg
bn 0 % bn
bn-BD 0 % bn-BD
bn-IN 0 % bn-IN
br 87 % br
bs 83 % bs
ca 61 % ca
cs 90 % cs
cy 84 % cy
cz 0 % cz
da 90 % da
de 100 % de
dz 82 % dz
el 83 % el
en-GB 87 % en-GB
en-ZA 83 % en-ZA
eo 82 % eo
es 100 % es
et 89 % et
eu 0 % eu
fa 0 % fa
fi 83 % fi
fr 100 % fr
ga 83 % ga
gl 69 % gl
gl-ES 0 % gl-ES
gu 81 % gu
gu-IN 82 % gu-IN
he 84 % he
hi-IN 82 % hi-IN
hr 86 % hr
hu 100 % hu
it 100 % it
ja 100 % ja
ka 30 % ka
kk 0 % kk
km 90 % km
kn 0 % kn
kn-IN 0 % kn-IN
ko 100 % ko
ku 14 % ku
ln 0 % ln
lo 0 % lo
lt 84 % lt
lv 0 % lv
mk 85 % mk
ml 0 % ml
ml-IN 0 % ml-IN
mn 2 % mn
mr 0 % mr
mr-IN 0 % mr-IN
ms 0 % ms
my 0 % my
nb 25 % nb
ne 79 % ne
nl 93 % nl
nn 21 % nn
no 0 % no
nr 0 % nr
ns 0 % ns
oc 0 % oc
or 0 % or
or-IN 0 % or-IN
pa-IN 0 % pa-IN
pl 90 % pl
pt 90 % pt
pt-BR 100 % pt-BR
ru 93 % ru
rw 82 % rw
sh 82 % sh
sh-YU 0 % sh-YU
sk 84 % sk
sl 90 % sl
sr 82 % sr
sr-CS 0 % sr-CS
ss 0 % ss
st 0 % st
sv 100 % sv
sw 0 % sw
sw-TZ 82 % sw-TZ
ta 60 % ta
ta-IN 82 % ta-IN
te 0 % te
te-IN 79 % te-IN
tg 85 % tg
th 83 % th
ti 0 % ti
ti-ER 2 % ti-ER
tlh 0 % tlh
tn 0 % tn
tr 88 % tr
ts 0 % ts
uk 10 % uk
ur 0 % ur
ur-IN 79 % ur-IN
uz 7 % uz
ve 0 % ve
vi 85 % vi
xh 0 % xh
zh-CN 100 % zh-CN
zh-TW 100 % zh-TW
zu 0 % zu
en-US 100 % en-US

Instructions

tar -jxf contrib/rc/3.0.0rc1/OOo_3.0.0rc1_src_l10n.tar.bz2
f=`find OOO300_m5/ -name localize.sdf`
a=OOO300_m5_all_excluding_unused_entries.csv
perl statictics_excluding_unused_entries_20080925.pl $f > $a 2> OOO300_m5_all_excluded.txt
grep -e '^helpcontent2' $a > h
grep -e -v '^helpcontent2' $a > u
head -1 $a > 1
cat u > OOO300_m5_all_excluding_unused_entries_ui.csv 
cat 1 h > OOO300_m5_all_excluding_unused_entries_help.csv 
wc -l OOO300_m5_all_excluding_unused_entries_*.csv 
   52844 OOO300_m5_all_excluding_unused_entries_help.csv
   27641 OOO300_m5_all_excluding_unused_entries_ui.csv
   80485 total

wc -l OOO300_m5_all_excluded.txt
   10496 OOO300_m5_all_excluded.txt

Open those .csv files with OpenOffice.org 2.4.1 and summarize the figures. 

Files

(a) has links to (b) and (c).

Those files are attached in the issue 94289.

Q & A

Personal tools