OSTD

Posts related to the Open Source Translation Database.

Disgusting: for L in `cat lang.txt | cut -f 2,3,4,5 -d’ ‘ | sed ‘s/^.//’ | sed ‘s/.$//’ | sort`; do echo -n “$L “; done More disgusting: cat lang.txt | sort | awk ‘{ a=substr($2$3, 2); sub(“)$”, “”, a); print ” \””$1″\”, \””a”\”, \”The <a href=\x27http://littlesvr.ca/ostd/\x27>OSTD</a>\”,” ; }’ It reminds me of when I …

Continue reading I’m ashamed I wrote this

I was going to show the OSTO to Chris Tyler and earlier that day, because demos never work, I tried it, from the Seneca network. Turns out already the OSTD is a victim of its own success. When translating the ISO Master POT file I get almost 6000 translated strings in 153 languages. I you …

Continue reading Size matters

Finally a couple of days ago the import of all the translated strings from most of the software in Debian into OSTD has been completed. Now there is a grand total of 11236263 translated strings! It took 1059647 seconds, which is just over 12 days. That’s 0.094 seconds per translation. I’m sure it could be …

Continue reading Debian import complete

Most of the po files in the Debian tarball follow the naming convention packagename_version_languagecode.po So for all of those I could figure out the language code using a regular expression (or three) on the filename. Armed with that and the exceptions I mentioned in the last post on this topic I was able to get …

Continue reading Language codes, part 2

While analyzing the files I got from Debian I ran into a lot of language codes that weren’t in my database already. It was an interesting exercise, involving me learning about the existence of languages such as Javanese and countries that I already forgot about. The problem is that some of the language codes are …

Continue reading Language codes, part 1

Christian Perrier from the debian-i18n list has done me a huge favour. He created a tarball with every translation in every language for every piece of software in Debian! You may imagine it’s huge as did I, but I was shocked at just how big it is. Almost 2 GB of gzip-compressed PO files from …

Continue reading Lots of translations

Here’s an OSTD feature I was really excited about and looking forward to implement: allow the user to put in some version control info for a project so that I can run a nightly cron job and pull any new translations that have been added to that project. Would have been a great feature, and …

Continue reading Oh, you Git!