Localization engineers are the miracle workers behind the scenes of localization workflows, and without them many of the projects we see couldn’t happen. The skillsets they possess go far beyond the sort of things that most translators know how to do, and often require the ability to code. I’ve already written a little about these sorts of things in the last three or four articles I published this month, mainly because the use of AI (tools like ChatGPT for example) is opening up the possibility for the rest of us mere mortals to benefit from the sort of things they do. Today I’m extending on another such skill that I have introduced only once before back in 2013, a decade ago! It is a very technical, and yet powerful thing to be able to tap into, so now with the help of ChatGPT I’m going to do it again!
As I’m getting lost in my own thoughts around just what to talk about next with regard to AI technologies and in particular ChatGPT… and as I’m pondering about the effect this is going to have on our industry I recalled a couple of questions around the use of XPath in the community. One of these questions was yesterday and it related to how to use XPath to extract one of the languages in a TMX file using the XML filetype in Trados Studio. Not a particularly tricky thing to do, and I imagined the user was just editing the content or maybe changing the language pair by translating one of the languages into something else, or something like that. But what struck me was the XPath expression he used.
Continuing the theme of how to make use of AI technologies to help with the more technical nature of localization I thought I could revisit an article I wrote back in 2013… this month a decade ago! In that article I explained how to write a very basic stylesheet that could be used to provide more context when translating XML files. To do that I had to learn some basics myself and that did give me enough of a skillset to pretty much create stylesheets for all kinds of basic html table based previews that I come across… but I can never claim to be an expert and if the styling or the XML was more complex I might not be able to do it at all.
With all the excitement and interest around ChatGPT these days, and with the numerous interesting projects we’re working on at RWS that involve the use of AI, its hard not to allow the lure of the technology to pull you in. Last month I wrote an article on corrupt translation memories, and in doing this I dabbled a little with SQLite queries… an SDLTM is a SQLite database. They are pretty much new to me as I’ve only used very simple statements… one liner lookups for example… in the past. So I had to read some basics and learn a little when I did that. Nothing new, I like to read this sort of material and learn a little something new anyway. So when I was asked this evening how do you remove translation units from a translation memory that have the same source and same target I immediately started to think SQLite.
This is a topic that probably occurred a lot more in the old days of Trados and Translators Workbench where it was relatively easy to corrupt a translation memory. In those days the translation memory consisted of five file types with the extensions .tmw, .mwf, .mtf, .mdf and .iix and when problems did occur it was probably related to the files that supported indexing and lookup speeds for example. The .tmw file itself that contained the translation units was less likely to be the source of the problem. So fixing it could often be achieved by creating a new translation memory with the same settings, and then simply replacing the .tmw in the new translation memory with the old one… finally reorganising. This didn’t always help, but if often did!
Everyone is probably familiar with a similar phrase, often mistakenly attributed with biblical origins, “the Lord helps those who help themselves”. The phrase actually originated in ancient Greece through one of Aesop’s fables called “Hercules and The Wagoner“:
Back in 2015 I wrote an article called “Good bugs… bad bugs!” which was all about the unintended positive side effect as a result of computer software not working as intended. I’d actually forgotten about this article until this weekend as I was pondering my own behaviour in responding to a post in the RWS Community. In fact it was my wife that got me thinking as I allowed the community thread to frustrate me because I couldn’t understand why some users can’t see reason… my reason! I had comfortably created two buckets in my mind.. either they are just incapable of understanding and I’m talking to a brick wall or they just won’t understand because they don’t want to listen since it doesn’t suit their own agenda. It didn’t help that none of my suggestions were even acknowledged, but nonetheless it took my wife to remind me that perhaps I wasn’t listening to them properly!
The most viewed article I have ever written by far was “So how many words do you think it was?” which I wrote in 2012 almost ten years ago. I revised it once in 2015 and whilst I could revise it again based on the current versions of Trados Studio I don’t really see the point. The real value of that article was understanding how the content can influence a word-count and why there could be differences between different applications, or versions of the same application, when analysing a text. But I do think it’s worth revisiting in the context of MT (machine translation) which is often measured in characters as opposed to words… and oh yes, another long article warning!
Growing a product range, buying new companies, being bought yourself, adopting new technology, reorganising etc… all of this creates significant change across an organisation that often feels as though you’re on a merry-go-round where things change as you go around until you’re back to where you started and then it all changes again. I can only imagine that feeling applies to customers and employees alike as each revolution strives to be better than the last, easier to navigate, meaningful in its purpose and full of the promise of success once properly implemented… and yet slightly confusing at the same time!
Why would you have to? Surely Ai can translate itself? If not it sounds like a pretty big topic… or I’m just confused. Acronyms can do this to you and these days we do have good reason to be confused… Multiterm/Machine Translation (MT), National Aeronautics and Space Administration/North America South America (NASA), Role Playing Game/ Rocket Propelled Grenade (RPG), Wages For Housework/Working From Home (WFH)… the latter essentially being the same!! The list is huge and these days I find myself looking something up almost every day. Ai is another one… Artificial Intelligence is probably what crossed your mind right from the start, particularly since I put it on top of a brain! I actually found 164 meanings for this acronym but only one of them matches the topic for my article… and that is Adobe Illustrator which should be a far more manageable topic for translation!