The PerfectMatch…

mage: A heart made of puzzle pieces with a piece missing.In the world of translation, Trados Studio’s PerfectMatch feature is like the overachieving student who always gets straight A’s, and its academic partner is the brilliant but slightly disorganised professor.  PerfectMatch, with its meticulous and precise matching capabilities, often finds itself patiently sorting through the professor’s vast but somewhat chaotic repository of knowledge.  Picture PerfectMatch as the diligent student, poring over texts late into the night, determined to find that one perfect translation match.  Meanwhile, the academic partner is the genius who wrote the book on translation but can’t quite remember where they put it. Together, they form an unlikely but unstoppable duo – the PerfectMatch feature meticulously cross-referencing every word while the academic partner brings a wealth of knowledge, albeit sometimes hidden under a pile of papers.  It’s a partnership where precision meets wisdom, creating translations that are not just accurate, but also enlightened!

A lovely thought… but perhaps not quite reality.  I was prompted to write this article after a couple or three coinciding events.  The first was a user in the RWS Community asking how to make a particular feature related to the use of Perfect Match work; the second was a new course released by our training teams called “Working with Perfect Match“; the third was the Localization Engineer course I went through in my last article.  The feature in question is one of the three options you have when PerfectMatching files in a project… and I should add that the PerfectMatch feature is only available in the Professional version of Trados Studio.  Freelance users have a neat feature called “Update file” that achieves a similar result and uses PerfectMatch in the background to do it, but is only for one file at a time and will actually replace the files in your existing project.  The professional version is a little more sophisticated and is great for managing mid-project updates when working with large projects and multiple target languages.

The PerfectMatch feature works something like this:

  1. Document Analysis: PerfectMatch starts by analyzing the new version of a document that needs updating, comparing it with the previous version of the same document.
  2. Identification of Matches: It identifies segments (sentences or phrases) in the new document that exactly match those in the previous version.  This includes both untranslated source text and already translated target text.
  3. Locking of Matches: Once these perfect matches are identified, PerfectMatch can optionally lock these segments.  Locking means these segments will not need to be translated again, as they are identical to what was previously translated.
  4. Focus on New and Changed Content: The translator can then focus on translating only the new or changed segments in the document.  This significantly reduces the amount of work required, as previously translated content is reused.
  5. Consistency and Efficiency: This process ensures consistency in translation, especially for documents that are frequently updated, like manuals or product descriptions.  It also saves time and resources by avoiding re-translation of unchanged content.

So in a nutshell it’s a very handy feature for a translation office.

The very specific features this article is about are how you use it in practice, and in particular this part:

Screenshot of Trados Studio's PerfectMatch feature, showing menu options 'Matching Previous Documents from Folder...', 'Specific Previous Document...', and 'Previous Documents from Map...'

The three options for adding matching files… although the first two are simple and well documented in the help, videos explaining how to use it, and also the new training course.  But what’s not so well documented… even if it’s in the product help when you know exactly what to search for (PerfectMatch map files), and it’s not part of the new training course (it’s only Level 2 so there may be more in the pipeline) as far as I can see, is how to use “Previous Documents from Map…”.  This feature is particularly useful for a translation office who probably automate a lot of the work they do and when you have projects going into 20+ target languages, and files being sent to different locations for translation, the management of this task and creation of the map files isn’t simple… or rather it’s a logistically challenging task rather than a complex one.

So coming back to the triggers for this article they boil down to being able to provide an answer to the community question, an explanation of the feature that is omitted from the available documentation and training, and an opportunity to show why the localization engineering course is so useful.  But first let me explain how this is intended to work.

Some Product Help…

To make this simple I’m going to cheat a little and this is because the tech pubs team pointed me to the 2022 SR2 version of the help where they do explain quite nicely how this works:

Key concepts -> About Perfect Match -> PerfectMatch map files

The basic idea being you create a tab delimited file with a .txt extension formatted like this:

Sample Folder A\SampleFileAV1.docx [...]\Users\jsmith\Documents\Studio 2015\Projects\Project 7\de-DE\SampleFileAV2.docx.sdlxliff

Sample Folder B\SampleFileBV1.docx [...]\Users\jsmith\Documents\Studio 2015\Projects\Project 7\de-DE\de-DE\SampleFileBV2.docx.sdlxliff

Sample Folder C\SampleFileCV1.docx [...]\Users\jsmith\Documents\Studio 2015\Projects\Project 7\de-DE\SampleFileCV2.docx.sdlxliff

The key concepts being that this is a 2-column file with the first column being a relative path to the file in your project (without the sdlxliff extension) and the second column being an absolute path to the previously translated bilingual file (that does have an sdlxliff extension).

Seems simple enough until you consider an example of how this might be used in practice.

My Translation Office…

So to test this out I mocked up an office that has a localization engineer preparing projects and four different translation teams.  These teams all work with project templates prepared by the localization engineer to ensure that all projects are managed internally using the appropriate customised templates to suit the work being carried out.  This also means in my office we don’t need to work with project packages and we instead work with the bilingual files.  We could work with packages, but I quite like the file-based approach as it’s light and convenient and I can easily manage the work this way without any difficulty at all.

So my Translation Offices are set up like this on my drive:

A screenshot depicting a folder structure related to translation work. The main folder '[TranslationOffices]' contains subfolders for different countries: '[France]', '[Germany]', '[Italy]', and '[Spain]'. Inside the '[France]' folder, there is a further subfolder named '[New_Projects]', which itself contains '[003 - Files Only]' and '[004 - Files and Folders]'. The folders are depicted with traditional folder icons, and the structure indicates an organized file system for managing translation projects.

Nothing over complicated and I only work with 4 target languages in my fictional company.  I have a location on my hard drive for each office, a folder for new projects within each office and a list of the projects being worked on in the new projects folder.

My Localization Office…

I also have a Localization Office where I have folders matching the project names of those I create in Trados Studio and these will contain my Map files:

Screenshot of a directory tree with a main folder titled '[LocalizationOffice]'. Within it are two subfolders: '[003 - Files Only]' and '[004 - Files and Folders]'.

Again nothing complex and I’m using this folder for my map files in this PerfectMatch process.  The original Trados Studio projects are created by me in the default Trados Studio projects location and I copy the sdlxliff files from the target language folders into the appropriate locations for my Translation Office.

All nice and simple, but could be a logistic nightmare and easily prone to errors without careful management and control of the process.  So let’s talk about what the Localization Office can do here to automate this process somewhat.  To do this let’s look at what we need to do in practice for a Perfect Match process to work:

  1. Create a Trados Studio Project with the original source files
  2. Copy the sdlxliff files from this project into the Translation Office folders based on language so each office can manage their own translation process
  3. Create a Map file so Trados Studio will know where these files are for a PerfectMatch operation if needed later in the process

To try and help visualise this a little better I mapped it out… roughly:

Flowchart detailing the process for managing translation projects in different European countries. The central loop outlines the core steps: creating Trados Studio projects, copying bilingual files to translation offices, creating map files for a perfect match, and updating projects. Four spokes lead to individual processes for Italy, Spain, France, and Germany, each involving creating country-specific projects, translating and reviewing, and exporting bilingual files.

This then boils down to just three simple things.  The first, create your Studio Project, I briefly discussed as I introduced powershell automation to my blog many years ago in 2013 with an article called “The Powershell what?“.  It explains briefly using Powershell to work with the APIs in Trados Studio to really speed up the things you do.  It would also be remiss of me not to mention an excellent github project created by Evzen Polenka called “SDL Trados Studio Automation Kit (STraSAK)“… although it might need rebranding these days to “TraSAK”.  It needs updating as Evzen has retired from localization and it only supports up to the 2021 version of Trados Studio, but the content provides a nice overview of the sort of things a localization engineer can do for you if you hire one!  In fact I’ll reproduce the table Evzen created here as it’s a nice eye opener for anyone who wasn’t aware:

Command Description
New-Project Create new project – optionally based on project template or another project – in specified location, using specified source and target languages and TMs from specified location. Get source files from specified location and automatically convert them to translatable format and copy them to target languages. Optionally also pre-translate and analyze the files, saving results to Trados 2007-formatted log.
PseudoTranslate Pseudo-translate specified project, using specified pseudo-translation options. Optionally also export the pseudo-translated target files.
Export-Package Create translation packages from specified project, using specified package options, and save them to specified location
Import-Package Import return packages from specified location in a specified project
ConvertTo-TradosLog Convert Studio XML-based report to Trados 2007-formatted log
Export-TargetFiles Export target files from specified project to specified location
Update-MainTMs Update main translation memories of specified project
New-FileBasedTM Create new translation memory in specified location, using specified options
Export-TMX Export one or more Trados Studio translation memories to TMX, optionally applying a filter
Import-TMX Import content from TMX file in a specified TM, optionally applying a filter

For this article however, I’m not going to include the project automation part as I want to show the Studio UI as we work through this.  The second and third items I can address with another script of some sort, not using the Trados Studio APIs, but just copying files around and creating a tab delimited text file for the mapping in PerfectMatch.

Not being an expert on scripting, but at least knowing what I need it to do, I used ChatGPT to create this script.  It took me at least a dozen iterations of the script to get it right by working in small chunks exactly as I would do the process manually.  I did have to go back and edit a few things but all in all this was a surprisingly simple task.  I actually started by asking ChatGPT what the best way to do this would be and it came back with some suggested code for moving files around written in Python.  This felt like overkill and so I asked if using a DOS script and Robocopy might be easier.  Based on my initial question the answer was positive, but then I realised I wanted to point to an sdlproj file in the Trados Studio project folders and actually parse it to see what the project name was, where it was and record the relative paths to the files, and of course what languages were in the project.  So for this Python was looking preferable.  But as I have used Powershell in a limited way before I opted for that since Powershell is also pretty good at parsing XML… and ChatGPT of course obliged.

A dozen iterations later by building up the script in bite sized chunks (you cannot just ask ChatGPT to do the whole task) and I have a script that allows me to work like this:

  1. Create my Trados Studio Project (the manual way for this exercise)
  2. Run my script and provide the path to the created project
  3. The script then copies the files into the appropriate Translation Office language folders, creates the Map files (one for each language) which it places into the appropriate Localization Office folder

To run it I just saved the file as a *.ps1 file, associated Powershell with this extension in Windows and then I can double click my script file and the whole thing takes a few seconds to run after I created my initial Trados Studio project.  My Translation Offices all have their files, I have my mapping files and now I’m ready for when I receive updated files from my client.

I’m not going to document everything I did because there are many ways to work in a Translation business and I am not suggesting for a second that this is what everyone should do.  This is just an exercise to explain how the Mapping files in Perfect Match work, and also (I hope) a further intro to what benefits employing a localization engineer could bring you in your business today.

But all of this is best demonstrated live, so here’s a video of the process in operation and examples of how to PerfectMatch with a Map file.

Approx. 25 mins… so get a cup of tea before watching!

Post Article Update:
And to conclude with a post published article update… if you use this feature and agree that having the ability to update a project with one map file as opposed to one map file for each language then vote for this idea:

Improve the PerfectMatch “Previous documents from Map…” feature

Back to school… again!

The image depicts a stylised portrait of a person wearing academic attire. The individual has a neatly trimmed beard and moustache, and they are wearing a blue graduation cap with a tassel on the right side. They appear to be smiling contentedly, with their eyes closed in a serene expression. The figure's graduation gown is dark blue, and they are wearing a white shirt with a green tie underneath. The image has a clean and modern vector art style, with flat colours and simple shapes for features.After I did my last studies, apart from all the endless mandatory HR type training we have to endure these days, I thought that would be it for any sort of formal training for me.  In fact the main reason for me doing my last formal studies, TCLoc Masters degree at the University of Strasbourg, was to fill the gaps I thought I had given a complete lack of education in the field I’ve been working for the last 17-years.  That degree was very useful and I definitely learned a lot and filled some gaps, but whilst there was an element of technical localization to it I think it only scratched the surface and didn’t really cover the sort of skills that I think are needed, not just for localization engineers, but also for professional translators and project managers, working in technical localization today.

Continue reading “Back to school… again!”

Character counts…

The most viewed article I have ever written by far was “So how many words do you think it was?” which I wrote in 2012 almost ten years ago.  I revised it once in 2015 and whilst I could revise it again based on the current versions of Trados Studio I don’t really see the point.  The real value of that article was understanding how the content can influence a word-count and why there could be differences between different applications, or versions of the same application, when analysing a text.  But I do think it’s worth revisiting in the context of MT (machine translation) which is often measured in characters as opposed to words… and oh yes, another long article warning!

Continue reading “Character counts…”

The portfolio merry-go-round…

Growing a product range, buying new companies, being bought yourself, adopting new technology, reorganising  etc… all of this creates significant change across an organisation that often feels as though you’re on a merry-go-round where things change as you go around until you’re back to where you started and then it all changes again.  I can only imagine that feeling applies to customers and employees alike as each revolution strives to be better than the last, easier to navigate, meaningful in its purpose and full of the promise of success once properly implemented… and yet slightly confusing at the same time!

Continue reading “The portfolio merry-go-round…”

Translating AI…

Why would you have to?  Surely Ai can translate itself?  If not it sounds like a pretty big topic… or I’m just confused.  Acronyms can do this to you and these days we do have good reason to be confused… Multiterm/Machine Translation (MT), National Aeronautics and Space Administration/North America South America (NASA), Role Playing Game/ Rocket Propelled Grenade (RPG), Wages For Housework/Working From Home (WFH)… the latter essentially being the same!!  The list is huge and these days I find myself looking something up almost every day.  Ai is another one… Artificial Intelligence is probably what crossed your mind right from the start, particularly since I put it on top of a brain!  I actually found 164 meanings for this acronym but only one of them matches the topic for my article… and that is Adobe Illustrator which should be a far more manageable topic for translation!

Continue reading “Translating AI…”

Let’s learn about XML…

This year at the Spring Trados Roadshows the emphasis was firmly placed upon education.  Almost all the presentations were based on providing translators, project managers, localization engineers etc. with great material to help them as they work with the Trados toolsets.

I had a few presentations at this event and decided it might be useful to post a few of them here, especially the ones that might help with some of the common filetype questions we see in the communities from time to time.

Continue reading “Let’s learn about XML…”

Not your usual stuff!

Time seems to be going faster as I’m getting older as it doesn’t seem that long ago since we saw the release of the 2019 version of SDL Trados Studio.  But here we are, it is that time again and many users will already have noticed they have a shiny new version in their account… SDL Trados Studio 2021.  Fast as it is, we don’t want to do these product launches too often because I can tell you it’s a major undertaking requiring no small amount of coordination between the product management teams, core development teams, AppStore team, support teams, customer success teams, marketing teams, sales teams, back office teams, IT teams, 3rd party developers who provide plugins and more.  In addition to this we often have other projects on the go and many of the teams worked on the new website which also went live this week, AND everyone did all of this while having to work isolated from their colleagues while working from home.  Quite an achievement and I certainly feel proud to be part of this SDL team, and not just because of how well they all work together.

Continue reading “Not your usual stuff!”

The “Old Farts Language Code Club”…

The Old Farts Language Code Club! This is a new club, inspired by a comment in the SDL Community from a prospective member. I’m not sure yet of the age at which you can qualify for membership, but in addition to the age requirements, which may have stringent rules to prevent any young whippersnappers from joining, it’s essential that prospective members have a good grasp of the language codes used in Trados Studio. I’m also not sure of the demand, so I may open a waiting list that could include anyone who already makes good use of the language codes in Trados Studio but isn’t an old fart yet!

Continue reading “The “Old Farts Language Code Club”…”

Wot! No target!!

The origin of Chad (if you’re British) or Kilroy (if you’re American) seems largely supposition.  The most likely story I could find, or rather the one I like the most, is that it was created by the late cartoonist George Edward Chatterton ‘Chat’ in 1937 to advertise dance events at a local RAF (Royal Air Force) base.  After that Chad is remembered for bringing attention to any shortages, or shortcomings, in wartime Britain with messages like Wot! No eggs!!, and Wot! No fags!!.  It’s not used a lot these days, but for those of us aware of the symbolism it’s probably a fitting exclamation when you can’t save your target file after completing a translation in Trados Studio!  At least that would be the polite exclamation since this is one of the most frustrating scenarios you may come across!

At the start of this article I fully intended this to be a simple description of the problems around saving the target file, but like so many things I write it hasn’t turned out that way!  But I found it a useful exercise so I hope you will too.  So, let’s start simple despite that introduction because the reasons for this problem usually boil down to one or more of these three things:

  1. Not preparing the project so it’s suitable for sharing
  2. Corruption of a project file
  3. A problem with the source file or the Studio filetype

Continue reading “Wot! No target!!”

A business resolution for 2019…

There are three things that have stood out for me this year.  The first is how much support SDL have provided to their users to make sure they are able to work successfully with their investment.  The second is how little many users are aware of this, and the third is just how many users have used Trados for a decade or more and were not aware of what a support & maintenance contract can bring you.  This last one has been the biggest surprise to me as I’ve spoken to people who thought a support contract was more than the cost of the software; to people who thought it was support only and to people who didn’t know SDL provided any support at all!  So, one of my resolutions for 2019 will be to try and make sure that all our users are more aware of how to get help, even if they don’t want to purchase a support & maintenance contract.  So, I’ll cover these things:

  • Support & Maintenance Contract
  • SDL Community
  • The Customer Experience Team
  • The SDL AppStore Team
  • The SDL Marketing Team
  • Training
  • Customer Experience Program

Continue reading “A business resolution for 2019…”