Studio Tips

The handling of numbers and units in Studio is always something that raises questions and over the years I’ve tackled it in various articles.  But one thing I don’t believe I have specifically addressed, and I do see this rear its head from time to time, is how to handle the spaces between a number and its unit.  So it thought it might be useful to tackle it in a simple article so I have a reference point when asked this question, and perhaps it’ll be useful for you at the same time.

I have a background in Civil Engineering so when I think about this topic I naturally fall back to “The International System of Units (SI)” which has a clear definition on this topic:

There is a little more to it than this, and you can find the full specification here, but the essence is you always have a space.  I don’t think it defines how wide the space should be or whether it’s a non-breaking space or not, although in practice I think most technical writers would commonly use a non-breaking space.  There is a page here defining the rules if you’re interested.

Interestingly I started this article with an image showing numbers and a percentage symbol, and yet this is not an SI unit at all.  Rather it’s a number, normally meaning 0.01 and when used in junction with an SI unit there should be a space between the number and the percentage symbol.  So, where is this taking me?  This topic of whether you should use a space or not is quite often the source of a debate, and often confusion, between users and even though I have an engineering background I don’t really get too hung up on it.  For me the most important thing is that I have a way of dealing it with it.  So irrespective of what the source contains, or what the styleguide is asking for, I need to be able to handle it.

Handling it in SDL Trados Studio

It’s quite possible that the styleguide you are working to, or simply the verbal request you are working to, could have different rules for one language compared to another.  I occasionally come across a translator referring me to some document on the internet I’ve never heard of that sets out the rules for this kind of thing in their language pair.  Fair enough … Studio supports this by allowing you to define the way spaces are handled by language.  In fact it’s this very ability that makes it difficult for users to know about this feature at all!  Studio has this concept of All Language Pairs which looks like this:


I’d hazard a guess that most users, unless they are familiar with multilingual projects, only ever use the settings here under All Language Pairs.  You won’t find anything at all in here related to measurements and their spaces.  In fact you may have even looked at the specific language pairs underneath, thought they were the same and wondered why we even had them?  Well, the reason for having them is because it is possible to use different settings for each language in a multilingual project.  If you receive packages you might have wondered why the Translation Memories in the package you received are actually under the specific language pair, well this is the reason why.  If you place your Translation Memories at the All Language Pairs level then you avoid having to add them in multiple locations but it’s one setting for all.  It’s all about choice.

But hang on a minute… what about measurements?  Well this is an odd one that I don’t have a good answer for because it’s not possible to apply a setting for measurements that can apply to all languages.  I have no idea why!  But if you open up one of the specific language pairs you’ll see something like this:

Now we see a few differences:

  • we have an Auto-substitution node that expands to give us settings for Dates and Times, and also for Measurements.
  • we can add AutoSuggest Dictionaries at this level (I also have no good explanation for why here and not at All Language Pairs… surely they could be recognised in the same way a TM is?)
  • we don’t have Language Resources (I think these do make sense in All Language Pairs seeing as they relate to the specific language pairs anyway and the setting is unique to the resource template you choose)

It’s the Measurements I’m most interested in here, although I’d recommend you review the others too as you might find some interesting surprises.  If I click on this node I see this:

Aha… now everything should become clear.  There should be enough options in here to satisfy all the requirements you are likely to have.  You can match the source, you can set your own and Studio will automatically handle this for you using interactive translation when you press Ctrl+comma to pick up your placeable, or in pre-translation when the numbers and units are automatically recognised by Studio.  Also note that you can use the settings you need in a Project template which means you don’t have to keep changing the settings every time you need something different to the defaults for a new Project.

Reality bites!

So that’s all good news so far… but I do like to try and be real about the features in Studio so here’s the problem.  In the last paragraph I underlined “when the numbers and units are automatically recognised” for a reason.  If they are not recognised then none of this will apply and you have to start looking for workarounds.  The workaround you need will depend on how much, if any, of the numbers are recognised.  I have been running a test since prior to Studio 2014 SP2 (maybe around Studio 2011 I think) and even in my most recent check in Studio 2017 CU5 we still only recognise 19% of all SI units that have been correctly written.  Interestingly if I write them incorrectly by not having a space between them then Studio recognises 56% of them which gives you a better chance of handling them out of the box!  So you better break out your workaround hat if you’re a technical translator.  By a strange quirk of fate you technical translators could actually be the best group of translators to have to deal with this problem as solving these types of issues is probably in your nature!  If you’re interested in the file with these results then I have put them here as they might be helpful for two reasons:

  1. you’ll know which ones are being recognised and which are not.  This could be helpful if you think you’re the one doing something wrong.
  2. you can feedback if you get different results in your language pair.  I only tested from English to German and this might also make a difference to the results… it shouldn’t do, but it might.

If you’re interested in the workarounds, then they would probably be variants of these:

  • allow Studio to do its thing and search and replace using regular expressions afterwards (will only work if Studio still recognises the measurements but transposes it incorrectly)
  • use the Regex Match AutoSuggest Provider to interactively get the transposition you need (links to a great article from Nora Diaz on this tool)
  • use the Terminjector Translation Provider to deliver the transposition you need
  • edit the source file before you translate it so the measurements used are correctly recognised (you’ll need the application that created the source, and hope it supports some kind of regex/wildcard search & replace)
  • edit the source file after the project is created so the measurements are correctly recognised (SDLXLIFF Toolkit)

If all of this is having you nod your head then go and vote for this idea or even this one… or if you also want to see a conversion of the units which used to be “unreliably” possible in Translators Workbench then go and vote for this one.  Otherwise I hope the article was useful and that you are one of the lucky majority who are working with files that can be handled perfectly well out of the box in Studio using the settings above.

Every now and then I see an application and I think… this one is going to be a game changer for Studio users.  There have been a few, but the top two for me have been the “SDLXLIFF to Legacy Converter” which really helped users working with mixed workflows between the old Trados tools and the new Studio 2009, and the “Glossary Converter” which has totally changed the way translators view working with terminology and in my opinion has also been responsible for some of the improvements we see in the Studio/MultiTerm products today.  There are many more, and AnyTM is a contender, but if I were to only pick my top three where I instantly thought WOW!, then the first two would feature.  So what about the third?  You could say I have the benefit of hindsight with the first two although I’m not joking about my reaction when I first saw them, but the third is brand new and I’m already predicting success!

Read More

I’m back on the topic of PDF support!  I have written about this a few times in the past with “I thought Studio could handle a PDF?” and “Handling PDFs… is there a best way?“, and this could give people the impression I’m a fan of translating PDF files.  But I’m not!  If I was asked to handle PDF files for translation I’d do everything I could to get hold of the original source file that was used to create the PDF because this is always going to be a better solution.  But the reality of life for many translators is that getting the original source file is not always an option.  I was fortunate enough to be able to attend the FIT Conference in Brisbane a few weeks ago and I was surprised at how many freelance translators and agencies I met dealt with large volumes of PDF files from all over the world, often coming from hospitals where the content was a mixture of typed and handwritten material, and almost always on a 24-hr turnaround.  The process of dealing with these files is really tricky and normally involves using Optical Character Recognition (OCR) software such as Abbyy Finereader to get the content into Microsoft Word and then a tidy up exercise in Word.  All of this takes so long it’s sometimes easier to just recreate the files in Word and translate them as you go!  Translate in Word…sacrilege to my ears!  But this is reality and looking at some of the examples of files I was given there are times when I think I’d even recommend working that way!

Read More

A nice picture of a cutie cat… although I’m really looking for a cutie linguist and didn’t think it would be appropriate to share my vision for that!  More seriously the truth isn’t as risqué… I’m really after Qt Linguist.  Now maybe you come across this more often than I do so the solutions for dealing with files from the Qt product, often shared as *.TS files, may simply role off your tongue.  I think the first time I saw them I just looked at the format with a text editor, saw they looked pretty simple and created a custom filetype to deal with them in Studio 2009.  Since that date I’ve only been asked a handful of times so I don’t think about this a lot… in fact the cutie cat would get more attention!  But in the last few weeks I’ve been asked four times by different people and I’ve seen a question on proZ so I thought it may be worth looking a little deeper.

Read More

001Not Marvel Comics, but rather the number four which does have some pretty interesting properties.  It’s the only cardinal number in the English language to have the same number of letters as its value; in Buddhism there are four noble truths; in Harry Potter there are four Houses of Hogwarts; humans have four canines and four wisdom teeth; in chemistry there are four basic states of matter… but more importantly, for translators using Studio 2017 there are four ways, out of the box, to get started!

Now with that very tenuous link let’s get to the point.  Four ways to start translating, all of them pretty easy but they all have their pros and cons.  So getting to grips with this from the start is going to help you decide which is best for you.  First of all what are they?

  1. Translate single document
  2. Create a project
  3. Drag and drop your files
  4. Right-click and “Translate in SDL Trados Studio”

And now we know what they are should you use one process for all, or can you mix and match?  I mix and match all the time, mainly between 1. and 2. but let’s look at the differences first and you can make your own mind up.

Read More

001It’s been a while since I wrote anything about the SDLXLIFF Toolkit.. in fact I haven’t done since it was first released with the 2014 version of Studio.  Now that we have added a few new things such as SDLPLUGINS so that apps are better integrated and can be more easily distributed with Studio we have launched a new version of the toolkit for Studio 2017.  What’s new?  To be honest not a lot, but there are a couple of things that I think warrant this visit.

First of all, the app is now a plugin and this means it loads faster, is always available and there are a few tricks to being able to get the most from this.  Secondly, there are a few fixes to the search & replace features that make it possible to complete tasks that Studio will fail with and to do this the API team completely rebuilt the regex engine.  So whilst you won’t see too many changes, there are a few under the hood.

The best way to illustrate this is to show you so I have created a short video below where I have tried to explain how best to use the toolkit now it’s a plugin and not a standalone application, and I used the problems described below to demonstrate how it works.  If you want to know what else it can do I have reproduced part of the original guide below the video as that seems to have been lost over the years.  This might be helpful for a few of the more obscure features you may not have realised were possible.

Read More

001One of my favourite features in Studio 2017 is the filetype preview.  The time it can save when you are creating custom filetypes comes from the fun in using it.  I can fill out all the rules and switch between the preview and the rules editor without having to continually close the options, open the file, see if it worked and then close the file and go back to the options again… then repeat from the start… again… and again…   I guess it’s the little things that keep us happy!

I decided to look at this using a YAML file as this seems to be coming up quite a bit recently.  YAML, pronounced “Camel”, stands for “YAML Ain’t Markup Language” and I believe it’s a superset of the JSON format, but with the goal of making it more human readable.  The specification for YAML is here, YAML Specification, and to do a really thorough job I guess I could try and follow the rules set out.  But in practice I’ve found that creating a simple Regular Expression Delimited Text filetype based on the sample files I’ve seen has been the key to handling this format.  Looking ahead I think it would be useful to see a filetype created either as a plugin through the SDL AppStore, or within the core product just to make it easier for users not comfortable with creating their own filetypes.  But I digress…

Read More

%d bloggers like this: