
Translation tools and workflow

Translation
Contents
Translating for the European Commission
Translators’ needs
Our language resources
Translation workflow in the European Commission
Tools
■ Administration and documentation tools
■ Translation tools
Euramis
What next?

More information on the European Union is available on the internet (http://europa.eu).

Luxembourg: Publications Office of the European Union, 2016

Print ISBN 978-92-79-62206-9 doi:10.2782/872834 HC-06-16-046-EN-C


PDF ISBN 978-92-79-62204-5 doi:10.2782/703257 HC-06-16-046-EN-N

© European Union, 2016


Reproduction is authorised provided the source is acknowledged.

Printed in Luxembourg
Printed on white chlorine-free paper
Translating for the European Commission

The Directorate-General for Translation is one of the biggest translation services in the world. It is also the largest single department in the European Commission.

We have a total number of staff of around 2 200, including translators and staff performing tasks supporting translation work (management, assistants, communication, information technology, training, etc.). This figure includes all staff in Brussels, Luxembourg and the local offices in the Member States.

We translate some 2 million pages a year.

Other EU institutions and bodies (the Parliament, Council, Court of Justice, Central Bank, Court of Auditors, European Economic and Social Committee, Committee of the Regions, etc.) have their own translation departments, whereas the various specialised decentralised EU agencies and bodies send their translation work to the Translation Centre for the Bodies of the European Union.

DG Translation is organised according to languages. Each official EU language has its own language department, which is organised in translation units. Translators therefore work in single language units that specialise in particular subjects. They translate out of several languages, but almost always into their mother tongue. Requests for translations go through a central demand-management unit that negotiates the deadlines and how many language versions are needed of each incoming text and then assigns the jobs to the appropriate units.

Translators’ needs

Information technology is playing an ever-increasing role in translators’ daily work. We therefore make various computer tools available to translators, who use them according to their translation needs and personal preferences. The main document formats we work with are Word, Excel, PowerPoint, HTML and XML.

Irrespective of their preferred working methods, all translators’ needs are basically the same.

■ Appropriate terminology (dictionaries, glossaries, terminology databases, etc.).
■ Reference documents (paper and electronic archives, aligned texts, etc.).
■ Possibility to reuse previously translated texts (translation memories, electronic archives, etc.).
■ Central and local assistance. The role of secretaries has evolved from that of a typist into that of a fully fledged translation assistant. Secretaries and translators work hand in hand more than ever, with pre- and post-processing being handled by secretaries and translators focusing on the actual translation work. Assistance is provided centrally by a helpdesk and an alignment and pre-processing team, and locally within the language departments themselves.

Our language resources

To perform our tasks, we have a wide variety of language resources at our disposal:

■ terminology in many different forms (terminology databases, electronic dictionaries, etc.);
■ translation memories enabling genuine data sharing;
■ previous translations available from internal archives and other sources;
■ machine translation, which, at the European Commission, is used as a translation aid by translators and as a way for requesting Commission departments to understand the gist of a text.

Translation workflow in the European Commission

Brief explanation of tools

■ Poetry
DG Translation’s portal for receiving translation requests from the Commission’s other departments.
■ ManDesk
Software for managing translation requests inside DG Translation.
■ Euramis
Central translation memory of DG Translation.
■ TraDesk
Interface for translation document management and access to electronic archives where earlier translations are stored.
■ CAT tool (Computer-Aided Translation)
Translation memory software for local management of already translated text retrieved from the central memory.
■ DGTVista
Search engine for DG Translation’s archive of translated documents.
■ EUR-Lex
Online database of European Union law (eur-lex.europa.eu).
■ IATE
Terminology database of all European Union institutions (iate.europa.eu).

All translation requests are sent by Poetry to Euramis for automatic retrieval of
previously translated contents that are relevant to the request at hand. The results are
automatically stored by Euramis in TraDesk from where translators then create a local
translation package with all relevant information for their document.

After interactive translation with the CAT tool, users clean up, export and save all
translated documents to Euramis.

TRANSLATION WORKFLOW IN THE EUROPEAN COMMISSION (*)

[Figure: flowchart of the translation workflow. Steps shown: a Commission department sends a new translation request; the demand management unit in DG Translation accepts it; the original document is automatically processed by Euramis in order to find similar earlier translations; the head of a translation unit receives the translation request; the translator creates the translation file; the pre-processing team tracks down additional reference materials, if necessary; the translator translates using the CAT tool (Word, Excel, XML, HTML, …); a second translator revises the translation; the original translator produces the final version; the translation unit’s secretariat releases the translation; the translation is sent to the requesting Commission department and archived. Tools labelled in the diagram: Poetry, ManDesk, Euramis, TraDesk, CAT tool, DGT Vista.]

(*) All tools mentioned in the diagram are explained in detail in the following pages.

Tools

Administration and documentation tools

Poetry

Poetry is the software used for the electronic transmission of translation requests from our clients (other Commission departments) to DG Translation. The documents to be translated are bundled in an electronic folder containing the translation request, the original document for translation, and any reference documents, all of which the requester can send for translation in one go.

Some of the many clear advantages of Poetry are:

■ fast transmission;
■ integration into our electronic file-archiving system, TraDesk;
■ availability of original and reference documents in electronic form;
■ improved electronic workflow.

ManDesk (Manager’s Desktop)


ManDesk is the software used to manage and track translation requests inside DG Translation. The system is used to assign translation tasks to units and to translators and to deliver translated documents back to requesters. ManDesk implements a standardised workflow and routes translation requests to the various units who prepare, execute and quality check any linguistic product.

ManDesk follows an inbox concept, which means that each actor in a workflow receives specific tasks in an inbox from where they will be processed, finalised and then passed on automatically to the next actor until all parts of a translation request are finished.

ManDesk is also used to initiate outsourcing of a translation task by sending it to Trèfle, a system that makes translation tasks available to external contractors.
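
The inbox concept can be pictured as a chain of queues, where finishing a task in one inbox drops it into the next. The sketch below is only an illustration under that assumption: the actor names, the `Task` class and the `run_workflow` function are hypothetical and say nothing about how ManDesk is actually built.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Task:
    """A translation request moving through the workflow (hypothetical)."""
    request_id: str
    history: list = field(default_factory=list)


# Illustrative chain of actors; the real roles and their order may differ.
ACTORS = ["demand management", "head of unit", "translator", "reviser", "secretariat"]


def run_workflow(tasks):
    """Pass each task from inbox to inbox until the last actor has finished it."""
    inboxes = {actor: deque() for actor in ACTORS}
    inboxes[ACTORS[0]].extend(tasks)
    finished = []
    for position, actor in enumerate(ACTORS):
        while inboxes[actor]:
            task = inboxes[actor].popleft()
            task.history.append(actor)            # the actor processes the task
            if position + 1 < len(ACTORS):        # hand over to the next inbox
                inboxes[ACTORS[position + 1]].append(task)
            else:
                finished.append(task)
    return finished


done = run_workflow([Task("REQ-1"), Task("REQ-2")])
```

Each finished task carries the list of actors it passed through, which is the kind of tracking information a request manager needs.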

TraDesk (Translator’s Desktop)


TraDesk is the interface used by in-house translators to view their current tasks and all related information. From within TraDesk, a translator initiates the creation of a translation project or processes other tasks like the evaluation of a translation outsourced to an external contractor.

TraDesk provides access to the original document and to all files needed during the translation process, including:

■ reference documents;
■ pre-processing files;
■ comparisons between different versions;
■ ongoing translations;
■ completed translations.

TraDesk also allows translators to record additional notes on a given translation package and to share information with other translators working on the same or a similar file: terminology, hints for better understanding parts of the source text, or clarifications obtained from the requester.

DGTVista
DGTVista is a document search and display engine. It contains all incoming (mainly original texts) and outgoing documents (mainly translations) from and to every Commission department since 1994. The DGTVista search interface offers translators a range of search criteria (contents of the text but also document number, author, requesting department, title, etc.) and enables them to find virtually any document received or produced by DG Translation within a matter of seconds.

DGTVista’s particular features (short response time, bilingual parallel scrolling, easy downloading of documents, full-text search facility) make it a powerful translation aid.

EUR-Lex
EUR-Lex is an online repository of published EU legislation. It can be accessed by anyone free of charge at eur-lex.europa.eu.

It contains the treaties, secondary legislation and preparatory acts in all official EU languages, as well as national implementing measures and case-law of the Court of Justice of the European Union. It is also possible to consult the Official Journal of the European Union.

Our translators can access EUR-Lex via the web for consultation purposes, as well as indirectly through the Euramis interface (see the specific chapter below), in order to create a translation memory based on EUR-Lex content. Further ways to query EUR-Lex are via Quest and DocFinder (see explanations further below).

Translation tools

DG Translation has four main types of translation aids: terminology tools, translation memory technology, machine translation and speech recognition. There are two levels of translation memories (central and local), each using a different system.

Terminology tools
IATE

IATE (InterActive Terminology for Europe) is the terminology database shared by all EU institutions and bodies. It is also accessible to the public at iate.europa.eu.

The database contains over 8.6 million terms and about half a million abbreviations, covering all official EU languages, as well as Latin. The IATE database was developed and is maintained by an interinstitutional team, but the terminological content is managed by the language departments in each EU institution individually. Every translator in DG Translation can create entries in any language in the database. Mother tongue terminologists then validate the entries to ensure that the contents of the database are of a high quality.

Specific source-language terms or abbreviations and their equivalents in other official EU languages can be searched for in IATE. Searches can also be refined by specifying the domain or context in which a term is used.

Search results indicate the institution that created the entry and the context in which the term is used. Entries also have a reliability code, a numerical value of 1 to 4, 4 meaning ‘very reliable’ and 1 meaning ‘reliability not verified’.

DocFinder

During the translation process it is often necessary to consult reference documents for background information. Or, if a related document has already been translated into a specific target language, the translation might prove useful for some terminology questions.

DocFinder helps to quickly locate and display a document based on references in the text being translated, such as ‘Regulation (EC) No 44/2006’. With DocFinder the full text of a reference document can be retrieved within seconds; highlighting the document reference in the Word document and then clicking the DocFinder Word macro button is all that is needed. Alternatively, the document reference can be entered manually in the DocFinder web interface. DocFinder is also used by other EU institutions.
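
Spotting a document reference in running text is essentially a pattern-matching task. The sketch below recognises only the single citation style quoted above (‘Regulation (EC) No 44/2006’); the pattern and the `find_references` function are assumptions made for illustration and are not taken from DocFinder.

```python
import re

# Matches references of the form "Regulation (EC) No 44/2006" only.
# Other citation styles would need additional patterns.
REFERENCE = re.compile(r"Regulation\s+\((?:EC|EEC|EU)\)\s+No\s+(\d+)/(\d{4})")


def find_references(text):
    """Return (full reference, number, year) for each reference found in `text`."""
    return [(m.group(0), int(m.group(1)), int(m.group(2)))
            for m in REFERENCE.finditer(text)]


refs = find_references("As laid down in Regulation (EC) No 44/2006, the amounts are fixed.")
```

Each result keeps the full matched string next to the parsed number and year, so a lookup tool could use either form as its query.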

Quest

Quest is a metasearch tool designed to drastically reduce the time it takes translators to find solutions to terminology problems. Quest enables translators to search a multitude of DG Translation’s internal and public terminology sources in the time it would normally take to search a single source.

The web interface was developed in DG Translation with a view to centralising, simplifying and speeding up terminology searches. Our translators and those of other EU institutions can select the source language and up to three target languages on screen and specify exactly which databases they wish to include in the search. A Quest button in the Word toolbar makes it possible to launch searches for selected terms directly from a Word document. Like DocFinder, Quest is used across the EU institutions.
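
A metasearch saves time by querying several sources at once instead of one after another. The following sketch illustrates that idea with two invented in-memory glossaries queried in parallel. `SOURCES`, `search_source` and `metasearch` are hypothetical names, the source-language selection is left out for brevity, and nothing here describes Quest's real implementation.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory sources standing in for terminology databases.
SOURCES = {
    "glossary_a": {"translation memory": {"fr": "mémoire de traduction",
                                          "de": "Übersetzungsspeicher"}},
    "glossary_b": {"translation memory": {"fr": "mémoire de traduction"}},
}

MAX_TARGETS = 3  # the text mentions up to three target languages


def search_source(name, term, targets):
    """Look up `term` in one source and keep only the requested target languages."""
    entry = SOURCES[name].get(term, {})
    return name, {lang: entry[lang] for lang in targets if lang in entry}


def metasearch(term, targets, sources):
    """Query all selected sources concurrently and merge the results by source."""
    if len(targets) > MAX_TARGETS:
        raise ValueError("at most three target languages can be selected")
    with ThreadPoolExecutor() as pool:
        futures = [pool.submit(search_source, name, term, targets) for name in sources]
        return dict(future.result() for future in futures)


results = metasearch("translation memory", ["fr", "de"], ["glossary_a", "glossary_b"])
```

Results stay grouped by source, so the user can still see which database each equivalent came from.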

Translation memory technology


The Euramis central translation memory
DG Translation has developed a huge central translation memory as part of the Euramis project (see specific chapter below), with the underlying idea of providing facilities for genuine data sharing between translators of EU institutions.

The Euramis central translation memory is used automatically once DG Translation has accepted a translation request. An automatic retrieval will find segments already translated that could be reused or could be useful for translating the new document because of their similarity.

Translators will use the Euramis Concordance functionality interactively during the translation process in order to look up words and expressions and get a translation with context from documents already translated and stored in the central translation memory.

At present, the Euramis central memory contains over 1 billion segments covering all official EU languages.

English and French are the most commonly used source languages, reflecting the fact that nearly all documents sent for translation are written in one of them. When it comes to target languages, retrievals are more evenly distributed. Automated Euramis pre-processing is carried out on original documents in various formats.
during the translation process in order

The local memory


Our translators use a CAT tool to work locally with the content retrieved from the Euramis central translation memory. This CAT tool was selected following an interinstitutional call for tenders and was customised to meet the specific needs of all European institutions’ translators.

The tool gives translators access to all language and phraseology resources from a local translation memory. When the user enters an original text, similar or identical segments from previously translated texts pop up as translation suggestions for the job in hand.

DG Translation has defined a given set of metadata stored in Euramis (document number, year, client/requesting service and document type) so that every segment gets a specific label in the translation memory.

The CAT tool is particularly useful since a high proportion of legislative and preparatory documents are based on previous texts or existing legislation. It is mainly used as a front end for the local and interactive processing of data which is retrieved from, or is to be saved in, the Euramis central translation memory.
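
How ‘similar or identical’ segments can be proposed as suggestions is sketched below, using the similarity ratio from Python's standard `difflib` module. This is a stand-in chosen for simplicity: commercial CAT tools use their own matching algorithms, and the `suggest` function, the threshold and the sample memory are assumptions made for the example.

```python
from difflib import SequenceMatcher


def suggest(segment, memory, threshold=0.75):
    """Return (score, source, target) for stored segments similar to `segment`.

    `memory` is a list of (source, target) pairs; a score of 1.0 is an
    identical match, lower scores are fuzzy matches.
    """
    scored = []
    for source, target in memory:
        score = SequenceMatcher(None, segment.lower(), source.lower()).ratio()
        if score >= threshold:
            scored.append((round(score, 2), source, target))
    return sorted(scored, reverse=True)  # best match first


# Invented sample memory, for illustration only.
memory = [
    ("This Regulation shall enter into force on the day of its publication.",
     "Le présent règlement entre en vigueur le jour de sa publication."),
    ("Member States shall inform the Commission.",
     "Les États membres informent la Commission."),
]
hits = suggest("This Regulation shall enter into force on the day of its publication.",
               memory)
```

Here the first stored segment is an identical match, while the unrelated second segment falls below the threshold and is not suggested.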

Machine translation
The principle of machine translation (MT) is well known: a document is roughly translated from a source language into a target language on the basis of a system of dictionaries and linguistic rules or by using statistical techniques.

Machine translation as offered by DG Translation is used mainly for the three following purposes.

■ Gisting: Capable of translating a large number of pages per hour without human intervention, machine translation gives rapid access to information in languages that requesters do not know.
■ Translating is the principal reason for requesting machine translation by our translators. For translation purposes, the raw machine output is always edited.
■ Drafting in a language other than the requester’s mother tongue or main language. Some administrators prefer to write a text in their own language first, request a machine translation and then correct the output.

Since 2010 DG Translation has been developing MT@EC, the common Machine Translation service offered by the European Commission to translators in the EU institutions’ translation departments, European and national public administrations and EC information systems and online services. MT@EC makes use of data-driven (statistical) approaches, but depending on the needs also rule-based, hybrid and commercial components can be integrated. Compared to previous systems MT@EC offers better translation quality and covers more languages. It offers the possibility to develop customised solutions to fit the specific needs of users in a flexible and cost-efficient way.

MT@EC currently offers a total of 552 language pairs of which 76 are direct, while the remaining ones are available through English as a pivot language. An average of approximately 100 000 pages are translated by the system daily, with peaks of up to 400 000 pages. Depending on the length of the documents submitted, the requested translations will typically be available within only a few minutes.
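
The figure of 552 language pairs matches what 24 official languages give when each is paired with every other in both directions (24 × 23). The sketch below works through that arithmetic and shows what routing through a pivot language means. The count of 24 languages is an inference, not something the text states. The `route` function, the sample set of direct pairs and the assumption that every pair involving English is served directly are hypothetical.

```python
LANGUAGES = 24                               # assumed number of official EU languages
total_pairs = LANGUAGES * (LANGUAGES - 1)    # ordered source-to-target pairs
direct_pairs = 76                            # figure quoted in the text
pivot_pairs = total_pairs - direct_pairs     # pairs that go through English


def route(source, target, direct, pivot="en"):
    """Return the translation hops for a pair: one hop if direct, else two via the pivot.

    Assumes (for this sketch) that any pair involving the pivot itself is direct.
    """
    if (source, target) in direct or pivot in (source, target):
        return [(source, target)]
    return [(source, pivot), (pivot, target)]


# Hypothetical set of direct pairs, for illustration only.
direct = {("fr", "de")}
one_hop = route("fr", "de", direct)
two_hops = route("de", "fr", direct)
```

Under these assumptions 476 of the 552 pairs are served in two hops, which is why a pivot keeps the number of engines to build far below the number of pairs offered.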

Speech recognition
A number of our translators use speech recognition software. This allows users to dictate text directly onto their computer in a natural, continuous way, achieving a high degree of accuracy and efficiency. The software is a real time-saver for translators because they no longer need to type a large part of their work or have it typed by secretaries. The ergonomic and health benefits are also obvious, as adverse physical effects associated with intensive typing and mouse use are reduced.

Uptake of speech recognition technology in DG Translation has been limited by the fact that this technology has been developed by vendors for only some of the EU official languages. Tests are being carried out with other products to try to find suitable software for other languages as well, but the supply is limited in this field.

Euramis

Euramis concept
Euramis (European Advanced Multilingual Information System) refers to a series of client–server applications that provide access to a variety of services in the field of natural language processing. Euramis is based on the following principles:

■ central storage of linguistic resources with a view to data sharing (translation memories);
■ mass processing of linguistic data;
■ integration of various language applications and services with a view to giving one-stop access to, for example, translation memories;
■ workflow automation.

The Euramis project was launched in 1995. The underlying idea was to relieve translators of the more repetitive work and to achieve greater consistency in language and methodology, thus contributing to better quality assurance.

Euramis is unique because:

■ there is no other translation tool in the world capable of dealing with such a huge volume of material (translation segments);
■ it is truly multilingual;
■ it is incredibly fast; and
■ it is the backbone of all the EU institutions’ translation tools.

Quality assurance is a major concern for all the EU institutions’ translators. Therefore, to further improve consistency and allow genuine data sharing between different institutions, Euramis can be accessed by translators from the Parliament, Council, Court of Justice, Central Bank, Court of Auditors, European Economic and Social Committee, Committee of the Regions and the Translation Centre for the Bodies of the European Union.

How does Euramis work?

Euramis services are launched automatically as part of the document workflow, so in most cases the data needed by translators is available when they begin work. However, requests can also be submitted manually through the Euramis web portal. This enables the user to submit instructions and any files to be processed. Euramis then fulfils the request by retrieving the data requested from the sources shown in the illustration below and returns the results to the user by email or places them on a dedicated server.

[Figure: Euramis architecture. Labels shown: user; requests from partner institutions; Euramis Web Portal; e-mail; web services; Euramis engine; services offered: concordance, retrieval, download, alignment.]

EUR-Lex download
By typing EUR-Lex document references into the appropriate box of the Euramis web interface or by attaching a file that contains EUR-Lex references, the user can receive the full titles or text of the corresponding legal acts in one or more EU languages (up to 30 documents) by email.

The download function also allows automatic alignment of downloaded legal acts at server level. In that case, the user receives a file in TMX format. A similar service exists for internal Commission documents (Align DGT documents).

Alignment
Alignment is an operation consisting of splitting two existing texts in different languages (source and target) into segments, usually sentences, and then placing these segments in parallel. The result of the operation is a single file containing the various segment pairs which, after being checked (see Euramis Editor), is fed into the local and/or central translation memory.

Like most Euramis applications, requests for alignments are launched via the Euramis web portal and the results are received by email or placed on a dedicated server.

The user has to enter metadata to identify the document. This is the same information used in the local CAT tool and thus makes it possible to identify segments in the central translation memory.

An alignment can be made for:

■ instant use during the interactive translation process;
■ storage in the Euramis central translation memory for later use.

In both cases corrections can be made using the Euramis Editor.
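
To make the operation concrete, the sketch below performs a deliberately naive alignment (sentence splitting on end punctuation, pairing by position) and writes the pairs as a minimal TMX 1.4 document. The splitting rule, the function names and the header values are assumptions for illustration. Real aligners have to handle abbreviations, merged or split sentences and reordered text, which is why the result is checked afterwards.

```python
import re
import xml.etree.ElementTree as ET

XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"  # serialised as xml:lang


def split_sentences(text):
    """Naive segmentation: split after '.', '!' or '?' followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def align(source_text, target_text):
    """Pair segments by position; refuse if the counts differ."""
    src, tgt = split_sentences(source_text), split_sentences(target_text)
    if len(src) != len(tgt):
        raise ValueError("segment counts differ; manual correction is needed")
    return list(zip(src, tgt))


def to_tmx(pairs, src_lang, tgt_lang):
    """Serialise segment pairs as a minimal TMX 1.4 document (illustrative header)."""
    tmx = ET.Element("tmx", version="1.4")
    ET.SubElement(tmx, "header", {
        "creationtool": "alignment-sketch", "creationtoolversion": "0.1",
        "segtype": "sentence", "o-tmf": "none", "adminlang": "en",
        "srclang": src_lang, "datatype": "plaintext",
    })
    body = ET.SubElement(tmx, "body")
    for source, target in pairs:
        tu = ET.SubElement(body, "tu")           # one translation unit per pair
        for lang, text in ((src_lang, source), (tgt_lang, target)):
            tuv = ET.SubElement(tu, "tuv", {XML_LANG: lang})
            ET.SubElement(tuv, "seg").text = text
    return ET.tostring(tmx, encoding="unicode")


pairs = align("The Council adopted the act. It enters into force today.",
              "Le Conseil a adopté l'acte. Il entre en vigueur aujourd'hui.")
tmx_xml = to_tmx(pairs, "en", "fr")
```

When the two texts do not split into the same number of segments, the sketch raises an error instead of guessing, which corresponds to the case where a human has to correct the alignment.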

Euramis Editor
The Euramis Editor is simple to use and offers all the functions users might need when correcting an alignment. The tool offers a number of powerful features that support users in ensuring the quality of aligned files.

Sentence cells can be deleted, merged or split. It is normally assumed that the sequence of source and target sentences is parallel, but the cut, copy and paste functions can also be used. It is also possible to spellcheck the target text.

Euramis Editor also makes it possible to automatically check for redundant entries, select and delete them, anonymise files and change or add attribute values (metadata) to a given document. The tool offers advanced filtering options, including regular-expression searches.

Finally, corrected alignments can be saved directly to a Euramis database.

Central memory retrieval


A retrieval from the Euramis central translation memory is launched on the basis of an original document.

There are several search and selection criteria and other options.

The user can opt for two different output formats as follows.

■ TMX file: linguistic content to be imported into the local CAT tool for interactive translation. This format allows inclusion of the most relevant documents (when a certain number of translation units come from the same document, the whole document is retrieved and added to the TMX file).
■ HTML output: a pre-translated file. Sentences are automatically replaced in the source document with the best matches found. Colours are used to indicate whether they are a 100 % match or fuzzy match or the original text has been left. The column on the right-hand side contains metadata. It may be used for freelance translators and can serve as a visual aid for decision makers (how much text is actually new and which tools would be best to use for a given document).

Translation memory retrieval can also be refined using a given set of filters corresponding to the metadata used during the translation process.

For each request, a report is created containing detailed information about the results retrieved.
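
The rule for including whole documents in the TMX output can be sketched as a simple count per document. The text does not state how many translation units trigger the inclusion, so the `min_units` default below is purely hypothetical, as are the function name and the sample document numbers.

```python
from collections import Counter


def documents_to_include(hit_documents, min_units=3):
    """Return documents that contributed at least `min_units` matching translation units.

    `hit_documents` holds one document number per matched translation unit.
    """
    counts = Counter(hit_documents)
    return sorted(doc for doc, n in counts.items() if n >= min_units)


# Invented example: document numbers of the units matched for one request.
hits = ["DOC-A", "DOC-B", "DOC-A", "DOC-A", "DOC-C", "DOC-B"]
whole_docs = documents_to_include(hits)
```

In this example only DOC-A reaches the assumed threshold, so it alone would be retrieved in full and added to the TMX file.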

Combined services
Euramis offers several combined services, for example ‘EUR-Lex + Alignment’, ‘SGVista + Alignment’ and ‘Align DGT documents’. The underlying idea was to integrate various language services with each other and provide for one-stop access.

Save
Euramis allows you to save TMX files to the central translation memory. These files are produced either by a corrected alignment or by exporting a local translation memory, i.e. the outcome of translating a document.

Document Search
Document Search lets users search for specific documents already stored in Euramis translation memories. Once a document has been found, users can view it, download it or send feedback to the database manager. Sentences with the same attributes (document number, requesting service, document type, year and language) are regarded as pertaining to the same document.

Euramis Concordance

Euramis Concordance searches Euramis translation memories for the text entered in the text box. If one or more matches are found, a result table is created, which contains the sentences found on the left-hand side and their translations on the right-hand side.

Sentences from the same document (i.e. with the same attributes) found in the same translation memory are grouped together under a heading identifying the document.

Each document heading contains ‘Show the remaining hit(s)’, ‘Browse document’, ‘Download document’ and ‘Write a feedback on this document’ buttons, which users can click respectively in order to:

■ view all sentences found in the same document (source and target);
■ show the whole document where the sentences were found (i.e. view all the segments saved for the given document);
■ download all sentences from the document;
■ send feedback on the document.

There is also an ‘advanced options’ section that offers more scope for fine-tuning searches.

Euramis Concordance is a very useful and heavily used tool, as proved by over 80 000 queries processed every day on average with almost immediate response times.
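
The way a concordance result table groups hits by document can be sketched as follows. The `Segment` structure, the plain substring search and the sample data are assumptions for illustration (the language attribute is left out for brevity); they do not describe how Euramis indexes or searches its memories.

```python
from collections import defaultdict
from typing import NamedTuple


class Segment(NamedTuple):
    """A stored sentence pair with the attributes that identify its document."""
    doc_number: str
    service: str
    doc_type: str
    year: int
    source: str
    target: str


def concordance(query, memory):
    """Return {document attributes: [(source, target), ...]} for segments containing `query`."""
    groups = defaultdict(list)
    needle = query.lower()
    for seg in memory:
        if needle in seg.source.lower():
            heading = (seg.doc_number, seg.service, seg.doc_type, seg.year)
            groups[heading].append((seg.source, seg.target))
    return dict(groups)


# Invented sample memory, for illustration only.
memory = [
    Segment("DOC-1", "DG A", "report", 2016,
            "The translation memory is shared.", "La mémoire de traduction est partagée."),
    Segment("DOC-1", "DG A", "report", 2016,
            "Each translation memory segment is labelled.",
            "Chaque segment de la mémoire de traduction est étiqueté."),
    Segment("DOC-2", "DG B", "letter", 2015,
            "No match here.", "Aucune correspondance ici."),
]
table = concordance("translation memory", memory)
```

Both matching sentences share the same attributes, so they appear under a single document heading, mirroring the grouping described above.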

What next?

DG Translation will continue to adapt its tools and workflow to new developments in translation technology, focusing mainly on:

■ further development of machine translation;
■ further workflow automation;
■ further integration of language applications and services;
■ further integration at interinstitutional level.

You can find more information and publications on the Directorate-General for Translation of the European Commission, as well as our contact information, on our website: ec.europa.eu/info/departments/translation
