The Term Extraction Module (TEM) ☍ allows you to extract candidate terms from one or more documents and/or from one or more URLs.
The module offers monolingual and bilingual extractions, and the languages that are currently supported are DE, EN, ES, FR and IT. There is no need to specify the language of the documents that you submit, as the module will recognise it automatically.
In this page, you will find out how to:
Read more on user groups and access rights.
CREATE A TEM REQUEST
To create a term extraction request:
- Go to the ‘Term processing’
menu, and click on the ‘Term Extraction Module (TEM)’ tab.
- Use the sliding button to indicate whether the request is monolingual or bilingual.
- For monolingual requests, upload one or more source documents and/or insert one or more URLs. For bilingual requests, also provide the target files and/or URLs.
![]() |
![]() |
- Name your term extraction request.
- Choose whether to apply an exclusion list, i.e. a list of terms that should not be proposed as candidate terms.
- Click on ‘Create’ to submit your TEM request.
EXCLUSION LIST
You can either generate your own exclusion list using the template available for download at the bottom of the page, or use one of two proposed exclusion lists containing:
- the most frequent EN words in the DGT corpus, or
- the most duplicated EN terms in IATE.
To apply an exclusion list, upload at least two source (or target) files. You will then be given the option to mark one of them as an exclusion file (only one exclusion list can be applied per request).
You can also apply both proposed exclusion lists automatically (feature only available for English). Choose from the following three thresholds:
- Low: excludes 33 % of the content of the two lists.
- Medium: excludes 66 % of the content of the two lists.
- High: excludes all the content of the two lists.
(*) User GROUPs and access rights
Check below to see which IATE user groups can create term extraction requests:
User group | Create TEM request |
---|---|
NON-LOGGED-IN USER | No |
TRANSLATOR and above (except LIMITED) | Yes |