Skip to content
IATE logo 🖶
lorem ipsum

Term Extraction Module (TEM) (*)

The Term Extraction Module (TEM) ☍ allows you to extract candidate terms from one or more documents and/or from one or more URLs.

The module offers monolingual and bilingual extractions, and the languages that are currently supported are DE, EN, ES, FR and IT. There is no need to specify the language of the documents that you submit, as the module will recognise it automatically.

In this page, you will find out how to:

  • Create a TEM request
  • Use an exclusion list

Read more on user groups and access rights.

CREATE A TEM REQUEST

To create a term extraction request:

  • Go to the ‘Term processing’ menu, and click on the ‘Term Extraction Module (TEM)’ tab.
  • Use the sliding button to indicate whether the request is monolingual or bilingual.
  • For monolingual requests, upload one or more source documents and/or insert one or more URLs. For bilingual requests, also provide the target files and/or URLs.
The most common editable formats are accepted (Word, Excel, PowerPoint, editable PDF, HTML, XML, CSV, etc.).
For bilingual term extractions, the process is run on parallel documents or URLs, with English required as either the source or target language. The bilingual extraction will then propose pairs of candidate terms. Users with Administrator and Terminologist+ roles can additionally create bilingual raw entries from these pairs.
  • Name your term extraction request.
  • Choose whether to apply an exclusion list, i.e. a list of terms that should not be proposed as candidate terms.
  • Click on ‘Create’ to submit your TEM request.
EXCLUSION LIST

You can either generate your own exclusion list using the template available for download at the bottom of the page, or use one of two proposed exclusion lists containing:

  • the most frequent EN words in the DGT corpus, or
  • the most duplicated EN terms in IATE.

To apply an exclusion list, upload at least two source (or target) files. You will then be given the option to mark one of them as an exclusion file (only one exclusion list can be applied per request).

You can also apply both proposed exclusion lists automatically (feature only available for English). Choose from the following three thresholds:

  • Low: excludes 33 % of the content of the two lists.
  • Medium: excludes 66 % of the content of the two lists.
  • High: excludes all the content of the two lists.
(*) User GROUPs and access rights

Check below to see which IATE user groups can create term extraction requests:

User groupCreate TEM request
NON-LOGGED-IN USERNo
TRANSLATOR and above (except LIMITED)Yes

Related Pages

TEM candidate management

  • General information
    • Introduction to the IATE Online Help
    • About IATE
    • Multilingual interface
    • Accessibility statement
    • Create an IATE account and log on/off
    • Local storage and browser cache
    • Contact
  • User dashboard (*)
    • User preferences (*)
    • Bookmarks (*)
    • Watch lists (*)
    • Notifications (*)
  • Search
    • Main search
      • Expanded search
        • Matching options
        • Search by term types
        • Search in specific fields
        • Filters
    • Search by collection
    • Search by URL
    • Results
      • Standard view vs interpreters’ view
    • Exports (*)
  • Entry overview
    • Full entry view
    • Entry structure
      • Language-Independent Level
      • Language Level
      • Term Level
    • Feedback on an entry (*)
  • Entry management (*)
    • General input criteria
    • List of fields
      • Domains
      • Collections
      • Field completion score (*)
    • References
      • Best practices related to references (*)
      • Types of references
      • Entry-to-entry links
      • Clipboard
    • General editing features
      • Action buttons
      • Contextual menu
      • Formatting
    • Entry creation (*)
    • Data modification (*)
    • Duplicate detection (*)
    • Deletion (*)
    • Undeletion (*)
    • History/audit
  • Advanced features (*)
    • Table view (*)
    • Experimental features (*)
    • Post-adoption checks (*)
  • Terminology projects (*)
    • Project list (*)
    • Create and edit a project (*)
    • Preparatory material (*)
    • Project entries (*)
    • Project assignments (*)
    • My assignments (*)
    • My assigned entries (*)
    • Internal forum (*)
    • External forum (*)
  • Document processing (*)
    • Term Extraction Module (TEM) (*)
      • TEM candidate management (*)
  • Documentation
    • Documentation & tutorials
    • Useful shortcuts
  • Technical info (*)
    • Download IATE (*)
  • Statistics

This handbook is part of IATE, the European Union terminology portal.

Powered by PressBook WordPress theme