The challenge

TaMTAS addresses two connected barriers: the language in which science circulates and the complexity of scientific writing itself.

The deeper view

The project proposes terminology-aware, document-level machine translation and augmentation for life sciences. Large Reasoning Models treat translation as a reasoning task, while quality estimation, automatic post-editing and audience adaptation improve reliability and accessibility.

A linguistic barrier

English dominance disadvantages non-native researchers and leaves less-represented languages with fewer scientific resources and terms.

A comprehension barrier

Even after translation, complex structures and specialist terminology can keep scientific knowledge inaccessible to students and the public.

Eight objectives, one integrated system.

SO1
Compile and enrich multilingual scientific corpora
SO2
Advance terminology extraction and integration
SO3
Train document-level translation with LRMs
SO4
Detect terminology errors with quality estimation
SO5
Join quality estimation and automatic post-editing
SO6
Adapt translated text to different audiences
SO7
Validate with stakeholders in real settings
SO8
Publish reusable outputs openly

The deeper view

A linguistic barrier

A comprehension barrier

Eight objectives, one integrated system.

Compile and enrich multilingual scientific corpora

Advance terminology extraction and integration

Train document-level translation with LRMs

Detect terminology errors with quality estimation

Join quality estimation and automatic post-editing

Adapt translated text to different audiences

Validate with stakeholders in real settings

Publish reusable outputs openly