Quality Evaluation and Benchmarking


Data-driven decision-making

Human quality evaluation, automated quality estimation

cApStAn masters the art of selecting, testing and evaluating the technology that works best for your project. We use professional versions of NMT engines and paid versions of language models specifically designed for translation, and we assess their output through A/B testing and human evaluation. Once a provider is selected, we leverage AI quality estimation (AIQE), establish benchmarks for each locale and provide confidence labels for translated content.

cApStAn Modular Approach – Our language services are organized in 20 modules that can be combined, according to your needs, requirements and goals, to build the best workflow for each project.
Download Our Modular Approach Document


Quality Evaluation of Automated Translation (AT)

What? Evaluation of the output of different engines, models and providers
How? A blind test on a sample of content translated by the different engines.

For each translated segment, our linguists use a multidimensional framework to assign accuracy and fluency scores, and they provide feedback on key issues. The system computes the proportion of segments that required no post-editing at all. At the end of this process, we have comparable feedback on translation quality for each of the engines or models we evaluated.

We determine which solution performs best for a given language pair and domain
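
As an illustration of how such blind-test ratings can be aggregated, here is a minimal Python sketch; the rating scale, field names and scores are hypothetical examples, not cApStAn's actual evaluation schema.

```python
# Minimal sketch of aggregating blind-test ratings per engine.
# The 1-5 scale and the sample data below are illustrative assumptions.
from statistics import mean

# Each rating: (engine, accuracy 1-5, fluency 1-5, needed_post_editing)
ratings = [
    ("engine_a", 5, 5, False),
    ("engine_a", 4, 3, True),
    ("engine_b", 5, 4, False),
    ("engine_b", 5, 5, False),
]

engines = {e for e, *_ in ratings}
for engine in sorted(engines):
    rows = [r for r in ratings if r[0] == engine]
    acc = mean(r[1] for r in rows)
    flu = mean(r[2] for r in rows)
    # Proportion of segments the linguists left untouched.
    untouched = sum(not r[3] for r in rows) / len(rows)
    print(f"{engine}: accuracy {acc:.2f}, fluency {flu:.2f}, "
          f"{untouched:.0%} of segments needed no post-editing")
```

Aggregating the scores this way yields directly comparable figures per engine, which is what makes the final provider decision data-driven rather than anecdotal.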

AI Quality Estimation (AIQE) and Thresholds

What? Comparison of automatically generated confidence scores with edit distance
How? Once we have selected the MT engine or language model, we use it to translate a sufficiently large sample of the content. A post-editor reviews and improves the automated translation as needed. An algorithm computes the edit distance (the number of deletions, insertions or substitutions required to transform one string into another). At the same time, we automatically generate confidence scores for each segment of the translated sample. Our translation technology team compares the edit distances with the confidence scores to determine a threshold below which full post-editing is required.

Leveraging AI to help revisers focus on parts that actually need revision
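
To make the calibration step concrete, the sketch below pairs a standard Levenshtein edit distance with hypothetical per-segment confidence scores; the sample data and the length normalisation are illustrative assumptions, not the production AIQE pipeline.

```python
# Minimal sketch of the edit-distance side of AIQE calibration.

def levenshtein(a: str, b: str) -> int:
    """Number of insertions, deletions or substitutions to turn a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(prev[j] + 1,                 # deletion
                            curr[j - 1] + 1,             # insertion
                            prev[j - 1] + (ca != cb)))   # substitution
        prev = curr
    return prev[-1]

# (machine translation, post-edited version, engine confidence score)
samples = [
    ("the cat sat on a mat", "the cat sat on the mat", 0.93),
    ("he go to school",      "he goes to school",      0.61),
    ("good morning",         "good morning",           0.98),
]

for mt, pe, conf in samples:
    # Normalise by segment length so long and short segments are comparable.
    ratio = levenshtein(mt, pe) / max(len(mt), len(pe))
    print(f"confidence {conf:.2f} -> normalised edit distance {ratio:.2f}")

# Plotting confidence against edit distance over a large sample reveals the
# confidence value below which segments typically needed heavy editing,
# i.e. the threshold for mandatory full post-editing.
```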

Dual Verification

What? Instruments with specialized subject matter
How? Same process as full verification, but both a linguist and a subject matter expert (SME) work together to verify, and a cApStAn project manager combines their feedback into an actionable report.

Check content localization in addition to linguistic quality

Automated check using VeryFire™

What? Large-scale data collection instruments
How? Our translation technologists program project-specific and language-specific rules. VeryFire™, cApStAn’s in-house QA tool, automatically checks adherence to these rules or to a glossary and flags all violations.

Add an extra quality check step
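
VeryFire™ itself is proprietary, but a rule-based check of this kind can be sketched in a few lines; the glossary entries, rule patterns and function names below are purely illustrative assumptions, not the tool's actual rule set.

```python
# Minimal sketch of glossary and rule checks on a translated segment.
import re

glossary = {"respondent": "répondant"}  # source term -> required target term
rules = [
    (r"\d+\s?%", "Percentages must be preserved in the target"),
]

def check(source: str, target: str) -> list[str]:
    """Return a list of flagged violations for one segment pair."""
    flags = []
    for src_term, tgt_term in glossary.items():
        if src_term in source.lower() and tgt_term not in target.lower():
            flags.append(f"Glossary: '{src_term}' not rendered as '{tgt_term}'")
    for pattern, message in rules:
        # Flag when the rule pattern occurs a different number of times
        # in source and target (e.g. a dropped percentage).
        if len(re.findall(pattern, source)) != len(re.findall(pattern, target)):
            flags.append(f"Rule: {message}")
    return flags

print(check("About 40% of respondents agreed.",
            "Environ 40 % des participants étaient d'accord."))
```

Run over an entire file, checks like these surface every rule violation for a human to confirm or dismiss, which is what makes them practical for large-scale instruments.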

Machine translation quality evaluation

What? Instruments translated using machine translation
How? A combination of algorithms and targeted human evaluation that helps you make an informed decision about post-editing needs.

Take advantage of machine translation


Check Out Some of Our Case Studies

Machine Translation Quality Evaluation

Machine Translation Quality Evaluation & Post-Editing

About the project The client needed to code responses to open-ended questions from a survey that collected data from Chinese speakers. The client tried to use machine translation, but the [...]

Learn More
Translation validation of adult literacy survey

Translation validation of adult literacy survey – The LAMP

About LAMP The Literacy Assessment and Monitoring Programme (LAMP), organised by the UNESCO Institute for Statistics (UIS), is an adult literacy survey designed to be administered in emerging countries, where building [...]

Learn More
OECD IELS

International Early Learning and Child Well-being Study (IELS)

About the Study The International Early Learning and Child Well-being Study (IELS) is an OECD international survey that assesses emergent cognitive skills; social and emotional skills; and skills that draw from [...]

Learn More

Contact Us

We'd love to hear from you, and you can expect a quick response.


Brussels

Chaussée de La Hulpe 268, 1170 Brussels

+32 2 663 1729

Philadelphia

121 S. Broad Street, Suite 1710, Philadelphia, PA 19107

+1 267 469 2611