Publications/Academia (see Google Scholar, Semantic Scholar)


WMT 2023 Shared Task on Machine Translation with Terminologies
Kirill Semenov, Vilém Zouhar, Tom Kocmi, Dongdong Zhang Wangchunshu Zhou, Yuchen Eleanor Jiang
EMNLP 2023
A Diachronic Perspective on User Trust in AI under Uncertainty
Shehzaad Zuzar Dhuliawala,= Vilém Zouhar,= Mennatallah El-Assady, Mrinmaya Sachan
EMNLP 2023
Enhancing Textbooks with Visuals from the Web for Improved Learning
Janvijay Singh, Vilém Zouhar, Mrinmaya Sachan
EMNLP 2023
Re-visiting Automated Topic Model Evaluation with Large Language Models
Dominik Stammbach, Vilém Zouhar, Alexander Hoyle, Mrinmaya Sachan, Elliott Ash
EMNLP 2023
Tokenization and the Noiseless Channel
Vilém Zouhar, Clara Meister, Juan Luis Gastaldi, Li Du, Mrinmaya Sachan, Ryan Cotterell
ACL 2023
Evaluating Optimal Reference Translations
Vilém Zouhar, Věra Kloudová, Martin Popel, Ondřej Bojar
JNLE, to appear 2024
A Formal Perspective on Byte-Pair Encoding
Vilém Zouhar, Clara Meister, Juan Luis Gastaldi, Li Du, Tim Vieira, Mrinmaya Sachan, Ryan Cotterell
ACL 2023
RELIC: Investigating Large Language Model Responses using Self-Consistency
Furui Cheng, Vilém Zouhar, Simran Arora, Mrinmaya Sachan, Hendrik Strobelt, Mennatallah El-Assady
In review 2023
PWESuite: Phonetic Word Embeddings and Tasks They Facilitate
Vilém Zouhar, Kalvin Chang, Chenxuan Cui, Nathaniel Carlson, Nathaniel Robinson, Mrinmaya Sachan, David Mortensen
In review 2023
Poor Man's Quality Estimation: Predicting Ref.-Based MT Metrics Without Reference
Vilém Zouhar, Shehzaad Dhuliawala, Wangchunshu Zhou, Nico Daheim, Tom Kocmi, Yuchen Eleanor Jiang, Mrinmaya Sachan
EACL 2023
Sentence Ambiguity, Grammaticality and Complexity Probes
Sunit Bhattacharya,= Vilém Zouhar,= Ondřej Bojar
BlackboxNLP 2022
Shrinking Knowledge Base Size: Dimension Reduction, Splitting & Filtering
Vilém Zouhar
Master thesis 2022
Knowledge Base Index Compression via Dimensionality and Precision Reduction
Vilém Zouhar, Marius Mosbach, Miaoran Zhang, Dietrich Klakow
SpaNLP 2022
Neural Machine Translation Quality and Post-Editing Performance
Vilém Zouhar, Ondřej Bojar, Martin Popel, Aleš Tamchyna
EMNLP 2021
Providing Backtranslation Improves Users Confidence in MT, Not Quality
V. Zouhar, M. Novák, M. Žilinec, O. Bojar, M. Obregón, R. L. Hill, F. Blain, M. Fomicheva, L. Specia, L. Yankovskaya
NAACL 2021
Artefact Retrieval: Overview of NLP Models with Knowledge Base Access
Vilém Zouhar, Marius Mosbach, Debanjali Biswas, Dietrich Klakow
AKBC CSKB 2021
Sampling and Filtering of Neural Machine Translation Distillation Data
Vilém Zouhar
NAACL SRW 2021
Leveraging Neural Machine Translation for Word Alignment
Vilém Zouhar, Daria Pylypenko
PBML 116
WMT20 Document-Level Markable Error Exploration
Vilém Zouhar, Tereza Vojtěchová, Ondřej Bojar
WMT 2020
Extending Ptakopět for MT User Interaction Experiments
Vilém Zouhar, Michal Novák
PBML 115
Outbound Translation User Interface Ptakopět: A Pilot Study
Vilém Zouhar, Ondřej Bojar
LREC 2020
Enabling Outbound Machine Translation
Vilém Zouhar
Bachelor thesis 2020