Publications/Academia (see Google Scholar, Semantic Scholar)

Re-visiting Automated Topic Model Evaluation with Large Language Models
Dominik Stammbach, Vilém Zouhar, Alexander Hoyle, Mrinmaya Sachan, Elliott Ash
In review
Enhancing Textbooks with Visuals from the Web for Improved Learning
Janvijay Singh, Vilém Zouhar, Mrinmaya Sachan
In review
PWESuite: Phonetic Word Embeddings and Tasks They Facilitate
Vilém Zouhar, Kalvin Chang, Chenxuan Cui, Nathaniel Carlson, Nathaniel Robinson, Mrinmaya Sachan, David Mortensen
In review
Multimodal Shannon Game with Images
Vilém Zouhar,= Sunit Bhattacharya,= Ondřej Bojar
Preprint
Poor Man's Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference
Vilém Zouhar, Shehzaad Dhuliawala, Wangchunshu Zhou, Nico Daheim, Tom Kocmi, Yuchen Eleanor Jiang, Mrinmaya Sachan
EACL 2023
Sentence Ambiguity, Grammaticality and Complexity Probes
Sunit Bhattacharya,= Vilém Zouhar,= Ondřej Bojar
BlackboxNLP 2022
Stroop Effect in Multi-Modal Sight Translation
Sunit Bhattacharya, Vilém Zouhar, Věra Kloudová, Ondřej Bojar
Preprint
Fusing Sentence Embeddings Into LSTM-based Autoregressive Language Models
Vilém Zouhar, Marius Mosbach, Dietrich Klakow
Preprint
Shrinking Knowledge Base Size: Dimension Reduction, Splitting & Filtering
Vilém Zouhar
Master thesis 2022
Knowledge Base Index Compression via Dimensionality and Precision Reduction
Vilém Zouhar, Marius Mosbach, Miaoran Zhang, Dietrich Klakow
ACL Spa-NLP 2022
EMMT: A simultaneous eye-tracking, 4-electrode EEG and audio corpus for multi-modal reading and translation scenarios
Sunit Bhattacharya, Věra Kloudová, Vilém Zouhar, Ondřej Bojar
Preprint
Neural Machine Translation Quality and Post-Editing Performance
Vilém Zouhar, Ondřej Bojar, Martin Popel, Aleš Tamchyna
EMNLP 2021
Providing Backtranslation Improves Users Confidence in MT, Not Quality
Vilém Zouhar, Michal Novák, Matúš Žilinec, Ondřej Bojar, Mateo Obregón, Robin L. Hill, Frédéric Blain, Marina Fomicheva, Lucia Specia, Lisa Yankovskaya
NAACL 2021
Artefact Retrieval: Overview of NLP Models with Knowledge Base Access
Vilém Zouhar, Marius Mosbach, Debanjali Biswas, Dietrich Klakow
AKBC CSKB 2021
Sampling and Filtering of Neural Machine Translation Distillation Data
Vilém Zouhar
NAACL SRW 2021
Leveraging Neural Machine Translation for Word Alignment
Vilém Zouhar, Daria Pylypenko
PBML 116
WMT20 Document-Level Markable Error Exploration
Vilém Zouhar, Tereza Vojtěchová, Ondřej Bojar
WMT20
Extending Ptakopět for MT User Interaction Experiments
Vilém Zouhar, Michal Novák
PBML 115
Outbound Translation User Interface Ptakopet: A Pilot Study
Vilém Zouhar, Ondřej Bojar
LREC 2020
A Collection of Machine Learning Excercises
Martin Holub, Barbora Vidová Hladká, Vilém Zouhar
Teaching material
Statistical Natural Language Processing Presentations
Vilém Zouhar, Awantee Deshpande, Julius Steuer
Teaching material
Enabling Outbound Machine Translation
Vilém Zouhar
Bachelor thesis 2020
Evaluating Optimal Reference Translations
Vilém Zouhar, Věra Kloudová, Martin Popel, Ondřej Bojar
In review
Paper on tokenization (1)
main author
In review
Paper on tokenization (2)
main author
In review
Paper on beam-search
n-th author
In review