default search action
Ondrej Bojar
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j28]Michelle Elizabeth, Ondrej Bojar:
Revamping the SLTev Tool for Evaluation of Spoken Language Translation. Prague Bull. Math. Linguistics 121: 5-14 (2024) - [j27]Michal Novák, Peter Polák, Katerina Rysova, Magdaléna Rysová, Ondrej Bojar:
Towards Automated Spoken Language Assessment: A Study of ASR Transcription of Examinations for Non-Native Speakers of Czech. Prague Bull. Math. Linguistics 122: 43- (2024) - [c208]Matthias Sperber, Ondrej Bojar, Barry Haddow, Dávid Javorský, Xutai Ma, Matteo Negri, Jan Niehues, Peter Polák, Elizabeth Salesky, Katsuhito Sudoh, Marco Turchi:
Evaluating the IWSLT2023 Speech Translation Tasks: Human Annotations, Automatic Metrics, and Segmentation. LREC/COLING 2024: 6484-6495 - [c207]Josef Jon, Ondrej Bojar:
GAATME: A Genetic Algorithm for Adversarial Translation Metrics Evaluation. LREC/COLING 2024: 7562-7569 - [c206]Dominika Durisková, Daniela Jurásová, Matús Zilinec, Eduard Subert, Ondrej Bojar:
Khan Academy Corpus: A Multilingual Corpus of Khan Academy Lectures. LREC/COLING 2024: 9743-9752 - [c205]Uladzislau Yorsh, Martin Holena, Ondrej Bojar, David Herel:
On Difficulties of Attention Factorization through Shared Memory. Tiny Papers @ ICLR 2024 - [c204]Adam Osuský, Dávid Javorský, Ondrej Bojar:
InsBERT: Word Importance from Artificial Insertions. ITAT 2024: 94-104 - [c203]Tom Kocmi, Eleftherios Avramidis, Rachel Bawden, Ondrej Bojar, Anton Dvorkovich, Christian Federmann, Mark Fishel, Markus Freitag, Thamme Gowda, Roman Grundkiewicz, Barry Haddow, Marzena Karpinska, Philipp Koehn, Benjamin Marie, Christof Monz, Kenton Murray, Masaaki Nagata, Martin Popel, Maja Popovic, Mariya Shmatova, Steinthór Steingrímsson, Vilém Zouhar:
Findings of the WMT24 General Machine Translation Shared Task: The LLM Era Is Here but MT Is Not Solved Yet. WMT 2024: 1-46 - [c202]Miroslav Hrabal, Josef Jon, Martin Popel, Nam Luu, Danil Semin, Ondrej Bojar:
CUNI at WMT24 General Translation Task: LLMs, (Q)LoRA, CPO and Model Merging. WMT 2024: 232-246 - [c201]Shantipriya Parida, Ondrej Bojar, Idris Abdulmumin, Shamsuddeen Hassan Muhammad, Ibrahim Said Ahmad:
Findings of WMT2024 English-to-Low Resource Multimodal Translation Task. WMT 2024: 677-683 - [i82]Vilém Zouhar, Ondrej Bojar:
Quality and Quantity of Machine Translation References for Automated Metrics. CoRR abs/2401.01283 (2024) - [i81]Uladzislau Yorsh, Martin Holeña, Ondrej Bojar, David Herel:
On Difficulties of Attention Factorization through Shared Memory. CoRR abs/2404.00798 (2024) - [i80]Sunit Bhattacharya, Ondrej Bojar:
Understanding the role of FFNs in driving multilingual behaviour in LLMs. CoRR abs/2404.13855 (2024) - [i79]Matthias Sperber, Ondrej Bojar, Barry Haddow, Dávid Javorský, Xutai Ma, Matteo Negri, Jan Niehues, Peter Polák, Elizabeth Salesky, Katsuhito Sudoh, Marco Turchi:
Evaluating the IWSLT2023 Speech Translation Tasks: Human Annotations, Automatic Metrics, and Segmentation. CoRR abs/2406.03881 (2024) - [i78]Tom Kocmi, Eleftherios Avramidis, Rachel Bawden, Ondrej Bojar, Anton Dvorkovich, Christian Federmann, Mark Fishel, Markus Freitag, Thamme Gowda, Roman Grundkiewicz, Barry Haddow, Marzena Karpinska, Philipp Koehn, Benjamin Marie, Kenton Murray, Masaaki Nagata, Martin Popel, Maja Popovic, Mariya Shmatova, Steinþór Steingrímsson, Vilém Zouhar:
Preliminary WMT24 Ranking of General MT Systems and LLMs. CoRR abs/2407.19884 (2024) - [i77]Patrik Zavoral, Dusan Varis, Ondrej Bojar:
Adversarial Testing as a Tool for Interpretability: Length-based Overfitting of Elementary Functions in Transformers. CoRR abs/2410.13802 (2024) - [i76]Ibrahim Said Ahmad, Antonios Anastasopoulos, Ondrej Bojar, Claudia Borg, Marine Carpuat, Roldano Cattoni, Mauro Cettolo, William Chen, Qianqian Dong, Marcello Federico, Barry Haddow, Dávid Javorský, Mateusz Krubinski, Tsz Kin Lam, Xutai Ma, Prashant Mathur, Evgeny Matusov, Chandresh Maurya, John P. McCrae, Kenton Murray, Satoshi Nakamura, Matteo Negri, Jan Niehues, Xing Niu, Atul Kr. Ojha, John E. Ortega, Sara Papi, Peter Polák, Adam Pospísil, Pavel Pecina, Elizabeth Salesky, Nivedita Sethiya, Balaram Sarkar, Jiatong Shi, Claytone Sikasote, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Brian Thompson, Marco Turchi, Alex Waibel, Shinji Watanabe, Patrick Wilken, Petr Zemánek, Rodolfo Zevallos:
Findings of the IWSLT 2024 Evaluation Campaign. CoRR abs/2411.05088 (2024) - [i75]Sara Papi, Peter Polak, Ondrej Bojar, Dominik Machácek:
How "Real" is Your Real-Time Simultaneous Speech-to-Text Translation System? CoRR abs/2412.18495 (2024) - 2023
- [c200]Josef Jon, Ondrej Bojar:
Breeding Machine Translations: Evolutionary approach to survive and thrive in the world of automated evaluation. ACL (1) 2023: 2191-2212 - [c199]Dominik Machácek, Peter Polak, Ondrej Bojar, Raj Dabre:
Robustness of Multi-Source MT to Transcription Errors. ACL (Findings) 2023: 3707-3723 - [c198]Dávid Javorský, Ondrej Bojar, François Yvon:
Assessing Word Importance Using Models Trained for Semantic Tasks. ACL (Findings) 2023: 8846-8856 - [c197]Shantipriya Parida, Idris Abdulmumin, Shamsuddeen Hassan Muhammad, Aneesh Bose, Guneet Singh Kohli, Ibrahim Said Ahmad, Ketan Kotwal, Sayan Deb Sarkar, Ondrej Bojar, Habeebah A. Kakudi:
HaVQA: A Dataset for Visual Question Answering and Multimodal Research in Hausa Language. ACL (Findings) 2023: 10162-10183 - [c196]Sunit Bhattacharya, Ondrej Bojar:
Unveiling Multilinguality in Transformer Models: Exploring Language Specificity in Feed-Forward Networks. BlackboxNLP@EMNLP 2023: 120-126 - [c195]Frantisek Kmjec, Ondrej Bojar:
Team Iterate @ AutoMin 2023 - Experiments with Iterative Minuting. INLG (Generation Challenges) 2023: 114-120 - [c194]Tirthankar Ghosal, Ondrej Bojar, Marie Hledíková, Tom Kocmi, Anna Nedoluzhko:
Overview of the Second Shared Task on Automatic Minuting (AutoMin) at INLG 2023. INLG (Generation Challenges) 2023: 138-167 - [c193]Peter Polák, Brian Yan, Shinji Watanabe, Alex Waibel, Ondrej Bojar:
Incremental Blockwise Beam Search for Simultaneous Speech Translation with Controllable Quality-Latency Tradeoff. INTERSPEECH 2023: 3979-3983 - [c192]Andrej Perkovic, Jernej Vicic, Dávid Javorský, Ondrej Bojar:
Shortening of the Results of Machine Translation using Paraphrasing Dataset. ITAT 2023: 121-130 - [c191]Sweta Agrawal, Antonios Anastasopoulos, Luisa Bentivogli, Ondrej Bojar, Claudia Borg, Marine Carpuat, Roldano Cattoni, Mauro Cettolo, Mingda Chen, William Chen, Khalid Choukri, Alexandra Chronopoulou, Anna Currey, Thierry Declerck, Qianqian Dong, Kevin Duh, Yannick Estève, Marcello Federico, Souhir Gahbiche, Barry Haddow, Benjamin Hsu, Phu Mon Htut, Hirofumi Inaguma, Dávid Javorský, John Judge, Yasumasa Kano, Tom Ko, Rishu Kumar, Pengwei Li, Xutai Ma, Prashant Mathur, Evgeny Matusov, Paul McNamee, John P. McCrae, Kenton Murray, Maria Nadejde, Satoshi Nakamura, Matteo Negri, Ha Nguyen, Jan Niehues, Xing Niu, Atul Kr. Ojha, John E. Ortega, Proyag Pal, Juan Pino, Lonneke van der Plas, Peter Polák, Elijah Rippeth, Elizabeth Salesky, Jiatong Shi, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Yun Tang, Brian Thompson, Kevin Tran, Marco Turchi, Alex Waibel, Mingxuan Wang, Shinji Watanabe, Rodolfo Zevallos:
Findings of the IWSLT 2023 Evaluation Campaign. IWSLT@ACL 2023: 1-61 - [c190]Dominik Machácek, Ondrej Bojar, Raj Dabre:
MT Metrics Correlate with Human Ratings of Simultaneous Speech Translation. IWSLT@ACL 2023: 169-179 - [c189]Peter Polak, Danni Liu, Ngoc-Quan Pham, Jan Niehues, Alexander Waibel, Ondrej Bojar:
Towards Efficient Simultaneous Speech Translation: CUNI-KIT System for Simultaneous Track at IWSLT 2023. IWSLT@ACL 2023: 389-396 - [c188]Frantisek Trebuna, Kristína Szabová, Ondrej Bojar:
Searching for Reasons of Transformers' Success: Memorization vs Generalization. TSD 2023: 25-32 - [c187]Tom Kocmi, Eleftherios Avramidis, Rachel Bawden, Ondrej Bojar, Anton Dvorkovich, Christian Federmann, Mark Fishel, Markus Freitag, Thamme Gowda, Roman Grundkiewicz, Barry Haddow, Philipp Koehn, Benjamin Marie, Christof Monz, Makoto Morishita, Kenton Murray, Makoto Nagata, Toshiaki Nakazawa, Martin Popel, Maja Popovic, Mariya Shmatova:
Findings of the 2023 Conference on Machine Translation (WMT23): LLMs Are Here but Not Quite There Yet. WMT 2023: 1-42 - [c186]Josef Jon, Martin Popel, Ondrej Bojar:
CUNI at WMT23 General Translation Task: MT and a Genetic Algorithm. WMT 2023: 119-127 - [c185]Ivana Kvapilíková, Ondrej Bojar:
Low-Resource Machine Translation Systems for Indic Languages. WMT 2023: 954-958 - [i74]Vilém Zouhar, Sunit Bhattacharya, Ondrej Bojar:
Multimodal Shannon Game with Images. CoRR abs/2303.11192 (2023) - [i73]Dominik Machácek, Peter Polák, Ondrej Bojar, Raj Dabre:
Robustness of Multi-Source MT to Transcription Errors. CoRR abs/2305.16894 (2023) - [i72]Shantipriya Parida, Idris Abdulmumin, Shamsuddeen Hassan Muhammad, Aneesh Bose, Guneet Singh Kohli, Ibrahim Said Ahmad, Ketan Kotwal, Sayan Deb Sarkar, Ondrej Bojar, Habeebah Adamu Kakudi:
HaVQA: A Dataset for Visual Question Answering and Multimodal Research in Hausa Language. CoRR abs/2305.17690 (2023) - [i71]Josef Jon, Ondrej Bojar:
Breeding Machine Translations: Evolutionary approach to survive and thrive in the world of automated evaluation. CoRR abs/2305.19330 (2023) - [i70]Dávid Javorský, Ondrej Bojar, François Yvon:
Assessing Word Importance Using Models Trained for Semantic Tasks. CoRR abs/2305.19689 (2023) - [i69]Dominik Machácek, Raj Dabre, Ondrej Bojar:
Turning Whisper into Real-Time Transcription System. CoRR abs/2307.14743 (2023) - [i68]Josef Jon, Dusan Varis, Michal Novák, João Paulo Aires, Ondrej Bojar:
Negative Lexical Constraints in Neural Machine Translation. CoRR abs/2308.03601 (2023) - [i67]Josef Jon, Ondrej Bojar:
Character-level NMT and language similarity. CoRR abs/2308.04398 (2023) - [i66]Frantisek Kmjec, Ondrej Bojar:
Minuteman: Machine and Human Joining Forces in Meeting Summarization. CoRR abs/2309.05272 (2023) - [i65]Peter Polák, Brian Yan, Shinji Watanabe, Alex Waibel, Ondrej Bojar:
Incremental Blockwise Beam Search for Simultaneous Speech Translation with Controllable Quality-Latency Tradeoff. CoRR abs/2309.11379 (2023) - [i64]Peter Polák, Ondrej Bojar:
Long-Form End-to-End Speech Translation via Latent Alignment Segmentation. CoRR abs/2309.11384 (2023) - [i63]Ivana Kvapilíková, Ondrej Bojar:
Boosting Unsupervised Machine Translation with Pseudo-Parallel Data. CoRR abs/2310.14262 (2023) - [i62]Sunit Bhattacharya, Ondrej Bojar:
Unveiling Multilinguality in Transformer Models: Exploring Language Specificity in Feed-Forward Networks. CoRR abs/2310.15552 (2023) - [i61]Vilém Zouhar, Vera Kloudová, Martin Popel, Ondrej Bojar:
Evaluating Optimal Reference Translations. CoRR abs/2311.16787 (2023) - 2022
- [c184]Toshiaki Nakazawa, Hideya Mino, Isao Goto, Raj Dabre, Shohei Higashiyama, Shantipriya Parida, Anoop Kunchukuttan, Makoto Morishita, Ondrej Bojar, Chenhui Chu, Akiko Eriguchi, Kaori Abe, Yusuke Oda, Sadao Kurohashi:
Overview of the 9th Workshop on Asian Translation. WAT@COLING 2022: 1-36 - [c183]Sunit Bhattacharya, Vilém Zouhar, Ondrej Bojar:
Sentence Ambiguity, Grammaticality and Complexity Probes. BlackboxNLP@EMNLP 2022: 40-50 - [c182]Nalin Kumar, Ondrej Bojar:
Genre Transfer in NMT: Creating Synthetic Spoken Parallel Sentences using Written Parallel Data. ICON 2022: 224-233 - [c181]Antonios Anastasopoulos, Loïc Barrault, Luisa Bentivogli, Marcely Zanon Boito, Ondrej Bojar, Roldano Cattoni, Anna Currey, Georgiana Dinu, Kevin Duh, Maha Elbayad, Clara Emmanuel, Yannick Estève, Marcello Federico, Christian Federmann, Souhir Gahbiche, Hongyu Gong, Roman Grundkiewicz, Barry Haddow, Benjamin Hsu, Dávid Javorský, Vera Kloudová, Surafel Melaku Lakew, Xutai Ma, Prashant Mathur, Paul McNamee, Kenton Murray, Maria Nadejde, Satoshi Nakamura, Matteo Negri, Jan Niehues, Xing Niu, John Ortega, Juan Miguel Pino, Elizabeth Salesky, Jiatong Shi, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Marco Turchi, Yogesh Virkar, Alexander Waibel, Changhan Wang, Shinji Watanabe:
Findings of the IWSLT 2022 Evaluation Campaign. IWSLT@ACL 2022: 98-157 - [c180]Peter Polák, Ngoc-Quan Pham, Tuan-Nam Nguyen, Danni Liu, Carlos Mullov, Jan Niehues, Ondrej Bojar, Alexander Waibel:
CUNI-KIT System for Simultaneous Speech Translation Task at IWSLT 2022. IWSLT@ACL 2022: 277-285 - [c179]Peter Polák, Muskaan Singh, Anna Nedoluzhko, Ondrej Bojar:
ALIGNMEET: A Comprehensive Tool for Meeting Annotation, Alignment, and Evaluation. LREC 2022: 1771-1779 - [c178]Anna Nedoluzhko, Muskaan Singh, Marie Hledíková, Tirthankar Ghosal, Ondrej Bojar:
ELITR Minuting Corpus: A Novel Dataset for Automatic Minuting from Multi-Party Meetings in English and Czech. LREC 2022: 3174-3182 - [c177]Idris Abdulmumin, Satya Ranjan Dash, Musa Abdullahi Dawud, Shantipriya Parida, Shamsuddeen Hassan Muhammad, Ibrahim Said Ahmad, Subhadarshi Panda, Ondrej Bojar, Bashir Shehu Galadanci, Bello Shehu Bello:
Hausa Visual Genome: A Dataset for Multi-Modal English to Hausa Machine Translation. LREC 2022: 6471-6479 - [c176]Muskan Garg, Seema Wazarkar, Muskaan Singh, Ondrej Bojar:
Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers. LREC 2022: 6837-6847 - [c175]Kartik Shinde, Tirthankar Ghosal, Muskaan Singh, Ondrej Bojar:
Automatic Minuting: A Pipeline Method for Generating Minutes from Multi-Party Meeting Proceedings. PACLIC 2022: 691-702 - [c174]Tom Kocmi, Rachel Bawden, Ondrej Bojar, Anton Dvorkovich, Christian Federmann, Mark Fishel, Thamme Gowda, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Rebecca Knowles, Philipp Koehn, Christof Monz, Makoto Morishita, Masaaki Nagata, Toshiaki Nakazawa, Michal Novák, Martin Popel, Maja Popovic:
Findings of the 2022 Conference on Machine Translation (WMT22). WMT 2022: 1-45 - [c173]Dávid Javorský, Dominik Machácek, Ondrej Bojar:
Continuous Rating as Reliable Human Evaluation of Simultaneous Speech Translation. WMT 2022: 154-164 - [c172]Josef Jon, Martin Popel, Ondrej Bojar:
CUNI-Bergamot Submission at WMT22 General Translation Task. WMT 2022: 280-289 - [c171]Kirill Semenov, Ondrej Bojar:
Automated Evaluation Metric for Terminology Consistency in MT. WMT 2022: 450-457 - [e12]Philipp Koehn, Loïc Barrault, Ondrej Bojar, Fethi Bougares, Rajen Chatterjee, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Alexander Fraser, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Paco Guzman, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Tom Kocmi, André Martins, Makoto Morishita, Christof Monz, Masaaki Nagata, Toshiaki Nakazawa, Matteo Negri, Aurélie Névéol, Mariana Neves, Martin Popel, Marco Turchi, Marcos Zampieri:
Proceedings of the Seventh Conference on Machine Translation, WMT 2022, Abu Dhabi, United Arab Emirates (Hybrid), December 7-8, 2022. Association for Computational Linguistics 2022, ISBN 978-1-959429-29-6 [contents] - [i60]Tom Kocmi, Dominik Machácek, Ondrej Bojar:
The Reality of Multi-Lingual Machine Translation. CoRR abs/2202.12814 (2022) - [i59]Dávid Javorský, Dominik Machácek, Ondrej Bojar:
Comprehension of Subtitles from Re-Translating Simultaneous Speech Translation. CoRR abs/2203.02458 (2022) - [i58]Christian Huber, Rishu Kumar, Ondrej Bojar, Alexander Waibel:
Short-Term Word-Learning in a Dynamically Changing Environment. CoRR abs/2203.15404 (2022) - [i57]Sunit Bhattacharya, Vera Kloudová, Vilém Zouhar, Ondrej Bojar:
EMMT: A simultaneous eye-tracking, 4-electrode EEG and audio corpus for multi-modal reading and translation scenarios. CoRR abs/2204.02905 (2022) - [i56]Sunit Bhattacharya, Rishu Kumar, Ondrej Bojar:
Team ÚFAL at CMCL 2022 Shared Task: Figuring out the correct recipe for predicting Eye-Tracking features using Pretrained Language Models. CoRR abs/2204.04998 (2022) - [i55]Peter Polák, Ngoc-Quan Ngoc, Tuan-Nam Nguyen, Danni Liu, Carlos Mullov, Jan Niehues, Ondrej Bojar, Alexander Waibel:
CUNI-KIT System for Simultaneous Speech Translation Task at IWSLT 2022. CoRR abs/2204.06028 (2022) - [i54]Idris Abdulmumin, Satya Ranjan Dash, Musa Abdullahi Dawud, Shantipriya Parida, Shamsuddeen Hassan Muhammad, Ibrahim Said Ahmad, Subhadarshi Panda, Ondrej Bojar, Bashir Shehu Galadanci, Bello Shehu Bello:
Hausa Visual Genome: A Dataset for Multi-Modal English to Hausa Machine Translation. CoRR abs/2205.01133 (2022) - [i53]Peter Polák, Muskaan Singh, Anna Nedoluzhko, Ondrej Bojar:
ALIGNMEET: A Comprehensive Tool for Meeting Annotation, Alignment, and Evaluation. CoRR abs/2205.05433 (2022) - [i52]Sunit Bhattacharya, Vilém Zouhar, Ondrej Bojar:
Sentence Ambiguity, Grammaticality and Complexity Probes. CoRR abs/2210.06928 (2022) - [i51]Sukanta Sen, Ondrej Bojar, Barry Haddow:
Simultaneous Translation for Unsegmented Input: A Sliding Window Approach. CoRR abs/2210.09754 (2022) - [i50]Dominik Machácek, Ondrej Bojar, Raj Dabre:
MT Metrics Correlate with Human Ratings of Simultaneous Speech Translation. CoRR abs/2211.08633 (2022) - [i49]Josef Jon, Martin Popel, Ondrej Bojar:
CUNI Submission in WMT22 General Task. CoRR abs/2211.16174 (2022) - 2021
- [j26]Tirthankar Ghosal, Muskaan Singh, Anja Nedoluzhko, Ondrej Bojar:
Report on the SIGDial 2021 special session on summarization of dialogues and multi-party meetings (SummDial). SIGIR Forum 55(2): 12:1-12:17 (2021) - [c170]Josef Jon, João Paulo Aires, Dusan Varis, Ondrej Bojar:
End-to-End Lexically Constrained Machine Translation for Morphologically Rich Languages. ACL/IJCNLP (1) 2021: 4019-4033 - [c169]Toshiaki Nakazawa, Hideki Nakayama, Chenchen Ding, Raj Dabre, Shohei Higashiyama, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondrej Bojar, Chenhui Chu, Akiko Eriguchi, Kaori Abe, Yusuke Oda, Sadao Kurohashi:
Overview of the 8th Workshop on Asian Translation. WAT@ACL/IJCNLP 2021: 1-45 - [c168]Shantipriya Parida, Subhadarshi Panda, Ketan Kotwal, Amulya Ratna Dash, Satya Ranjan Dash, Yashvardhan Sharma, Petr Motlícek, Ondrej Bojar:
NLPHut's Participation at WAT2021. WAT@ACL/IJCNLP 2021: 146-154 - [c167]Ebrahim Ansari, Ondrej Bojar, Barry Haddow, Mohammad Mahmoudi:
SLTEV: Comprehensive Evaluation of Spoken Language Translation. EACL (System Demonstrations) 2021: 71-79 - [c166]Ondrej Bojar, Dominik Machácek, Sangeet Sagar, Otakar Smrz, Jonás Kratochvíl, Peter Polak, Ebrahim Ansari, Mohammad Mahmoudi, Rishu Kumar, Dario Franceschini, Chiara Canton, Ivan Simonini, Thai-Son Nguyen, Felix Schneider, Sebastian Stüker, Alex Waibel, Barry Haddow, Rico Sennrich, Philip Williams:
ELITR Multilingual Live Subtitling: Demo and Strategy. EACL (System Demonstrations) 2021: 271-277 - [c165]Rudolf Rosa, Tomás Musil, Ondrej Dusek, Dominik Jurko, Patrícia Schmidtová, David Marecek, Ondrej Bojar, Tom Kocmi, Daniel Hrbek, David Kosták, Martina Kinská, Marie Nováková, Josef Dolezal, Klára Vosecká, Tomás Studeník, Petr Zabka:
THEaiTRE 1.0: Interactive Generation of Theatre Play Scripts. Text2Story@ECIR 2021: 71-76 - [c164]Dusan Varis, Ondrej Bojar:
Sequence Length is a Domain: Length-based Overfitting in Transformer Models. EMNLP (1) 2021: 8246-8257 - [c163]Vilém Zouhar, Martin Popel, Ondrej Bojar, Ales Tamchyna:
Neural Machine Translation Quality and Post-Editing Performance. EMNLP (1) 2021: 10204-10214 - [c162]Peter Polák, Muskaan Singh, Ondrej Bojar:
Explainable Quality Estimation: CUNI Eval4NLP Submission. Eval4NLP 2021: 250-255 - [c161]Arghyadeep Sen, Shantipriya Parida, Ketan Kotwal, Subhadarshi Panda, Ondrej Bojar, Satya Ranjan Dash:
Bengali Visual Genome: A Multimodal Dataset for Machine Translation and Image Captioning. FICTA (1) 2021: 63-70 - [c160]Niyati Bafna, Martin Vastl, Ondrej Bojar:
Constrained Decoding for Technical Term Retention in English-Hindi MT. ICON 2021: 1-6 - [c159]Dominik Machácek, Matús Zilinec, Ondrej Bojar:
Lost in Interpreting: Speech Translation from Source or Interpreter? Interspeech 2021: 2376-2380 - [c158]Rudolf Rosa, Tomás Musil, Ondrej Dusek, Dominik Jurko, Patrícia Schmidtová, David Marecek, Ondrej Bojar, Tom Kocmi, Daniel Hrbek, David Kosták, Martina Kinská, Marie Nováková, Josef Dolezal, Klára Vosecká, Tomás Studeník, Petr Zabka:
When a Robot Writes a Play: Automatically Generating a Theatre Play Script. ALIFE 2021: 60 - [c157]Peter Polák, Ondrej Bojar:
Coarse-To-Fine And Cross-Lingual ASR Transfer. ITAT 2021: 154-160 - [c156]Ivana Kvapilíková, Ondrej Bojar:
Machine Translation of Covid-19 Information Resources via Multilingual Transfer. ITAT 2021: 176-181 - [c155]Antonios Anastasopoulos, Ondrej Bojar, Jacob Bremerman, Roldano Cattoni, Maha Elbayad, Marcello Federico, Xutai Ma, Satoshi Nakamura, Matteo Negri, Jan Niehues, Juan Miguel Pino, Elizabeth Salesky, Sebastian Stüker, Katsuhito Sudoh, Marco Turchi, Alex Waibel, Changhan Wang, Matthew Wiesner:
Findings of the IWSLT 2021 Evaluation Campaign. IWSLT 2021: 1-29 - [c154]Ondrej Bojar, Vojtech Srdecný, Rishu Kumar, Otakar Smrz, Felix Schneider, Barry Haddow, Phil Williams, Chiara Canton:
Operating a Complex SLT System with Speakers and Human Interpreters. ASLTRW@MTSummit 2021: 23-34 - [c153]Vilém Zouhar, Michal Novák, Matús Zilinec, Ondrej Bojar, Mateo Obregón, Robin L. Hill, Frédéric Blain, Marina Fomicheva, Lucia Specia, Lisa Yankovskaya:
Backtranslation Feedback Improves User Confidence in MT, Not Quality. NAACL-HLT 2021: 151-161 - [c152]Muskaan Singh, Tirthankar Ghosal, Ondrej Bojar:
An Empirical Performance Analysis of State-of-the-Art Summarization Models for Automatic Minuting. PACLIC 2021: 50-60 - [c151]Matyás Kopp, Vladislav Stankov, Jan Oldrich Kruza, Pavel Stranák, Ondrej Bojar:
ParCzech 3.0: A Large Czech Speech Corpus with Rich Metadata. TDS 2021: 293-304 - [c150]Farhad Akhbardeh, Arkady Arkhangorodsky, Magdalena Biesialska, Ondrej Bojar, Rajen Chatterjee, Vishrav Chaudhary, Marta R. Costa-jussà, Cristina España-Bonet, Angela Fan, Christian Federmann, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Leonie Harter, Kenneth Heafield, Christopher Homan, Matthias Huck, Kwabena Amponsah-Kaakyire, Jungo Kasai, Daniel Khashabi, Kevin Knight, Tom Kocmi, Philipp Koehn, Nicholas Lourie, Christof Monz, Makoto Morishita, Masaaki Nagata, Ajay Nagesh, Toshiaki Nakazawa, Matteo Negri, Santanu Pal, Allahsera Auguste Tapo, Marco Turchi, Valentin Vydrin, Marcos Zampieri:
Findings of the 2021 Conference on Machine Translation (WMT21). WMT@EMNLP 2021: 1-88 - [c149]Petr Gebauer, Ondrej Bojar, Vojtech Svandelík, Martin Popel:
CUNI Systems in WMT21: Revisiting Backtranslation Techniques for English-Czech NMT. WMT@EMNLP 2021: 123-129 - [c148]Josef Jon, Michal Novák, João Paulo Aires, Dusan Varis, Ondrej Bojar:
CUNI systems for WMT21: Multilingual Low-Resource Translation for Indo-European Languages Shared Task. WMT@EMNLP 2021: 354-361 - [c147]Michael Hanna, Ondrej Bojar:
A Fine-Grained Analysis of BERTScore. WMT@EMNLP 2021: 507-517 - [c146]Markus Freitag, Ricardo Rei, Nitika Mathur, Chi-kiu Lo, Craig Stewart, George F. Foster, Alon Lavie, Ondrej Bojar:
Results of the WMT21 Metrics Shared Task: Evaluating Metrics with Expert-based Human Evaluations on TED and News Domain. WMT@EMNLP 2021: 733-774 - [c145]Josef Jon, Michal Novák, João Paulo Aires, Dusan Varis, Ondrej Bojar:
CUNI Systems for WMT21: Terminology Translation Shared Task. WMT@EMNLP 2021: 828-834 - [e11]Toshiaki Nakazawa, Hideki Nakayama, Isao Goto, Hideya Mino, Chenchen Ding, Raj Dabre, Anoop Kunchukuttan, Shohei Higashiyama, Hiroshi Manabe, Win Pa Pa, Shantipriya Parida, Ondrej Bojar, Chenhui Chu, Akiko Eriguchi, Kaori Abe, Yusuke Oda, Katsuhito Sudoh, Sadao Kurohashi, Pushpak Bhattacharyya:
Proceedings of the 8th Workshop on Asian Translation, WAT@ACL/IJCNLP 2021, Online, August 5-6, 2021. Association for Computational Linguistics 2021, ISBN 978-1-954085-63-3 [contents] - [e10]Loïc Barrault, Ondrej Bojar, Fethi Bougares, Rajen Chatterjee, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Alexander Fraser, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Paco Guzman, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Philipp Koehn, Tom Kocmi, André Martins, Makoto Morishita, Christof Monz:
Proceedings of the Sixth Conference on Machine Translation, WMT@EMNLP 2021, Online Event, November 10-11, 2021. Association for Computational Linguistics 2021, ISBN 978-1-954085-94-7 [contents] - [i48]Rudolf Rosa, Tomás Musil, Ondrej Dusek, Dominik Jurko, Patrícia Schmidtová, David Marecek, Ondrej Bojar, Tom Kocmi, Daniel Hrbek, David Kosták, Martina Kinská, Marie Nováková, Josef Dolezal, Klára Vosecká, Tomás Studeník, Petr Zabka:
THEaiTRE 1.0: Interactive generation of theatre play scripts. CoRR abs/2102.08892 (2021) - [i47]Vilém Zouhar, Michal Novák, Matús Zilinec, Ondrej Bojar, Mateo Obregón, Robin L. Hill, Frédéric Blain, Marina Fomicheva, Lucia Specia, Lisa Yankovskaya:
Backtranslation Feedback Improves User Confidence in MT, Not Quality. CoRR abs/2104.05688 (2021) - [i46]Ivana Kvapilíková, Mikel Artetxe, Gorka Labaka, Eneko Agirre, Ondrej Bojar:
Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining. CoRR abs/2105.10419 (2021) - [i45]Dominik Machácek, Matús Zilinec, Ondrej Bojar:
Lost in Interpreting: Speech Translation from Source or Interpreter? CoRR abs/2106.09343 (2021) - [i44]Josef Jon, João Paulo Aires, Dusan Varis, Ondrej Bojar:
End-to-End Lexically Constrained Machine Translation for Morphologically Rich Languages. CoRR abs/2106.12398 (2021) - [i43]Peter Polák, Ondrej Bojar:
Coarse-To-Fine And Cross-Lingual ASR Transfer. CoRR abs/2109.00916 (2021) - [i42]Vilém Zouhar, Ales Tamchyna, Martin Popel, Ondrej Bojar:
Neural Machine Translation Quality and Post-Editing Performance. CoRR abs/2109.05016 (2021) - [i41]Dusan Varis, Ondrej Bojar:
Sequence Length is a Domain: Length-based Overfitting in Transformer Models. CoRR abs/2109.07276 (2021) - [i40]Josef Jon, Michal Novák, João Paulo Aires, Dusan Varis, Ondrej Bojar:
CUNI systems for WMT21: Terminology translation Shared Task. CoRR abs/2109.09350 (2021) - [i39]Josef Jon, Michal Novák, João Paulo Aires, Dusan Varis, Ondrej Bojar:
CUNI systems for WMT21: Multilingual Low-Resource Translation for Indo-European Languages Shared Task. CoRR abs/2109.09354 (2021) - 2020
- [j25]Esaú Villatoro-Tello, Shantipriya Parida, Petr Motlícek, Ondrej Bojar:
Inferring Highly-dense Representations for Clustering Broadcast Media Content. Prague Bull. Math. Linguistics 115: 31-50 (2020) - [c144]Ivana Kvapilíková, Mikel Artetxe, Gorka Labaka, Eneko Agirre, Ondrej Bojar:
Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining. ACL (student) 2020: 255-262 - [c143]Toshiaki Nakazawa, Hideki Nakayama, Chenchen Ding, Raj Dabre, Shohei Higashiyama, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondrej Bojar, Sadao Kurohashi:
Overview of the 7th Workshop on Asian Translation. WAT@AAC/IJCNLPL 2020: 1-44 - [c142]Shantipriya Parida, Petr Motlícek, Amulya Ratna Dash, Satya Ranjan Dash, Debasish Kumar Mallick, Satya Prakash Biswal, Priyanka Pattnaik, Biranchi Narayan Nayak, Ondrej Bojar:
ODIANLP's Participation in WAT2020. WAT@AAC/IJCNLPL 2020: 103-108 - [c141]Tom Kocmi, Ondrej Bojar:
Efficiently Reusing Old Models Across Languages via Transfer Learning. EAMT 2020: 19-28 - [c140]Ondrej Bojar, Dominik Machácek, Sangeet Sagar, Otakar Smrz, Jonás Kratochvíl, Ebrahim Ansari, Dario Franceschini, Chiara Canton, Ivan Simonini, Thai-Son Nguyen, Felix Schneider, Sebastian Stüker, Alex Waibel, Barry Haddow, Rico Sennrich, Philip Williams:
ELITR: European Live Translator. EAMT 2020: 463-464 - [c139]Rudolf Rosa, Ondrej Dusek, Tom Kocmi, David Marecek, Tomás Musil, Patrícia Schmidtová, Dominik Jurko, Ondrej Bojar, Daniel Hrbek, David Kosták, Martina Kinská, Josef Dolezal, Klára Vosecká:
THEaiTRE: Artificial Intelligence to Write a Theatre Play. AI4Narratives@IJCAI 2020: 9-13 - [c138]Dominik Machácek, Ondrej Bojar:
Presenting Simultaneous Translation in Limited Space. ITAT 2020: 34-39 - [c137]Ebrahim Ansari, Amittai Axelrod, Nguyen Bach, Ondrej Bojar, Roldano Cattoni, Fahim Dalvi, Nadir Durrani, Marcello Federico, Christian Federmann, Jiatao Gu, Fei Huang, Kevin Knight, Xutai Ma, Ajay Nagesh, Matteo Negri, Jan Niehues, Juan Miguel Pino, Elizabeth Salesky, Xing Shi, Sebastian Stüker, Marco Turchi, Alexander Waibel, Changhan Wang:
FINDINGS OF THE IWSLT 2020 EVALUATION CAMPAIGN. IWSLT 2020: 1-34 - [c136]Peter Polak, Sangeet Sagar, Dominik Machácek, Ondrej Bojar:
CUNI Neural ASR with Phoneme-Level Intermediate Step for~Non-Native~SLT at IWSLT 2020. IWSLT 2020: 191-199 - [c135]Dominik Machácek, Jonás Kratochvíl, Sangeet Sagar, Matús Zilinec, Ondrej Bojar, Thai-Son Nguyen, Felix Schneider, Philip Williams, Yuekun Yao:
ELITR Non-Native Speech Translation at IWSLT 2020. IWSLT 2020: 200-208 - [c134]Dario Franceschini, Chiara Canton, Ivan Simonini, Armin Schweinfurth, Adelheid Glott, Sebastian Stüker, Thai-Son Nguyen, Felix Schneider, Thanh-Le Ha, Alex Waibel, Barry Haddow, Philip Williams, Rico Sennrich, Ondrej Bojar, Sangeet Sagar, Dominik Machácek, Otakar Smrz:
Removing European Language Barriers with Innovative Machine Translation Technology. IWLTP@LREC 2020: 44-49 - [c133]Petra Barancíková, Ondrej Bojar:
COSTRA 1.0: A Dataset of Complex Sentence Transformations. LREC 2020: 3535-3541 - [c132]Jonás Kratochvíl, Peter Polak, Ondrej Bojar:
Large Corpus of Czech Parliament Plenary Hearings. LREC 2020: 6363-6367 - [c131]Erion Çano, Ondrej Bojar:
Two Huge Title and Keyword Generation Corpora of Research Articles. LREC 2020: 6663-6671 - [c130]Vilém Zouhar, Ondrej Bojar:
Outbound Translation User Interface Ptakopet: A Pilot Study. LREC 2020: 6967-6975 - [c129]Erion Çano, Ondrej Bojar:
How Many Pages?: Paper Length Prediction from the Metadata. NLPIR 2020: 91-95 - [c128]Petra Barancíková, Ondrej Bojar:
Costra 1.1: An Inquiry into Geometric Properties of Sentence Spaces. TDS 2020: 135-143 - [c127]Loïc Barrault, Magdalena Biesialska, Ondrej Bojar, Marta R. Costa-jussà, Christian Federmann, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Matthias Huck, Eric Joanis, Tom Kocmi, Philipp Koehn, Chi-kiu Lo, Nikola Ljubesic, Christof Monz, Makoto Morishita, Masaaki Nagata, Toshiaki Nakazawa, Santanu Pal, Matt Post, Marcos Zampieri:
Findings of the 2020 Conference on Machine Translation (WMT20). WMT@EMNLP 2020: 1-55 - [c126]Vilém Zouhar, Tereza Vojtechová, Ondrej Bojar:
WMT20 Document-Level Markable Error Exploration. WMT@EMNLP 2020: 371-380 - [c125]Nitika Mathur, Johnny Wei, Markus Freitag, Qingsong Ma, Ondrej Bojar:
Results of the WMT20 Metrics Shared Task. WMT@EMNLP 2020: 688-725 - [c124]Ivana Kvapilíková, Tom Kocmi, Ondrej Bojar:
CUNI Systems for the Unsupervised and Very Low Resource Translation Task in WMT20. WMT@EMNLP 2020: 1123-1128 - [e9]Toshiaki Nakazawa, Hideki Nakayama, Chenchen Ding, Raj Dabre, Anoop Kunchukuttan, Win Pa Pa, Ondrej Bojar, Shantipriya Parida, Isao Goto, Hidaya Mino, Hiroshi Manabe, Katsuhito Sudoh, Sadao Kurohashi, Pushpak Bhattacharyya:
Proceedings of the 7th Workshop on Asian Translation, WAT@AACL/IJCNLP 2020, Suzhou, China, December 4, 2020. Association for Computational Linguistics 2020, ISBN 978-1-952148-95-8 [contents] - [e8]Loïc Barrault, Ondrej Bojar, Fethi Bougares, Rajen Chatterjee, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Alexander Fraser, Yvette Graham, Paco Guzman, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Philipp Koehn, André Martins, Makoto Morishita, Christof Monz, Masaaki Nagata, Toshiaki Nakazawa, Matteo Negri:
Proceedings of the Fifth Conference on Machine Translation, WMT@EMNLP 2020, Online, November 19-20, 2020. Association for Computational Linguistics 2020, ISBN 978-1-948087-81-0 [contents] - [i38]Erion Çano, Ondrej Bojar:
Two Huge Title and Keyword Generation Corpora of Research Articles. CoRR abs/2002.04689 (2020) - [i37]Erion Çano, Ondrej Bojar:
Human or Machine: Automating Human Likeliness Evaluation of NLG Texts. CoRR abs/2006.03189 (2020) - [i36]Dominik Machácek, Jonás Kratochvíl, Sangeet Sagar, Matús Zilinec, Ondrej Bojar, Thai-Son Nguyen, Felix Schneider, Philip Williams, Yuekun Yao:
ELITR Non-Native Speech Translation at IWSLT 2020. CoRR abs/2006.03331 (2020) - [i35]Erion Çano, Ondrej Bojar:
Automating Text Naturalness Evaluation of NLG Systems. CoRR abs/2006.13268 (2020) - [i34]Rudolf Rosa, Ondrej Dusek, Tom Kocmi, David Marecek, Tomás Musil, Patrícia Schmidtová, Dominik Jurko, Ondrej Bojar, Daniel Hrbek, David Kosták, Martina Kinská, Josef Dolezal, Klára Vosecká:
THEaiTRE: Artificial Intelligence to Write a Theatre Play. CoRR abs/2006.14668 (2020) - [i33]Tom Kocmi, Martin Popel, Ondrej Bojar:
Announcing CzEng 2.0 Parallel Corpus with over 2 Gigawords. CoRR abs/2007.03006 (2020) - [i32]Dominik Machácek, Ondrej Bojar:
Presenting Simultaneous Translation in Limited Space. CoRR abs/2009.09016 (2020) - [i31]Dusan Varis, Ondrej Bojar:
Unsupervised Pretraining for Neural Machine Translation Using Elastic Weight Consolidation. CoRR abs/2010.09403 (2020) - [i30]Ivana Kvapilíková, Tom Kocmi, Ondrej Bojar:
CUNI Systems for the Unsupervised and Very Low Resource Translation Task in WMT20. CoRR abs/2010.11747 (2020) - [i29]Erion Çano, Ondrej Bojar:
How Many Pages? Paper Length Prediction from the Metadata. CoRR abs/2010.15924 (2020)
2010 – 2019
- 2019
- [j24]Thuong-Hai Pham, Dominik Machácek, Ondrej Bojar:
Promoting the Knowledge of Source Syntax in Transformer NMT Is Not Needed. Computación y Sistemas 23(3) (2019) - [j23]Shantipriya Parida, Ondrej Bojar, Satya Ranjan Dash:
Hindi Visual Genome: A Dataset for Multi-Modal English to Hindi Machine Translation. Computación y Sistemas 23(4) (2019) - [j22]Ondrej Bojar, Raffaella Bernardi, Bonnie Nash-Webber:
Representation of sentence meaning (A JNLE Special Issue). Nat. Lang. Eng. 25(4): 427-432 (2019) - [j21]Daniel Kondratyuk, Ronald Cardenas, Ondrej Bojar:
Replacing Linguists with Dummies: A Serious Need for Trivial Baselines in Multi-Task Neural Machine Translation. Prague Bull. Math. Linguistics 113: 31- (2019) - [c123]Dusan Varis, Ondrej Bojar:
Unsupervised Pretraining for Neural Machine Translation Using Elastic Weight Consolidation. ACL (2) 2019: 130-135 - [c122]Toshiaki Nakazawa, Nobushige Doi, Shohei Higashiyama, Chenchen Ding, Raj Dabre, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondrej Bojar, Sadao Kurohashi:
Overview of the 6th Workshop on Asian Translation. WAT@EMNLP-IJCNLP 2019: 1-35 - [c121]Shantipriya Parida, Ondrej Bojar, Petr Motlícek:
Idiap NMT System for WAT 2019 Multimodal Translation Task. WAT@EMNLP-IJCNLP 2019: 175-180 - [c120]Erion Çano, Ondrej Bojar:
Keyphrase Generation: A Multi-Aspect Survey. FRUCT 2019: 85-94 - [c119]Erion Çano, Ondrej Bojar:
Sentiment Analysis of Czech Texts: An Algorithmic Survey. ICAART (2) 2019: 973-979 - [c118]Erion Çano, Ondrej Bojar:
Efficiency Metrics for Data-Driven Models: A Text Summarization Case Study. INLG 2019: 229-239 - [c117]Anna Nedoluzhko, Ondrej Bojar:
Towards Automatic Minuting of the Meetings. ITAT 2019: 112-119 - [c116]Petra Barancíková, Ondrej Bojar:
In Search for Linear Relations in Sentence Embedding Spaces. ITAT 2019: 125-132 - [c115]Erion Çano, Ondrej Bojar:
Keyphrase Generation: A Text Summarization Struggle. NAACL-HLT (1) 2019: 666-672 - [c114]Dominik Machácek, Jonás Kratochvíl, Tereza Vojtechová, Ondrej Bojar:
A Speech Test Set of Practice Business Presentations with Additional Relevant Texts. SLSP 2019: 151-161 - [c113]Loïc Barrault, Ondrej Bojar, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Philipp Koehn, Shervin Malmasi, Christof Monz, Mathias Müller, Santanu Pal, Matt Post, Marcos Zampieri:
Findings of the 2019 Conference on Machine Translation (WMT19). WMT (2) 2019: 1-61 - [c112]Qingsong Ma, Johnny Wei, Ondrej Bojar, Yvette Graham:
Results of the WMT19 Metrics Shared Task: Segment-Level and Strong MT Systems Pose Big Challenges. WMT (2) 2019: 62-90 - [c111]Tom Kocmi, Ondrej Bojar:
CUNI Submission for Low-Resource Languages in WMT News 2019. WMT (2) 2019: 234-240 - [c110]Ivana Kvapilíková, Dominik Machácek, Ondrej Bojar:
CUNI Systems for the Unsupervised News Translation Task in WMT 2019. WMT (2) 2019: 241-248 - [c109]Martin Popel, Dominik Machácek, Michal Auersperger, Ondrej Bojar, Pavel Pecina:
English-Czech Systems in WMT19: Document-Level Transformer. WMT (2) 2019: 342-348 - [c108]Katerina Rysova, Magdaléna Rysová, Tomás Musil, Lucie Poláková, Ondrej Bojar:
A Test Suite and Manual Evaluation of Document-Level NMT at WMT19. WMT (2) 2019: 455-463 - [c107]Tereza Vojtechová, Michal Novák, Milos Kloucek, Ondrej Bojar:
SAO WMT19 Test Suite: Machine Translation of Audit Reports. WMT (2) 2019: 481-493 - [e7]Toshiaki Nakazawa, Chenchen Ding, Raj Dabre, Anoop Kunchukuttan, Nobushige Doi, Yusuke Oda, Ondrej Bojar, Shantipriya Parida, Isao Goto, Hidaya Mino:
Proceedings of the 6th Workshop on Asian Translation, WAT@EMNLP-IJCNLP 2019, Hong Kong, China, November 4, 2019. Association for Computational Linguistics 2019, ISBN 978-1-950737-87-1 [contents] - [e6]Ondrej Bojar, Rajen Chatterjee, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Philipp Koehn, André Martins, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana L. Neves, Matt Post, Marco Turchi, Karin Verspoor:
Proceedings of the Fourth Conference on Machine Translation, WMT 2019, Florence, Italy, August 1-2, 2019 - Volume 1: Research Papers. Association for Computational Linguistics 2019, ISBN 978-1-950737-27-7 [contents] - [e5]Ondrej Bojar, Rajen Chatterjee, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Philipp Koehn, André Martins, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana L. Neves, Matt Post, Marco Turchi, Karin Verspoor:
Proceedings of the Fourth Conference on Machine Translation, WMT 2019, Florence, Italy, August 1-2, 2019 - Volume 2: Shared Task Papers, Day 1. Association for Computational Linguistics 2019 [contents] - [e4]Ondrej Bojar, Rajen Chatterjee, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Philipp Koehn, André Martins, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana L. Neves, Matt Post, Marco Turchi, Karin Verspoor:
Proceedings of the Fourth Conference on Machine Translation, WMT 2019, Florence, Italy, August 1-2, 2019 - Volume 3: Shared Task Papers, Day 2. Association for Computational Linguistics 2019 [contents] - [i28]Erion Çano, Ondrej Bojar:
Sentiment Analysis of Czech Texts: An Algorithmic Survey. CoRR abs/1901.02780 (2019) - [i27]Erion Çano, Ondrej Bojar:
Keyphrase Generation: A Text Summarization Struggle. CoRR abs/1904.00110 (2019) - [i26]Shantipriya Parida, Ondrej Bojar, Satya Ranjan Dash:
Hindi Visual Genome: A Dataset for Multimodal English-to-Hindi Machine Translation. CoRR abs/1907.08948 (2019) - [i25]Ivana Kvapilíková, Dominik Machácek, Ondrej Bojar:
CUNI Systems for the Unsupervised News Translation Task in WMT 2019. CoRR abs/1907.12664 (2019) - [i24]Martin Popel, Dominik Machácek, Michal Auersperger, Ondrej Bojar, Pavel Pecina:
English-Czech Systems in WMT19: Document-Level Transformer. CoRR abs/1907.12750 (2019) - [i23]Dominik Machácek, Jonás Kratochvíl, Tereza Vojtechová, Ondrej Bojar:
A Speech Test Set of Practice Business Presentations with Additional Relevant Texts. CoRR abs/1908.00916 (2019) - [i22]Katerina Rysova, Magdaléna Rysová, Tomás Musil, Lucie Poláková, Ondrej Bojar:
A Test Suite and Manual Evaluation of Document-Level NMT at WMT19. CoRR abs/1908.03043 (2019) - [i21]Tereza Vojtechová, Michal Novák, Milos Kloucek, Ondrej Bojar:
SAO WMT19 Test Suite: Machine Translation of Audit Reports. CoRR abs/1909.01701 (2019) - [i20]Erion Çano, Ondrej Bojar:
Efficiency Metrics for Data-Driven Models: A Text Summarization Case Study. CoRR abs/1909.06618 (2019) - [i19]Tom Kocmi, Ondrej Bojar:
Transfer Learning across Languages from Someone Else's NMT Model. CoRR abs/1909.10955 (2019) - [i18]Petra Barancíková, Ondrej Bojar:
In Search for Linear Relations in Sentence Embedding Spaces. CoRR abs/1910.03375 (2019) - [i17]Erion Çano, Ondrej Bojar:
Keyphrase Generation: A Multi-Aspect Survey. CoRR abs/1910.05059 (2019) - [i16]Thuong-Hai Pham, Dominik Machácek, Ondrej Bojar:
Promoting the Knowledge of Source Syntax in Transformer NMT Is Not Needed. CoRR abs/1910.11218 (2019) - [i15]Vilém Zouhar, Ondrej Bojar:
Outbound Translation User Interface Ptakopet: A Pilot Study. CoRR abs/1911.10835 (2019) - [i14]Petra Barancíková, Ondrej Bojar:
COSTRA 1.0: A Dataset of Complex Sentence Transformations. CoRR abs/1912.01673 (2019) - 2018
- [j20]Martin Popel, Ondrej Bojar:
Training Tips for the Transformer Model. Prague Bull. Math. Linguistics 110: 43-70 (2018) - [c106]Ondrej Cífka, Ondrej Bojar:
Are BLEU and Meaning Representation in Opposition? ACL (1) 2018: 1362-1371 - [c105]Tom Kocmi, Shantipriya Parida, Ondrej Bojar:
CUNI NMT System for WAT 2018 Translation Tasks. WAT@PACLIC 2018 - [c104]Jindrich Helcl, Jindrich Libovický, Tom Kocmi, Tomás Musil, Ondrej Cífka, Dusan Varis, Ondrej Bojar:
Neural Monkey: The Current State and Beyond. AMTA (1) 2018: 168-176 - [c103]Shantipriya Parida, Ondrej Bojar:
Translating Short Segments with NMT: A Case Study in English-to-Hindi. EAMT 2018: 249-258 - [c102]Tom Kocmi, Dusan Varis, Ondrej Bojar:
CUNI Basque-to-English Submission in IWSLT18. IWSLT 2018: 142-146 - [c101]Dominik Machácek, Jonás Vidra, Ondrej Bojar:
Morphological and Language-Agnostic Word Segmentation for NMT. TSD 2018: 277-284 - [c100]Tom Kocmi, Ondrej Bojar:
Trivial Transfer Learning for Low-Resource Neural Machine Translation. WMT 2018: 244-252 - [c99]Ondrej Bojar, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Philipp Koehn, Christof Monz:
Findings of the 2018 Conference on Machine Translation (WMT18). WMT (shared task) 2018: 272-303 - [c98]Tom Kocmi, Roman Sudarikov, Ondrej Bojar:
CUNI Submissions in WMT18. WMT (shared task) 2018: 431-437 - [c97]Ondrej Bojar, Jirí Mírovský, Katerina Rysova, Magdaléna Rysová:
EvalD Reference-Less Discourse Evaluation for WMT18. WMT (shared task) 2018: 541-545 - [c96]Franck Burlot, Yves Scherrer, Vinit Ravishankar, Ondrej Bojar, Stig-Arne Grönroos, Maarit Koponen, Tommi Nieminen, François Yvon:
The WMT'18 Morpheval test suites for English-Czech, English-German, English-Finnish and Turkish-English. WMT (shared task) 2018: 546-560 - [c95]Silvie Cinková, Ondrej Bojar:
Testsuite on Czech-English Grammatical Contrasts. WMT (shared task) 2018: 561-569 - [c94]Qingsong Ma, Ondrej Bojar, Yvette Graham:
Results of the WMT18 Metrics Shared Task: Both characters and embeddings achieve good performance. WMT (shared task) 2018: 671-688 - [e3]Ondrej Bojar, Rajen Chatterjee, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Philipp Koehn, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana L. Neves, Matt Post, Lucia Specia, Marco Turchi, Karin Verspoor:
Proceedings of the Third Conference on Machine Translation: Research Papers, WMT 2018, Belgium, Brussels, October 31 - November 1, 2018. Association for Computational Linguistics 2018, ISBN 978-1-948087-81-0 [contents] - [e2]Ondrej Bojar, Rajen Chatterjee, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Philipp Koehn, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana L. Neves, Matt Post, Lucia Specia, Marco Turchi, Karin Verspoor:
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, WMT 2018, Belgium, Brussels, October 31 - November 1, 2018. Association for Computational Linguistics 2018, ISBN 978-1-948087-81-0 [contents] - [i13]Martin Popel, Ondrej Bojar:
Training Tips for the Transformer Model. CoRR abs/1804.00247 (2018) - [i12]Jakub Kudela, Irena Holubová, Ondrej Bojar:
Extracting Parallel Paragraphs from Common Crawl. CoRR abs/1804.10413 (2018) - [i11]Ondrej Cífka, Ondrej Bojar:
Are BLEU and Meaning Representation in Opposition? CoRR abs/1805.06536 (2018) - [i10]Dominik Machácek, Jonás Vidra, Ondrej Bojar:
Morphological and Language-Agnostic Word Segmentation for NMT. CoRR abs/1806.05482 (2018) - [i9]Tom Kocmi, Ondrej Bojar:
SubGram: Extending Skip-gram Word Representation with Substrings. CoRR abs/1806.06571 (2018) - [i8]Tom Kocmi, Ondrej Bojar:
Trivial Transfer Learning for Low-Resource Neural Machine Translation. CoRR abs/1809.00357 (2018) - 2017
- [j19]Jakub Kudela, Irena Holubová, Ondrej Bojar:
Extracting Parallel Paragraphs from Common Crawl. Prague Bull. Math. Linguistics 107: 39-56 (2017) - [j18]Matiss Rikters, Mark Fishel, Ondrej Bojar:
Visualizing Neural Machine Translation Attention and Confidence. Prague Bull. Math. Linguistics 109: 39-50 (2017) - [c93]Tom Kocmi, Dusan Varis, Ondrej Bojar:
CUNI NMT System for WAT 2017 Translation Tasks. WAT@IJCNLP 2017: 154-159 - [c92]Matthias Huck, Ales Tamchyna, Ondrej Bojar, Alexander M. Fraser:
Producing Unseen Morphological Variants in Statistical Machine Translation. EACL (2) 2017: 369-375 - [c91]Tom Kocmi, Ondrej Bojar:
LanideNN: Multilingual Language Identification on Text Stream. EACL (1) 2017: 927-936 - [c90]Tom Kocmi, Ondrej Bojar:
An Exploration of Word Embedding Initialization in Deep-Learning Tasks. ICON 2017: 56-64 - [c89]Matiss Rikters, Ondrej Bojar:
Paying Attention to Multi-Word Expressions in Neural Machine Translation. MTSummit (1) 2017: 86-95 - [c88]Tom Kocmi, Ondrej Bojar:
Curriculum Learning and Minibatch Bucketing in Neural Machine Translation. RANLP 2017: 379-386 - [c87]Ondrej Bojar, Rajen Chatterjee, Christian Federmann, Yvette Graham, Barry Haddow, Shujian Huang, Matthias Huck, Philipp Koehn, Qun Liu, Varvara Logacheva, Christof Monz, Matteo Negri, Matt Post, Raphael Rubino, Lucia Specia, Marco Turchi:
Findings of the 2017 Conference on Machine Translation (WMT17). WMT 2017: 169-214 - [c86]Antonio Jimeno-Yepes, Aurélie Névéol, Mariana L. Neves, Karin Verspoor, Ondrej Bojar, Arthur Boyer, Cristian Grozea, Barry Haddow, Madeleine Kittner, Yvonne Lichtblau, Pavel Pecina, Roland Roller, Rudolf Rosa, Amy Siu, Philippe Thomas, Saskia Trescher:
Findings of the WMT 2017 Biomedical Translation Shared Task. WMT 2017: 234-247 - [c85]Roman Sudarikov, David Marecek, Tom Kocmi, Dusan Varis, Ondrej Bojar:
CUNI submission in WMT17: Chimera goes neural. WMT 2017: 248-256 - [c84]Jan-Thorsten Peter, Hermann Ney, Ondrej Bojar, Ngoc-Quan Pham, Jan Niehues, Alex Waibel, Franck Burlot, François Yvon, Marcis Pinnis, Valters Sics, Jasmijn Bastings, Miguel Rios, Wilker Aziz, Philip Williams, Frédéric Blain, Lucia Specia:
The QT21 Combined Machine Translation System for English to Latvian. WMT 2017: 348-357 - [c83]Ondrej Bojar, Yvette Graham, Amir Kamran:
Results of the WMT17 Metrics Shared Task. WMT 2017: 489-513 - [c82]Ondrej Bojar, Jindrich Helcl, Tom Kocmi, Jindrich Libovický, Tomás Musil:
Results of the WMT17 Neural MT Training Task. WMT 2017: 525-533 - [c81]David Marecek, Ondrej Bojar, Ondrej Hübsch, Rudolf Rosa, Dusan Varis:
CUNI Experiments for WMT17 Metrics Task. WMT 2017: 604-611 - [c80]Dusan Varis, Ondrej Bojar:
CUNI System for WMT17 Automatic Post-Editing Task. WMT 2017: 661-666 - [c79]Mostafa Abdou, Vladan Gloncak, Ondrej Bojar:
Variable Mini-Batch Sizing and Pre-Trained Embeddings. WMT 2017: 680-686 - [e1]Ondrej Bojar, Christian Buck, Rajen Chatterjee, Christian Federmann, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Philipp Koehn, Julia Kreutzer:
Proceedings of the Second Conference on Machine Translation, WMT 2017, Copenhagen, Denmark, September 7-8, 2017. Association for Computational Linguistics 2017, ISBN 978-1-945626-96-8 [contents] - [i7]Tom Kocmi, Ondrej Bojar:
LanideNN: Multilingual Language Identification on Character Window. CoRR abs/1701.03338 (2017) - [i6]Tom Kocmi, Ondrej Bojar:
Curriculum Learning and Minibatch Bucketing in Neural Machine Translation. CoRR abs/1707.09533 (2017) - [i5]Matiss Rikters, Ondrej Bojar:
Paying Attention to Multi-Word Expressions in Neural Machine Translation. CoRR abs/1710.06313 (2017) - [i4]Tom Kocmi, Ondrej Bojar:
An Exploration of Word Embedding Initialization in Deep-Learning Tasks. CoRR abs/1711.09160 (2017) - 2016
- [c78]Ales Tamchyna, Alexander M. Fraser, Ondrej Bojar, Marcin Junczys-Dowmunt:
Target-Side Context for Discriminative Models in Statistical Machine Translation. ACL (1) 2016 - [c77]Rudolf Rosa, Martin Popel, Ondrej Bojar, David Marecek, Ondrej Dusek:
Moses & Treex Hybrid MT Systems Bestiary. DMTW 2016: 1-10 - [c76]Duc Tam Hoang, Ondrej Bojar:
Pivoting Methods and Data for Czech-Vietnamese Translation via English. EAMT 2016: 190-202 - [c75]Martin Popel, Roman Sudarikov, Ondrej Bojar, Rudolf Rosa, Jan Hajic:
TectoMT - a deep linguistic core of the combined Cimera MT system. EAMT (Projects/Products) 2016 - [c74]Alexandra Birch, Omri Abend, Ondrej Bojar, Barry Haddow:
HUME: Human UCCA-Based Evaluation of Machine Translation. EMNLP 2016: 1264-1274 - [c73]Roman Sudarikov, Ondrej Dusek, Martin Holub, Ondrej Bojar, Vincent Kríz:
Verb sense disambiguation in Machine Translation. HyTra@COLING 2016: 42-50 - [c72]Ondrej Bojar, Ondrej Cífka, Jindrich Helcl, Tom Kocmi, Roman Sudarikov:
UFAL Submissions to the IWSLT 2016 MT Track. IWSLT 2016 - [c71]Tom Kocmi, Ondrej Bojar:
SubGram: Extending Skip-Gram Word Representation with Substrings. TSD 2016: 182-189 - [c70]Ondrej Bojar, Ondrej Dusek, Tom Kocmi, Jindrich Libovický, Michal Novák, Martin Popel, Roman Sudarikov, Dusan Varis:
CzEng 1.6: Enlarged Czech-English Parallel Corpus with Processing Tools Dockered. TSD 2016: 231-238 - [c69]Ondrej Bojar, Rajen Chatterjee, Christian Federmann, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Philipp Koehn, Varvara Logacheva, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana L. Neves, Martin Popel, Matt Post, Raphael Rubino, Carolina Scarton, Lucia Specia, Marco Turchi, Karin Verspoor, Marcos Zampieri:
Findings of the 2016 Conference on Machine Translation. WMT 2016: 131-198 - [c68]Ondrej Bojar, Yvette Graham, Amir Kamran, Milos Stanojevic:
Results of the WMT16 Metrics Shared Task. WMT 2016: 199-231 - [c67]Bushra Jawaid, Amir Kamran, Milos Stanojevic, Ondrej Bojar:
Results of the WMT16 Tuning Shared Task. WMT 2016: 232-238 - [c66]Jan-Thorsten Peter, Tamer Alkhouli, Hermann Ney, Matthias Huck, Fabienne Braune, Alexander M. Fraser, Ales Tamchyna, Ondrej Bojar, Barry Haddow, Rico Sennrich, Frédéric Blain, Lucia Specia, Jan Niehues, Alex Waibel, Alexandre Allauzen, Lauriane Aufrant, Franck Burlot, Elena Knyazeva, Thomas Lavergne, François Yvon, Marcis Pinnis, Stella Frank:
The QT21/HimL Combined Machine Translation System. WMT 2016: 344-355 - [c65]Ales Tamchyna, Roman Sudarikov, Ondrej Bojar, Alexander M. Fraser:
CUNI-LMU Submissions in WMT2016: Chimera Constrained and Beaten. WMT 2016: 385-390 - [c64]Philip Williams, Rico Sennrich, Maria Nadejde, Matthias Huck, Barry Haddow, Ondrej Bojar:
Edinburgh's Statistical Machine Translation Systems for WMT16. WMT 2016: 399-410 - [c63]Rudolf Rosa, Roman Sudarikov, Michal Novák, Martin Popel, Ondrej Bojar:
Dictionary-based Domain Adaptation of MT Systems without Retraining. WMT 2016: 449-455 - [c62]Viktor Kocur, Ondrej Bojar:
Particle Swarm Optimization Submission for WMT16 Tuning Task. WMT 2016: 518-524 - [c61]Jindrich Libovický, Jindrich Helcl, Marek Tlustý, Ondrej Bojar, Pavel Pecina:
CUNI System for WMT16 Automatic Post-Editing and Multimodal Translation Tasks. WMT 2016: 646-654 - [c60]Thanh Le, Hoa Trong Vu, Jonathan Oberländer, Ondrej Bojar:
Using Term Position Similarity and Language Modeling for Bilingual Document Alignment. WMT 2016: 710-716 - [c59]Amal Abdelsalam, Ondrej Bojar, Samhaa R. El-Beltagy:
Bilingual Embeddings and Word Alignments for Translation Quality Estimation. WMT 2016: 764-771 - [c58]Bushra Jawaid, Amir Kamran, Ondrej Bojar:
Enriching Source for English-to-Urdu Machine Translation. WSSANLP@COLING 2016: 54-63 - [i3]Jindrich Libovický, Jindrich Helcl, Marek Tlustý, Pavel Pecina, Ondrej Bojar:
CUNI System for WMT16 Automatic Post-Editing and Multimodal Translation Tasks. CoRR abs/1606.07481 (2016) - [i2]Alexandra Birch, Barry Haddow, Ondrej Bojar, Omri Abend:
HUME: Human UCCA-Based Evaluation of Machine Translation. CoRR abs/1607.00030 (2016) - [i1]Ales Tamchyna, Alexander M. Fraser, Ondrej Bojar, Marcin Junczys-Dowmunt:
Target-Side Context for Discriminative Models in Statistical Machine Translation. CoRR abs/1607.01149 (2016) - 2015
- [j17]Franky, Ondrej Bojar, Katerina Veselovská:
Resources for Indonesian Sentiment Analysis. Prague Bull. Math. Linguistics 103: 21-42 (2015) - [j16]Matous Machácek, Ondrej Bojar:
Evaluating Machine Translation Quality Using Short Segments Annotations. Prague Bull. Math. Linguistics 103: 85-110 (2015) - [j15]Duc Tam Hoang, Ondrej Bojar:
TmTriangulate: A Tool for Phrase Table Triangulation. Prague Bull. Math. Linguistics 104: 75-86 (2015) - [c57]Ales Tamchyna, Ondrej Bojar:
What a Transfer-Based System Brings to the Combination with PBMT. HyTra@ACL 2015: 11-20 - [c56]Roman Sudarikov, Ondrej Bojar:
Giving a Sense: A Pilot Study in Concept Annotation from Multiple Resources. ITAT 2015: 88-94 - [c55]Petr Fanta, Roman Sudarikov, Ondrej Bojar:
TeamUFAL: WSD+EL as Document Retrieval. SemEval@NAACL-HLT 2015: 350-354 - [c54]Ondrej Bojar, Rajen Chatterjee, Christian Federmann, Barry Haddow, Matthias Huck, Chris Hokamp, Philipp Koehn, Varvara Logacheva, Christof Monz, Matteo Negri, Matt Post, Carolina Scarton, Lucia Specia, Marco Turchi:
Findings of the 2015 Workshop on Statistical Machine Translation. WMT@EMNLP 2015: 1-46 - [c53]Ondrej Bojar, Ales Tamchyna:
CUNI in WMT15: Chimera Strikes Again. WMT@EMNLP 2015: 79-83 - [c52]Milos Stanojevic, Amir Kamran, Philipp Koehn, Ondrej Bojar:
Results of the WMT15 Metrics Shared Task. WMT@EMNLP 2015: 256-273 - [c51]Milos Stanojevic, Amir Kamran, Ondrej Bojar:
Results of the WMT15 Tuning Shared Task. WMT@EMNLP 2015: 274-281 - 2014
- [j14]Ondrej Bojar, Daniel Zeman:
Czech Machine Translation in the project CzechMATE. Prague Bull. Math. Linguistics 101: 71-96 (2014) - [c50]Jan Hajic, Ondrej Bojar, Zdenka Uresová:
Comparing Czech and English AMRs. LG-LP@COLING 2014: 55-64 - [c49]Dusan Varis, Ondrej Bojar:
Japonsko-český strojový překlad. ITAT 2014: 85-92 - [c48]Bushra Jawaid, Ondrej Bojar:
Two-Step Machine Translation with Lattices. LREC 2014: 682-686 - [c47]Nianwen Xue, Ondrej Bojar, Jan Hajic, Martha Palmer, Zdenka Uresová, Xiuhong Zhang:
Not an Interlingua, But Close: Comparison of English AMRs to Chinese and Czech. LREC 2014: 1765-1772 - [c46]Bushra Jawaid, Amir Kamran, Ondrej Bojar:
A Tagged Corpus and a Tagger for Urdu. LREC 2014: 2938-2943 - [c45]Ondrej Bojar, Vojtech Diatka, Pavel Rychlý, Pavel Stranák, Vit Suchomel, Ales Tamchyna, Daniel Zeman:
HindEnCorp - Hindi-English and Hindi-only Corpus for Machine Translation. LREC 2014: 3550-3555 - [c44]Ondrej Bojar, Christian Buck, Christian Federmann, Barry Haddow, Philipp Koehn, Johannes Leveling, Christof Monz, Pavel Pecina, Matt Post, Herve Saint-Amand, Radu Soricut, Lucia Specia, Ales Tamchyna:
Findings of the 2014 Workshop on Statistical Machine Translation. WMT@ACL 2014: 12-58 - [c43]Ales Tamchyna, Martin Popel, Rudolf Rosa, Ondrej Bojar:
CUNI in WMT14: Chimera Still Awaits Bellerophon. WMT@ACL 2014: 195-200 - [c42]Matous Machácek, Ondrej Bojar:
Results of the WMT14 Metrics Shared Task. WMT@ACL 2014: 293-301 - [c41]Bushra Jawaid, Amir Kamran, Ondrej Bojar:
English to Urdu Statistical Machine Translation: Establishing a Baseline. WSSANLP@COLING 2014: 37-42 - 2013
- [j13]Ondrej Bojar, Ales Tamchyna:
The Design of Eman, an Experiment Manager. Prague Bull. Math. Linguistics 99: 39-58 (2013) - [c40]Ales Tamchyna, Ondrej Bojar:
No Free Lunch in Factored Phrase-Based Machine Translation. CICLing (2) 2013: 210-223 - [c39]Ondrej Bojar, Matous Machácek, Ales Tamchyna, Daniel Zeman:
Scratching the Surface of Possible Translations. TSD 2013: 465-474 - [c38]Ondrej Bojar, Christian Buck, Chris Callison-Burch, Christian Federmann, Barry Haddow, Philipp Koehn, Christof Monz, Matt Post, Radu Soricut, Lucia Specia:
Findings of the 2013 Workshop on Statistical Machine Translation. WMT@ACL 2013: 1-44 - [c37]Ondrej Bojar, Rudolf Rosa, Ales Tamchyna:
Chimera - Three Heads for English-to-Czech Translation. WMT@ACL 2013: 92-98 - [c36]Petra Galuscáková, Martin Popel, Ondrej Bojar:
PhraseFix: Statistical Post-Editing of TectoMT. WMT@ACL 2013: 141-147 - [c35]Matous Machácek, Ondrej Bojar:
Results of the WMT13 Metrics Shared Task. WMT@ACL 2013 - 2012
- [j12]Jirí Marsík, Ondrej Bojar:
TrTok: A Fast and Trainable Tokenizer for Natural Languages. Prague Bull. Math. Linguistics 98: 75-86 (2012) - [c34]Petra Galuscáková, Ondrej Bojar:
Improving SMT by Using Parallel Data of a Closely Related Language. Baltic HLT 2012: 58-65 - [c33]Mark Fishel, Ondrej Bojar, Maja Popovic:
Terra: a Collection of Translation Error-Annotated Corpora. LREC 2012: 7-14 - [c32]Jan Berka, Ondrej Bojar, Mark Fishel, Maja Popovic, Daniel Zeman:
Automatic MT Error Analysis: Hjerson Helping Addicter. LREC 2012: 2158-2163 - [c31]Jan Hajic, Eva Hajicová, Jarmila Panevová, Petr Sgall, Ondrej Bojar, Silvie Cinková, Eva Fucíková, Marie Mikulová, Petr Pajas, Jan Popelka, Jirí Semecký, Jana Sindlerová, Jan Stepánek, Josef Toman, Zdenka Uresová, Zdenek Zabokrtský:
Announcing Prague Czech-English Dependency Treebank 2.0. LREC 2012: 3153-3160 - [c30]Ondrej Bojar, Zdenek Zabokrtský, Ondrej Dusek, Petra Galuscáková, Martin Majlis, David Marecek, Jirí Marsík, Michal Novák, Martin Popel, Ales Tamchyna:
The Joy of Parallelism with CzEng 1.0. LREC 2012: 3921-3928 - [c29]Ondrej Bojar, Dekai Wu:
Towards a Predicate-Argument Evaluation for MT. SSST@ACL 2012: 30-38 - [c28]Mark Fishel, Rico Sennrich, Maja Popovic, Ondrej Bojar:
TerrorCat: a Translation Error Categorization-based MT Quality Metric. WMT@NAACL-HLT 2012: 64-70 - [c27]Ondrej Bojar, Bushra Jawaid, Amir Kamran:
Probes in a Taxonomy of Factored Phrase-Based Models. WMT@NAACL-HLT 2012: 253-260 - [c26]Ales Tamchyna, Petra Galuscáková, Amir Kamran, Milos Stanojevic, Ondrej Bojar:
Selecting Data for English-to-Czech Machine Translation. WMT@NAACL-HLT 2012: 374-381 - [c25]Bushra Jawaid, Ondrej Bojar:
Tagger Voting for Urdu. WSSANLP@COLING 2012: 135-144 - 2011
- [j11]Ondrej Bojar:
Analyzing Error Types in English-Czech Machine Translation. Prague Bull. Math. Linguistics 95: 63-76 (2011) - [j10]Jan Berka, Martin Cerný, Ondrej Bojar:
Quiz-Based Evaluation of Machine Translation. Prague Bull. Math. Linguistics 95: 77-86 (2011) - [j9]Daniel Zeman, Mark Fishel, Jan Berka, Ondrej Bojar:
Addicter: What Is Wrong with My Translations? Prague Bull. Math. Linguistics 96: 79-88 (2011) - [j8]Ceslav Przywara, Ondrej Bojar:
eppex: Epochal Phrase Table Extraction for Statistical Machine Translation. Prague Bull. Math. Linguistics 96: 89-98 (2011) - [c24]Ondrej Hálek, Rudolf Rosa, Ales Tamchyna, Ondrej Bojar:
Named entities from Wikipedia for machine translation. ITAT 2011: 23-30 - [c23]Mark Fishel, Ondrej Bojar, Daniel Zeman, Jan Berka:
Automatic Translation Error Analysis. TSD 2011: 72-79 - [c22]Ondrej Bojar, Milos Ercegovcevic, Martin Popel, Omar Zaidan:
A Grain of Salt for the WMT Manual Evaluation. WMT@EMNLP 2011: 1-11 - [c21]Matous Machácek, Ondrej Bojar:
Approximating a Deep-Syntactic Metric for MT Evaluation and Tuning. WMT@EMNLP 2011: 92-98 - [c20]Ondrej Bojar, Ales Tamchyna:
Improving Translation Model by Monolingual Data. WMT@EMNLP 2011: 330-336 - [c19]David Marecek, Rudolf Rosa, Petra Galuscáková, Ondrej Bojar:
Two-step translation with grammatical post-processing. WMT@EMNLP 2011: 426-432 - 2010
- [c18]Ondrej Bojar, Kamil Kos, David Marecek:
Tackling Sparse Data Issue in Machine Translation Evaluation. ACL (2) 2010: 86-91 - [c17]Jirí Divis, Ondrej Bojar:
Automatic source code reduction. ITAT 2010: 9-16 - [c16]Ondrej Bojar, Adam Liska, Zdenek Zabokrtský:
Evaluating Utility of Data Sources in a Large Parallel Czech-English Corpus CzEng 0.9. LREC 2010 - [c15]Ondrej Bojar, Pavel Stranák, Daniel Zeman:
Data Issues in English-to-Hindi Machine Translation. LREC 2010 - [c14]Jana Sindlerová, Ondrej Bojar:
Building a Bilingual ValLex Using Treebank Token Alignment: First Observations. LREC 2010 - [c13]Ondrej Bojar, Kamil Kos:
2010 Failures in English-Czech Phrase-Based MT. WMT@ACL 2010: 60-66
2000 – 2009
- 2009
- [j7]Ondrej Bojar, Zdenek Zabokrtský:
CzEng 0.9: Large Parallel Treebank with Rich Annotation. Prague Bull. Math. Linguistics 92: 63-84 (2009) - [j6]Kamil Kos, Ondrej Bojar:
Evaluation of Machine Translation Metrics for Czech as the Target Language. Prague Bull. Math. Linguistics 92: 135-148 (2009) - [c12]David Kolovratník, Natalia Klyueva, Ondrej Bojar:
Statistical Machine Translation Between Related and Unrelated Languages. ITAT 2009: 31-36 - [c11]Ondrej Bojar, David Marecek, Václav Novák, Martin Popel, Jan Ptácek, Jan Rous, Zdenek Zabokrtský:
English-Czech MT in 2008. WMT@EACL 2009: 125-129 - 2008
- [j5]Ondrej Bojar, Silvie Cinková, Jan Ptácek:
Towards English-to-Czech MT via Tectogrammatical Layer. Prague Bull. Math. Linguistics 90: 57-68 (2008) - [c10]Ondrej Bojar, Miroslav Janícek, Zdenek Zabokrtský, Pavel Ceska, Peter Bena:
CzEng 0.7: Parallel Corpus with Community-Supplied Translations. LREC 2008 - [c9]Ondrej Bojar, Jan Hajic:
Phrase-Based and Deep Syntactic English-to-Czech Statistical Machine Translation. WMT@ACL 2008: 143-146 - 2007
- [c8]Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondrej Bojar, Alexandra Constantin, Evan Herbst:
Moses: Open Source Toolkit for Statistical Machine Translation. ACL 2007 - [c7]Ondrej Bojar:
English-to-Czech Factored Machine Translation. WMT@ACL 2007: 232-239 - 2006
- [j4]Ondrej Bojar, Zdenek Zabokrtský:
CzEng: Czech-English Parallel Corpus release version 0.5. Prague Bull. Math. Linguistics 86: 59-62 (2006) - [c6]Ondrej Bojar, Evgeny Matusov, Hermann Ney:
Czech-English Phrase-Based Machine Translation. FinTAL 2006: 214-224 - [c5]Ondrej Bojar, Magdalena Prokopová:
Czech-English Word Alignment. LREC 2006: 1236-1239 - [c4]Václava Benesová, Ondrej Bojar:
Czech Verbs of Communication and the Extraction of Their Frames. TSD 2006: 29-36 - 2005
- [j3]Ondrej Bojar, Jirí Semecký, Václava Benesová:
VALEVAL: Testing Vallex Consistency and Experimenting with Word-Frame Disambiguation. Prague Bull. Math. Linguistics 83: 5-18 (2005) - [c3]Ondrej Bojar, Petr Homola, Vladislav Kubon:
Problems of Reusing an Existing MT System. IJCNLP (companion) 2005 - [c2]Markéta Lopatková, Ondrej Bojar, Jirí Semecký, Václava Benesová, Zdenek Zabokrtský:
Valency Lexicon of Czech Verbs VALLEX: Recent Experiments with Frame Disambiguation. TSD 2005: 99-106 - 2004
- [j2]Ondrej Bojar:
Czech Syntactic Analysis Constraint-based - XDG: One Possible Start. Prague Bull. Math. Linguistics 81: 43-54 (2004) - [c1]Ondrej Bojar:
Problems of Inducing Large Coverage Constraint-Based Dependency Grammar for Czech. CSLP 2004: 90-103 - 2003
- [j1]Ondrej Bojar:
Towards Automatic Extraction of Verb Frames. Prague Bull. Math. Linguistics 79-80: 101-120 (2003)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-01-27 00:49 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint