default search action

combined dblp search
author search
venue search
publication search

ask others

Otavio Braga

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BragaXJCYSN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BragaXJCYSN24
Otavio Braga, Wei Xia, Keith Johnson, Alice Chuang, Yunfan Ye, Olivier Siohan, Tuan Anh Nguyen:
Large Scale Self-Supervised Pretraining for Active Speaker Detection. ICASSP 2024: 10036-10040
2023
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-09369
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-09369
Avner May, Dmitriy Serdyuk, Ankit Parag Shah, Otavio Braga, Olivier Siohan:
Audio-visual fine-tuning of audio-only ASR models. CoRR abs/2312.09369 (2023)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-10088
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-10088
Oscar Chang, Otavio Braga, Hank Liao, Dmitriy Serdyuk, Olivier Siohan:
On Robustness to Missing Video for Audiovisual Speech Recognition. CoRR abs/2312.10088 (2023)
2022
[j1]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/ChangBLSS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/ChangBLSS22
Oscar Chang, Otavio Braga, Hank Liao, Dmitriy Serdyuk, Olivier Siohan:
On Robustness to Missing Video for Audiovisual Speech Recognition. Trans. Mach. Learn. Res. 2022 (2022)
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BragaS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BragaS22
Otavio Braga, Olivier Siohan:
Best of Both Worlds: Multi-Task Audio-Visual Automatic Speech Recognition and Active Speaker Detection. ICASSP 2022: 6047-6051
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SerdyukBS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SerdyukBS22
Dmitriy Serdyuk, Otavio Braga, Olivier Siohan:
Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Muti-Person Video. INTERSPEECH 2022: 2833-2837
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-10439
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-10439
Dmitriy Serdyuk, Otavio Braga, Olivier Siohan:
Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition. CoRR abs/2201.10439 (2022)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-05206
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-05206
Otavio Braga, Olivier Siohan:
Best of Both Worlds: Multi-task Audio-Visual Automatic Speech Recognition and Active Speaker Detection. CoRR abs/2205.05206 (2022)
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-05586
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-05586
Otavio Braga, Takaki Makino, Olivier Siohan, Hank Liao:
End-to-End Multi-Person Audio/Visual Automatic Speech Recognition. CoRR abs/2205.05586 (2022)
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-05684
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-05684
Otavio Braga, Olivier Siohan:
A Closer Look at Audio-Visual Multi-Person Speech Recognition and Active Speaker Selection. CoRR abs/2205.05684 (2022)
2021
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SerdyukBS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SerdyukBS21
Dmitriy Serdyuk, Otavio Braga, Olivier Siohan:
Audio-Visual Speech Recognition is Worth $32\times 32\times 8$ Voxels. ASRU 2021: 796-802
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BragaS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BragaS21
Otavio Braga, Olivier Siohan:
A Closer Look at Audio-Visual Multi-Person Speech Recognition and Active Speaker Selection. ICASSP 2021: 6863-6867
[c3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RoseSTB21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RoseSTB21
Richard Rose, Olivier Siohan, Anshuman Tripathi, Otavio Braga:
End-to-End Audio-Visual Speech Recognition for Overlapping Speech. Interspeech 2021: 3016-3020
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-09536
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-09536
Dmitriy Serdyuk, Otavio Braga, Olivier Siohan:
Audio-Visual Speech Recognition is Worth 32×32×8 Voxels. CoRR abs/2109.09536 (2021)
2020
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BragaMSL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BragaMSL20
Otavio Braga, Takaki Makino, Olivier Siohan, Hank Liao:
End-to-End Multi-Person Audio/Visual Automatic Speech Recognition. ICASSP 2020: 6994-6998

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/MakinoLASGBS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/MakinoLASGBS19
Takaki Makino, Hank Liao, Yannis M. Assael, Brendan Shillingford, Basilio Garcia, Otavio Braga, Olivier Siohan:
Recurrent Neural Network Transducer for Audio-Visual Speech Recognition. ASRU 2019: 905-912
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1911-04890
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-04890
Takaki Makino, Hank Liao, Yannis M. Assael, Brendan Shillingford, Basilio Garcia, Otavio Braga, Olivier Siohan:
Recurrent Neural Network Transducer for Audio-Visual Speech Recognition. CoRR abs/1911.04890 (2019)
2014
[b1]
- view
  - electronic edition @ nyu.edu
  - details & citations
- export record
  dblp key:
  - phd/us/Braga14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/us/Braga14
Otavio Braga:
On the Human Form: Efficient acquisition, modeling and manipulation of thehuman body. New York University, USA, 2014

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.