default search action
Bowen Shi
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2024
- [j18]Bing He, Tong Qin, Bowen Shi, Weihua Dong:
How do human detect targets of remote sensing images with visual attention? Int. J. Appl. Earth Obs. Geoinformation 132: 104044 (2024) - [j17]Yang Chen, Bowen Shi:
Enhanced Heterogeneous Graph Attention Network with a Novel Multilabel Focal Loss for Document-Level Relation Extraction. Entropy 26(3): 210 (2024) - [j16]Yang Chen, Bowen Shi:
DiffFSRE: Diffusion-Enhanced Prototypical Network for Few-Shot Relation Extraction. Entropy 26(5): 352 (2024) - [j15]Bowen Shi, Weihua Dong, Zhicheng Zhan:
AdaFI-FCN: an adaptive feature integration fully convolutional network for predicting driver's visual attention. Geo spatial Inf. Sci. 27(4): 1309-1325 (2024) - [j14]Weiwei Huo, Zihan Zhang, Jingjing Qu, Jiaqi Yan, Siyuan Yan, Jinyi Yan, Bowen Shi:
Speciesism and Preference of Human-Artificial Intelligence Interaction: A Study on Medical Artificial Intelligence. Int. J. Hum. Comput. Interact. 40(11): 2925-2937 (2024) - [j13]Yang Chen, Bowen Shi, Ke Xu:
PTCAS: Prompt tuning with continuous answer search for relation extraction. Inf. Sci. 659: 120060 (2024) - [j12]Mi Zhou, Yaxuan Li, Zhilei Qiao, Bowen Shi:
Employee Ratings and Reviews Data from Glassdoor. J. Inf. Syst. 38(3): 93-105 (2024) - [j11]Vineel Pratap, Andros Tjandra, Bowen Shi, Paden Tomasello, Arun Babu, Sayani Kundu, Ali Elkahky, Zhaoheng Ni, Apoorv Vyas, Maryam Fazel-Zarandi, Alexei Baevski, Yossi Adi, Xiaohui Zhang, Wei-Ning Hsu, Alexis Conneau, Michael Auli:
Scaling Speech Technology to 1, 000+ Languages. J. Mach. Learn. Res. 25: 97:1-97:52 (2024) - 2023
- [j10]Bowen Shi, Xiaoyan Huang, Jing Li, He Zhang, Han Zhao, Xiaochen Zhang, Fengyu Zhang, Chris Gerada:
Novel End-Winding Hybrid Flux Machine. IEEE Access 11: 133781-133791 (2023) - [j9]Weiwei Huo, Xinze Yuan, Xianmiao Li, Wenhao Luo, Jiaying Xie, Bowen Shi:
Increasing acceptance of medical AI: The role of medical staff participation in AI development. Int. J. Medical Informatics 175: 105073 (2023) - [j8]Bowen Shi, Ke Xu, Jichang Zhao:
Domain-relevance of influence: characterizing variations in online influence across multiple domains on social media. J. Big Data 10(1): 69 (2023) - [j7]Ruibo Chen, Yanjun Pu, Bowen Shi, Wenjun Wu:
An automatic model management system and its implementation for AIOps on microservice platforms. J. Supercomput. 79(10): 11410-11426 (2023) - 2022
- [j6]Bowen Shi, Ke Xu, Jichang Zhao:
Behavior Variations and Their Implications for Popularity Promotions: From Elites to Mass on Weibo. Entropy 24(5): 664 (2022) - [j5]Kaifeng Huang, Bihuan Chen, Congying Xu, Ying Wang, Bowen Shi, Xin Peng, Yijian Wu, Yang Liu:
Characterizing usages, updates and risks of third-party libraries in Java projects. Empir. Softw. Eng. 27(4): 90 (2022) - [j4]Jing Fu, Shitao Song, Li Guo, Weiwei Chen, Peng Wang, Lingjian Duanmu, Yijing Shang, Bowen Shi, Luyan He:
Interprovincial Joint Prevention and Control of Open Straw Burning in Northeast China: Implications for Atmospheric Environment Management. Remote. Sens. 14(11): 2528 (2022) - 2021
- [j3]Zhaokai Li, Xiaoyan Huang, Lijian Wu, He Zhang, Tingna Shi, Yan Yan, Bowen Shi, Geng Yang:
An Improved Hybrid Field Model for Calculating On-Load Performance of Interior Permanent-Magnet Motors. IEEE Trans. Ind. Electron. 68(10): 9207-9217 (2021) - 2019
- [j2]Haipeng Chen, Zhentao He, Bowen Shi, Tie Zhong:
Research on Recognition Method of Electrical Components Based on YOLO V3. IEEE Access 7: 157818-157829 (2019) - [j1]Xiujuan Xu, Yu Liu, Wei Wang, Xiaowei Zhao, Quan Z. Sheng, Zhe Wang, Bowen Shi:
ITS-Frame: A Framework for Multi-Aspect Analysis in the Field of Intelligent Transportation Systems. IEEE Trans. Intell. Transp. Syst. 20(8): 2893-2902 (2019)
Conference and Workshop Papers
- 2024
- [c53]Phillip Rust, Bowen Shi, Skyler Wang, Necati Cihan Camgöz, Jean Maillard:
Towards Privacy-Aware Sign Language Translation at Scale. ACL (1) 2024: 8624-8641 - [c52]HyoJung Han, Mohamed Anwar, Juan Pino, Wei-Ning Hsu, Marine Carpuat, Bowen Shi, Changhan Wang:
XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception. ACL (1) 2024: 12896-12911 - [c51]Yue Hou, Xueyuan Chen, He Zhu, Ruomei Liu, Bowen Shi, Jiaheng Liu, Junran Wu, Ke Xu:
NC2D: Novel Class Discovery for Node Classification. CIKM 2024: 849-859 - [c50]Wenbo Li, Bowen Shi, Daidai Zhu, Aihong Yuan:
Latent Multi-view Clustering Based Adaptive Graph Constraint. CMLDS 2024: 16:1-16:7 - [c49]Bowen Shi, Peisen Zhao, Zichen Wang, Yuhang Zhang, Yaoming Wang, Jin Li, Wenrui Dai, Junni Zou, Hongkai Xiong, Qi Tian, Xiaopeng Zhang:
UMG-CLIP: A Unified Multi-granularity Vision Generalist for Open-World Understanding. ECCV (38) 2024: 259-277 - [c48]Peng-Jen Chen, Bowen Shi, Kelvin Niu, Ann Lee, Wei-Ning Hsu:
M2BART: Multilingual and Multimodal Encoder-Decoder Pre-Training for Any-to-Any Machine Translation. ICASSP 2024: 11896-11900 - [c47]Alexander H. Liu, Matthew Le, Apoorv Vyas, Bowen Shi, Andros Tjandra, Wei-Ning Hsu:
Generative Pre-training for Speech with Flow Matching. ICLR 2024 - [c46]Bowen Shi, Xiaopeng Zhang, Yaoming Wang, Jin Li, Wenrui Dai, Junni Zou, Hongkai Xiong, Qi Tian:
Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners. ICLR 2024 - [c45]Yaoming Wang, Jin Li, Xiaopeng Zhang, Bowen Shi, Chenglin Li, Wenrui Dai, Hongkai Xiong, Qi Tian:
BarLeRIa: An Efficient Tuning Framework for Referring Image Segmentation. ICLR 2024 - [c44]K. R. Prajwal, Bowen Shi, Matthew Le, Apoorv Vyas, Andros Tjandra, Mahi Luthra, Baishan Guo, Huiyu Wang, Triantafyllos Afouras, David Kant, Wei-Ning Hsu:
MusicFlow: Cascaded Flow Matching for Text Guided Music Generation. ICML 2024 - [c43]Yaoming Wang, Jin Li, Wenrui Dai, Bowen Shi, Xiaopeng Zhang, Chenglin Li, Hongkai Xiong:
Bootstrap AutoEncoders With Contrastive Paradigm for Self-supervised Gaze Estimation. ICML 2024 - 2023
- [c42]Han Li, Bowen Shi, Wenrui Dai, Hongwei Zheng, Botao Wang, Yu Sun, Min Guo, Chenglin Li, Junni Zou, Hongkai Xiong:
Pose-Oriented Transformer with Uncertainty-Guided Refinement for 2D-to-3D Human Pose Estimation. AAAI 2023: 1296-1304 - [c41]Yaoming Wang, Bowen Shi, Xiaopeng Zhang, Jin Li, Yuchen Liu, Wenrui Dai, Chenglin Li, Hongkai Xiong, Qi Tian:
Adapting Shortcut with Normalizing Flow: An Efficient Tuning Framework for Visual Recognition. CVPR 2023: 15965-15974 - [c40]Wei-Ning Hsu, Tal Remez, Bowen Shi, Jacob Donley, Yossi Adi:
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Regeneration. CVPR 2023: 18796-18806 - [c39]Ankita Pasad, Bowen Shi, Karen Livescu:
Comparative Layer-Wise Analysis of Self-Supervised Speech Models. ICASSP 2023: 1-5 - [c38]Yuetian Chen, Bowen Shi, Mei Si:
Prompt to GPT-3: Step-by-Step Thinking Instructions for Humor Generation. ICCC 2023: 437-441 - [c37]Hongwei Zheng, Han Li, Bowen Shi, Wenrui Dai, Botao Wang, Yu Sun, Min Guo, Hongkai Xiong:
ActionPrompt: Action-Guided 3D Human Pose Estimation With Text and Pose Prompting. ICME 2023: 2657-2662 - [c36]Junran Wu, Xueyuan Chen, Bowen Shi, Shangzhe Li, Ke Xu:
SEGA: Structural Entropy Guided Anchor View for Graph Contrastive Learning. ICML 2023: 37293-37312 - [c35]Mohamed Anwar, Bowen Shi, Vedanuj Goswami, Wei-Ning Hsu, Juan Pino, Changhan Wang:
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation. INTERSPEECH 2023: 4064-4068 - [c34]Tu Anh Nguyen, Wei-Ning Hsu, Antony D'Avirro, Bowen Shi, Itai Gat, Maryam Fazel-Zarandi, Tal Remez, Jade Copet, Gabriel Synnaeve, Michael Hassid, Felix Kreuk, Yossi Adi, Emmanuel Dupoux:
Expresso: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis. INTERSPEECH 2023: 4823-4827 - [c33]Yaoming Wang, Yuchen Liu, Xiaopeng Zhang, Jin Li, Bowen Shi, Chenglin Li, Wenrui Dai, Hongkai Xiong, Qi Tian:
VioLET: Vision-Language Efficient Tuning with Collaborative Multi-modal Gradients. ACM Multimedia 2023: 4595-4605 - [c32]Matthew Le, Apoorv Vyas, Bowen Shi, Brian Karrer, Leda Sari, Rashel Moritz, Mary Williamson, Vimal Manohar, Yossi Adi, Jay Mahadeokar, Wei-Ning Hsu:
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale. NeurIPS 2023 - [c31]Jin Li, Yaoming Wang, Xiaopeng Zhang, Bowen Shi, Dongsheng Jiang, Chenglin Li, Wenrui Dai, Hongkai Xiong, Qi Tian:
AiluRus: A Scalable ViT Framework for Dense Prediction. NeurIPS 2023 - [c30]Marcelo Sandoval-Castañeda, Yanhong Li, Bowen Shi, Diane Brentari, Karen Livescu, Gregory Shakhnarovich:
TTIC's Submission to WMT-SLT 23. WMT 2023: 344-350 - 2022
- [c29]Bowen Shi, Diane Brentari, Greg Shakhnarovich, Karen Livescu:
Searching for fingerspelled content in American Sign Language. ACL (1) 2022: 1699-1712 - [c28]Bowen Shi, Dongsheng Jiang, Xiaopeng Zhang, Han Li, Wenrui Dai, Junni Zou, Hongkai Xiong, Qi Tian:
A Transformer-Based Decoder for Semantic Segmentation with Multi-level Context Mining. ECCV (28) 2022: 624-639 - [c27]Bowen Shi, Diane Brentari, Gregory Shakhnarovich, Karen Livescu:
Open-Domain Sign Language Translation Learned from Online Video. EMNLP 2022: 6365-6379 - [c26]Bowen Shi, Wei-Ning Hsu, Kushal Lakhotia, Abdelrahman Mohamed:
Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction. ICLR 2022 - [c25]Bowen Shi, Wei-Ning Hsu, Abdelrahman Mohamed:
Robust Self-Supervised Audio-Visual Speech Recognition. INTERSPEECH 2022: 2118-2122 - [c24]Bowen Shi, Abdelrahman Mohamed, Wei-Ning Hsu:
Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT. INTERSPEECH 2022: 4785-4789 - [c23]Wei-Ning Hsu, Bowen Shi:
u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled Modality. NeurIPS 2022 - [c22]Bowen Shi, Diane Brentari, Gregory Shakhnarovich, Karen Livescu:
TTIC's WMT-SLT 22 Sign Language Translation System. WMT 2022: 989-993 - 2021
- [c21]Han Li, Bowen Shi, Wenrui Dai, Yabo Chen, Botao Wang, Yu Sun, Min Guo, Chenglin Li, Junni Zou, Hongkai Xiong:
Hierarchical Graph Networks for 3D Human Pose Estimation. BMVC 2021: 387 - [c20]Bowen Shi, Diane Brentari, Greg Shakhnarovich, Karen Livescu:
Fingerspelling Detection in American Sign Language. CVPR 2021: 4166-4175 - [c19]Bowen Shi, Shane Settle, Karen Livescu:
Whole-Word Segmental Speech Recognition with Acoustic Word Embeddings. SLT 2021: 164-171 - 2020
- [c18]Bowen Shi, Ming Sun, Krishna C. Puvvada, Chieh-Chi Kao, Spyros Matsoukas, Chao Wang:
Few-Shot Acoustic Event Detection Via Meta Learning. ICASSP 2020: 76-80 - [c17]Bowen Shi, Jilong Gao, Zhentao He, Tian Zhang, Tie Zhong, Haipeng Chen:
Super-Resolution Reconstruction of Electric Power Inspection Images Based on Very Deep Network Super Resolution. ICAIS (1) 2020: 729-738 - [c16]Bowen Shi, Yuhui Xu, Wenrui Dai, Botao Wang, Shuai Zhang, Chenglin Li, Junni Zou, Hongkai Xiong:
Tiny-Hourglassnet: An Efficient Design For 3d Human Pose Estimation. ICIP 2020: 1491-1495 - [c15]Ying Wang, Bihuan Chen, Kaifeng Huang, Bowen Shi, Congying Xu, Xin Peng, Yijian Wu, Yang Liu:
An Empirical Study of Usages, Updates and Risks of Third-Party Libraries in Java Projects. ICSME 2020: 35-45 - [c14]Chieh-Chi Kao, Bowen Shi, Ming Sun, Chao Wang:
A Joint Framework for Audio Tagging and Weakly Supervised Acoustic Event Detection Using DenseNet with Global Average Pooling. INTERSPEECH 2020: 846-850 - [c13]Carlos Bermejo, Tristan Braud, Ji Yang, Shayan Mirjafari, Bowen Shi, Yu Xiao, Pan Hui:
VIMES: A Wearable Memory Assistance System for Automatic Information Retrieval. ACM Multimedia 2020: 3191-3200 - [c12]Shubham Toshniwal, Haoyue Shi, Bowen Shi, Lingyu Gao, Karen Livescu, Kevin Gimpel:
A Cross-Task Analysis of Text Span Representations. RepL4NLP@ACL 2020: 166-176 - [c11]Kaifeng Huang, Bihuan Chen, Bowen Shi, Ying Wang, Congying Xu, Xin Peng:
Interactive, effort-aware library version harmonization. ESEC/SIGSOFT FSE 2020: 518-529 - 2019
- [c10]Chunmiao Liu, Bowen Shi, Chenglin Li, Junni Zou, Yingqi Chen, Hongkai Xiong:
Deep Neural Network-Based Algorithm Approximation via Multivariate Polynomial Regression. GLOBECOM 2019: 1-6 - [c9]Bowen Shi, Ming Sun, Chieh-Chi Kao, Viktor Rozgic, Spyros Matsoukas, Chao Wang:
Semi-supervised Acoustic Event Detection Based on Tri-training. ICASSP 2019: 750-754 - [c8]Bowen Shi, Aurora Martinez Del Rio, Jonathan Keane, Diane Brentari, Greg Shakhnarovich, Karen Livescu:
Fingerspelling Recognition in the Wild With Iterative Visual Attention. ICCV 2019: 5399-5408 - [c7]Bowen Shi, Ming Sun, Chieh-Chi Kao, Viktor Rozgic, Spyros Matsoukas, Chao Wang:
Compression of Acoustic Event Detection Models with Quantized Distillation. INTERSPEECH 2019: 3639-3643 - [c6]Ankita Pasad, Bowen Shi, Herman Kamper, Karen Livescu:
On the Contributions of Visual and Textual Supervision in Low-Resource Semantic Speech Retrieval. INTERSPEECH 2019: 4195-4199 - 2018
- [c5]Bowen Shi, Aurora Martinez Del Rio, Jonathan Keane, Jonathan Michaux, Diane Brentari, Greg Shakhnarovich, Karen Livescu:
American Sign Language Fingerspelling Recognition in the Wild. SLT 2018: 145-152 - 2017
- [c4]Bowen Shi, Karen Livescu:
Multitask training with unlabeled data for end-to-end sign language fingerspelling recognition. ASRU 2017: 389-396 - [c3]Srinivasan Venkatramanan, Sichao Wu, Bowen Shi, Achla Marathe, Madhav V. Marathe, Stephen G. Eubank, Lalit P. Sah, A. P. Giri, Luke A. Colavito, K. S. Nitin, V. Sridhar, R. Asokan, Rangaswamy Muniappan, G. Norton, Abhijin Adiga:
Towards robust models of food flows and their role in invasive species spread. IEEE BigData 2017: 435-444 - 2015
- [c2]Tianhao Wang, Bowen Shi, John Xu, Chris Gerada:
Accuracy improvement of carrier signal injection sensorless control for IPMSM in consideration of inverter nonlinearity. IECON 2015: 273-278 - [c1]Bowen Shi, Ji Yang, Zhanpeng Huang, Pan Hui:
Offloading Guidelines for Augmented Reality Applications on Wearable Devices. ACM Multimedia 2015: 1271-1274
Informal and Other Publications
- 2024
- [i52]Phillip Rust, Bowen Shi, Skyler Wang, Necati Cihan Camgöz, Jean Maillard:
Towards Privacy-Aware Sign Language Translation at Scale. CoRR abs/2402.09611 (2024) - [i51]HyoJung Han, Mohamed Anwar, Juan Pino, Wei-Ning Hsu, Marine Carpuat, Bowen Shi, Changhan Wang:
XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception. CoRR abs/2403.14402 (2024) - [i50]Jinye Shen, Bowen Shi, Weizhang Huang:
Meshfree finite difference solution of homogeneous Dirichlet problems of the fractional Laplacian. CoRR abs/2404.04407 (2024) - [i49]Chung-Ming Chien, Andros Tjandra, Apoorv Vyas, Matt Le, Bowen Shi, Wei-Ning Hsu:
Learning Fine-Grained Controllability on Speech Generation via Efficient Fine-Tuning. CoRR abs/2406.06251 (2024) - [i48]Gaël Le Lan, Bowen Shi, Zhaoheng Ni, Sidd Srinivasan, Anurag Kumar, Brian Ellis, David Kant, Varun Nagaraja, Ernie Chang, Wei-Ning Hsu, Yangyang Shi, Vikas Chandra:
High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching. CoRR abs/2407.03648 (2024) - [i47]Yue Hou, Xueyuan Chen, He Zhu, Ruomei Liu, Bowen Shi, Jiaheng Liu, Junran Wu, Ke Xu:
NC-NCD: Novel Class Discovery for Node Classification. CoRR abs/2407.17816 (2024) - [i46]Adam Polyak, Amit Zohar, Andrew Brown, Andros Tjandra, Animesh Sinha, Ann Lee, Apoorv Vyas, Bowen Shi, Chih-Yao Ma, Ching-Yao Chuang, David Yan, Dhruv Choudhary, Dingkang Wang, Geet Sethi, Guan Pang, Haoyu Ma, Ishan Misra, Ji Hou, Jialiang Wang, Kiran Jagadeesh, Kunpeng Li, Luxin Zhang, Mannat Singh, Mary Williamson, Matt Le, Matthew Yu, Mitesh Kumar Singh, Peizhao Zhang, Peter Vajda, Quentin Duval, Rohit Girdhar, Roshan Sumbaly, Sai Saketh Rambhatla, Sam S. Tsai, Samaneh Azadi, Samyak Datta, Sanyuan Chen, Sean Bell, Sharadh Ramaswamy, Shelly Sheynin, Siddharth Bhattacharya, Simran Motwani, Tao Xu, Tianhe Li, Tingbo Hou, Wei-Ning Hsu, Xi Yin, Xiaoliang Dai, Yaniv Taigman, Yaqiao Luo, Yen-Cheng Liu, Yi-Chiao Wu, Yue Zhao, Yuval Kirstain, Zecheng He, Zijian He, Albert Pumarola, Ali K. Thabet, Artsiom Sanakoyeu, Arun Mallya, Baishan Guo, Boris Araya, Breena Kerr, Carleigh Wood, Ce Liu, Cen Peng, Dmitry Vengertsev, Edgar Schönfeld, Elliot Blanchard, Felix Juefei-Xu, Fraylie Nord, Jeff Liang, John Hoffman, Jonas Kohler, Kaolin Fire, Karthik Sivakumar, Lawrence Chen, Licheng Yu, Luya Gao, Markos Georgopoulos, Rashel Moritz, Sara K. Sampson, Shikai Li, Simone Parmeggiani, Steve Fine, Tara Fowler, Vladan Petrovic, Yuming Du:
Movie Gen: A Cast of Media Foundation Models. CoRR abs/2410.13720 (2024) - [i45]K. R. Prajwal, Bowen Shi, Matthew Le, Apoorv Vyas, Andros Tjandra, Mahi Luthra, Baishan Guo, Huiyu Wang, Triantafyllos Afouras, David Kant, Wei-Ning Hsu:
MusicFlow: Cascaded Flow Matching for Text Guided Music Generation. CoRR abs/2410.20478 (2024) - [i44]Gabrielle Kaili-May Liu, Bowen Shi, Avi Caciularu, Idan Szpektor, Arman Cohan:
MDCure: A Scalable Pipeline for Multi-Document Instruction-Following. CoRR abs/2410.23463 (2024) - 2023
- [i43]Yuetian Chen, Ruohua Li, Bowen Shi, Peiru Liu, Mei Si:
Visual Story Generation Based on Emotion and Keywords. CoRR abs/2301.02777 (2023) - [i42]Han Li, Bowen Shi, Wenrui Dai, Hongwei Zheng, Botao Wang, Yu Sun, Min Guo, Chenglin Li, Junni Zou, Hongkai Xiong:
Pose-Oriented Transformer with Uncertainty-Guided Refinement for 2D-to-3D Human Pose Estimation. CoRR abs/2302.07408 (2023) - [i41]Mohamed Anwar, Bowen Shi, Vedanuj Goswami, Wei-Ning Hsu, Juan Pino, Changhan Wang:
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation. CoRR abs/2303.00628 (2023) - [i40]Ning Liao, Bowen Shi, Min Cao, Xiaopeng Zhang, Qi Tian, Junchi Yan:
Rethinking Visual Prompt Learning as Masked Visual Token Modeling. CoRR abs/2303.04998 (2023) - [i39]Junran Wu, Xueyuan Chen, Bowen Shi, Shangzhe Li, Ke Xu:
SEGA: Structural Entropy Guided Anchor View for Graph Contrastive Learning. CoRR abs/2305.04501 (2023) - [i38]Vineel Pratap, Andros Tjandra, Bowen Shi, Paden Tomasello, Arun Babu, Sayani Kundu, Ali Elkahky, Zhaoheng Ni, Apoorv Vyas, Maryam Fazel-Zarandi, Alexei Baevski, Yossi Adi, Xiaohui Zhang, Wei-Ning Hsu, Alexis Conneau, Michael Auli:
Scaling Speech Technology to 1, 000+ Languages. CoRR abs/2305.13516 (2023) - [i37]Yuetian Chen, Bowen Shi, Mei Si:
Prompt to GPT-3: Step-by-Step Thinking Instructions for Humor Generation. CoRR abs/2306.13195 (2023) - [i36]Matthew Le, Apoorv Vyas, Bowen Shi, Brian Karrer, Leda Sari, Rashel Moritz, Mary Williamson, Vimal Manohar, Yossi Adi, Jay Mahadeokar, Wei-Ning Hsu:
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale. CoRR abs/2306.15687 (2023) - [i35]Bowen Shi, Xiaopeng Zhang, Yaoming Wang, Jin Li, Wenrui Dai, Junni Zou, Hongkai Xiong, Qi Tian:
Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners. CoRR abs/2306.15876 (2023) - [i34]Hongwei Zheng, Han Li, Bowen Shi, Wenrui Dai, Botao Wang, Yu Sun, Min Guo, Hongkai Xiong:
ActionPrompt: Action-Guided 3D Human Pose Estimation With Text and Pose Prompting. CoRR abs/2307.09026 (2023) - [i33]Tu Anh Nguyen, Wei-Ning Hsu, Antony D'Avirro, Bowen Shi, Itai Gat, Maryam Fazel-Zarandi, Tal Remez, Jade Copet, Gabriel Synnaeve, Michael Hassid, Felix Kreuk, Yossi Adi, Emmanuel Dupoux:
EXPRESSO: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis. CoRR abs/2308.05725 (2023) - [i32]Bowen Shi:
Toward American Sign Language Processing in the Real World: Data, Tasks, and Methods. CoRR abs/2308.12419 (2023) - [i31]Lili Yu, Bowen Shi, Ramakanth Pasunuru, Benjamin Muller, Olga Golovneva, Tianlu Wang, Arun Babu, Binh Tang, Brian Karrer, Shelly Sheynin, Candace Ross, Adam Polyak, Russell Howes, Vasu Sharma, Puxin Xu, Hovhannes Tamoyan, Oron Ashual, Uriel Singer, Shang-Wen Li, Susan Zhang, Richard James, Gargi Ghosh, Yaniv Taigman, Maryam Fazel-Zarandi, Asli Celikyilmaz, Luke Zettlemoyer, Armen Aghajanyan:
Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning. CoRR abs/2309.02591 (2023) - [i30]Alexander H. Liu, Matt Le, Apoorv Vyas, Bowen Shi, Andros Tjandra, Wei-Ning Hsu:
Generative Pre-training for Speech with Flow Matching. CoRR abs/2310.16338 (2023) - [i29]Jin Li, Yaoming Wang, Xiaopeng Zhang, Bowen Shi, Dongsheng Jiang, Chenglin Li, Wenrui Dai, Hongkai Xiong, Qi Tian:
AiluRus: A Scalable ViT Framework for Dense Prediction. CoRR abs/2311.01197 (2023) - [i28]Kaibo Hu, Ting Lin, Bowen Shi:
Finite elements for symmetric and traceless tensors in three dimensions. CoRR abs/2311.16077 (2023) - [i27]Apoorv Vyas, Bowen Shi, Matthew Le, Andros Tjandra, Yi-Chiao Wu, Baishan Guo, Jiemin Zhang, Xinyue Zhang, Robert Adkins, William Ngan, Jeff Wang, Ivan Cruz, Bapi Akula, Akinniyi Akinyemi, Brian Ellis, Rashel Moritz, Yael Yungster, Alice Rakotoarison, Liang Tan, Chris Summers, Carleigh Wood, Joshua Lane, Mary Williamson, Wei-Ning Hsu:
Audiobox: Unified Audio Generation with Natural Language Prompts. CoRR abs/2312.15821 (2023) - 2022
- [i26]Bowen Shi, Wei-Ning Hsu, Abdelrahman Mohamed:
Robust Self-Supervised Audio-Visual Speech Recognition. CoRR abs/2201.01763 (2022) - [i25]Bowen Shi, Wei-Ning Hsu, Kushal Lakhotia, Abdelrahman Mohamed:
Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction. CoRR abs/2201.02184 (2022) - [i24]Bowen Shi, Diane Brentari, Greg Shakhnarovich, Karen Livescu:
Searching for fingerspelled content in American Sign Language. CoRR abs/2203.13291 (2022) - [i23]Bowen Shi, Abdelrahman Mohamed, Wei-Ning Hsu:
Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT. CoRR abs/2205.07180 (2022) - [i22]Bowen Shi, Diane Brentari, Greg Shakhnarovich, Karen Livescu:
Open-Domain Sign Language Translation Learned from Online Video. CoRR abs/2205.12870 (2022) - [i21]Wei-Ning Hsu, Bowen Shi:
A Single Self-Supervised Model for Many Speech Modalities Enables Zero-Shot Modality Transfer. CoRR abs/2207.07036 (2022) - [i20]Ankita Pasad, Bowen Shi, Karen Livescu:
Comparative layer-wise analysis of self-supervised speech models. CoRR abs/2211.03929 (2022) - [i19]Wei-Ning Hsu, Tal Remez, Bowen Shi, Jacob Donley, Yossi Adi:
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement. CoRR abs/2212.11377 (2022) - 2021
- [i18]Bowen Shi, Diane Brentari, Greg Shakhnarovich, Karen Livescu:
Fingerspelling Detection in American Sign Language. CoRR abs/2104.01291 (2021) - [i17]Bowen Shi, Xiaopeng Zhang, Haohang Xu, Wenrui Dai, Junni Zou, Hongkai Xiong, Qi Tian:
Multi-dataset Pretraining: A Unified Model for Semantic Segmentation. CoRR abs/2106.04121 (2021) - [i16]Han Li, Bowen Shi, Wenrui Dai, Yabo Chen, Botao Wang, Yu Sun, Min Guo, Chenglin Li, Junni Zou, Hongkai Xiong:
Hierarchical Graph Networks for 3D Human Pose Estimation. CoRR abs/2111.11927 (2021) - 2020
- [i15]Yuhui Xu, Lingxi Xie, Xiaopeng Zhang, Xin Chen, Bowen Shi, Qi Tian, Hongkai Xiong:
Latency-Aware Differentiable Neural Architecture Search. CoRR abs/2001.06392 (2020) - [i14]Bowen Shi, Ming Sun, Krishna C. Puvvada, Chieh-Chi Kao, Spyros Matsoukas, Chao Wang:
Few-shot acoustic event detection via meta-learning. CoRR abs/2002.09143 (2020) - [i13]Ying Wang, Bihuan Chen, Kaifeng Huang, Bowen Shi, Congying Xu, Xin Peng, Yang Liu, Yijian Wu:
An Empirical Study of Usages, Updates and Risks of Third-Party Libraries in Java Projects. CoRR abs/2002.11028 (2020) - [i12]Kaifeng Huang, Bihuan Chen, Bowen Shi, Ying Wang, Congying Xu, Xin Peng:
Interactive, Effort-Aware Library Version Harmonization. CoRR abs/2002.11066 (2020) - [i11]Bowen Shi, Ke Xu, Jichang Zhao:
Behavior variations and their implications for popularity promotions: From elites to mass in Weibo. CoRR abs/2004.05591 (2020) - [i10]Shubham Toshniwal, Haoyue Shi, Bowen Shi, Lingyu Gao, Karen Livescu, Kevin Gimpel:
A Cross-Task Analysis of Text Span Representations. CoRR abs/2006.03866 (2020) - [i9]Bowen Shi, Shane Settle, Karen Livescu:
Whole-Word Segmental Speech Recognition with Acoustic Word Embeddings. CoRR abs/2007.00183 (2020) - [i8]Chieh-Chi Kao, Bowen Shi, Ming Sun, Chao Wang:
A Joint Framework for Audio Tagging and Weakly Supervised Acoustic Event Detection Using DenseNet with Global Average Pooling. CoRR abs/2008.03350 (2020) - 2019
- [i7]Ankita Pasad, Bowen Shi, Herman Kamper, Karen Livescu:
On the Contributions of Visual and Textual Supervision in Low-resource Semantic Speech Retrieval. CoRR abs/1904.10947 (2019) - [i6]Bowen Shi, Ming Sun, Chieh-Chi Kao, Viktor Rozgic, Spyros Matsoukas, Chao Wang:
Semi-supervised Acoustic Event Detection based on tri-training. CoRR abs/1904.12926 (2019) - [i5]Bowen Shi, Ming Sun, Chieh-Chi Kao, Viktor Rozgic, Spyros Matsoukas, Chao Wang:
Compression of Acoustic Event Detection Models with Low-rank Matrix Factorization and Quantization Training. CoRR abs/1905.00855 (2019) - [i4]Bowen Shi, Ming Sun, Chieh-Chi Kao, Viktor Rozgic, Spyros Matsoukas, Chao Wang:
Compression of Acoustic Event Detection Models With Quantized Distillation. CoRR abs/1907.00873 (2019) - [i3]Bowen Shi, Aurora Martinez Del Rio, Jonathan Keane, Diane Brentari, Greg Shakhnarovich, Karen Livescu:
Fingerspelling recognition in the wild with iterative visual attention. CoRR abs/1908.10546 (2019) - 2018
- [i2]Bowen Shi, Aurora Martinez Del Rio, Jonathan Keane, Jonathan Michaux, Diane Brentari, Greg Shakhnarovich, Karen Livescu:
American Sign Language fingerspelling recognition in the wild. CoRR abs/1810.11438 (2018) - 2017
- [i1]Bowen Shi, Karen Livescu:
Multitask training with unlabeled data for end-to-end sign language fingerspelling recognition. CoRR abs/1710.03255 (2017)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-02 22:31 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint