Victor Rühle
Conference and Workshop Papers
- 2024
- [c8] Zhuoshi Pan, Qianhui Wu, Huiqiang Jiang, Menglin Xia, Xufang Luo, Jue Zhang, Qingwei Lin, Victor Rühle, Yuqing Yang, Chin-Yew Lin, H. Vicky Zhao, Lili Qiu, Dongmei Zhang:
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression. ACL (Findings) 2024: 963-981
- [c7] Menglin Xia, Xuchao Zhang, Camille Couturier, Guoqing Zheng, Saravan Rajmohan, Victor Rühle:
Hybrid-RACA: Hybrid Retrieval-Augmented Composition Assistance for Real-time Text Prediction. EMNLP (Industry Track) 2024: 120-131
- [c6] Dujian Ding, Ankur Mallick, Chi Wang, Robert Sim, Subhabrata Mukherjee, Victor Rühle, Laks V. S. Lakshmanan, Ahmed Hassan Awadallah:
Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing. ICLR 2024
- 2023
- [c5] Fangkai Yang, Lu Wang, Zhenyu Xu, Jue Zhang, Liqun Li, Bo Qiao, Camille Couturier, Chetan Bansal, Soumya Ram, Si Qin, Zhen Ma, Íñigo Goiri, Eli Cortez, Terry Yang, Victor Rühle, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang:
Snape: Reliable and Low-Cost Computing with Mixture of Spot and On-Demand VMs. ASPLOS (3) 2023: 631-643
- [c4] Santiago Zanella Béguelin, Lukas Wutschitz, Shruti Tople, Ahmed Salem, Victor Rühle, Andrew Paverd, Mohammad Naseri, Boris Köpf, Daniel Jones:
Bayesian Estimation of Differential Privacy. ICML 2023: 40624-40636
- 2022
- [c3] Fangkai Yang, Bowen Pang, Jue Zhang, Bo Qiao, Lu Wang, Camille Couturier, Chetan Bansal, Soumya Ram, Si Qin, Zhen Ma, Iñigo Goiri, Eli Cortez, Senthil Baladhandayutham, Victor Rühle, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang:
Spot Virtual Machine Eviction Prediction in Microsoft Cloud. WWW (Companion Volume) 2022: 152-156
- 2021
- [c2] Fatemehsadat Mireshghallah, Huseyin A. Inan, Marcello Hasegawa, Victor Rühle, Taylor Berg-Kirkpatrick, Robert Sim:
Privacy Regularization: Joint Privacy-Utility Optimization in Language Models. NAACL-HLT 2021: 3799-3807
- 2020
- [c1] Santiago Zanella Béguelin, Lukas Wutschitz, Shruti Tople, Victor Rühle, Andrew Paverd, Olga Ohrimenko, Boris Köpf, Marc Brockschmidt:
Analyzing Information Leakage of Updates to Natural Language Models. CCS 2020: 363-375
Informal and Other Publications
- 2024
- [i14] Lu Wang, Mayukh Das, Fangkai Yang, Junjie Sheng, Bo Qiao, Hang Dong, Si Qin, Victor Rühle, Chetan Bansal, Eli Cortez, Íñigo Goiri, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang:
Risk-aware Adaptive Virtual CPU Oversubscription in Microsoft Cloud via Prototypical Human-in-the-loop Imitation Learning. CoRR abs/2401.07033 (2024)
- [i13] Zhuoshi Pan, Qianhui Wu, Huiqiang Jiang, Menglin Xia, Xufang Luo, Jue Zhang, Qingwei Lin, Victor Rühle, Yuqing Yang, Chin-Yew Lin, H. Vicky Zhao, Lili Qiu, Dongmei Zhang:
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression. CoRR abs/2403.12968 (2024)
- [i12] Dujian Ding, Ankur Mallick, Chi Wang, Robert Sim, Subhabrata Mukherjee, Victor Rühle, Laks V. S. Lakshmanan, Ahmed Hassan Awadallah:
Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing. CoRR abs/2404.14618 (2024)
- [i11] Rya Sanovar, Srikant Bharadwaj, Renée St. Amant, Victor Rühle, Saravan Rajmohan:
Lean Attention: Hardware-Aware Scalable Attention Mechanism for the Decode-Phase of Transformers. CoRR abs/2405.10480 (2024)
- [i10] Kunal Jain, Anjaly Parayil, Ankur Mallick, Esha Choukse, Xiaoting Qin, Jue Zhang, Íñigo Goiri, Rujia Wang, Chetan Bansal, Victor Rühle, Anoop Kulkarni, Steve Kofsky, Saravan Rajmohan:
Intelligent Router for LLM Workloads: Improving Performance Through Workload-Aware Scheduling. CoRR abs/2408.13510 (2024)
- [i9] Shivam Shandilya, Menglin Xia, Supriyo Ghosh, Huiqiang Jiang, Jue Zhang, Qianhui Wu, Victor Rühle:
TACO-RL: Task Aware Prompt Compression Optimization with Reinforcement Learning. CoRR abs/2409.13035 (2024)
- [i8] Shaokun Zhang, Jieyu Zhang, Dujian Ding, Mirian Hipolito Garcia, Ankur Mallick, Daniel Madrigal, Menglin Xia, Victor Rühle, Qingyun Wu, Chi Wang:
EcoAct: Economic Agent Determines When to Register What Action. CoRR abs/2411.01643 (2024)
- [i7] Redwan Ibne Seraj Khan, Kunal Jain, Haiying Shen, Ankur Mallick, Anjaly Parayil, Anoop Kulkarni, Steve Kofsky, Pankhuri Choudhary, Renée St. Amant, Rujia Wang, Yue Cheng, Ali Raza Butt, Victor Rühle, Chetan Bansal, Saravan Rajmohan:
Ensuring Fair LLM Serving Amid Diverse Applications. CoRR abs/2411.15997 (2024)
- 2023
- [i6] Xuchao Zhang, Menglin Xia, Camille Couturier, Guoqing Zheng, Saravan Rajmohan, Victor Rühle:
Hybrid Retrieval-Augmented Generation for Real-time Composition Assistance. CoRR abs/2308.04215 (2023)
- [i5] Lukas Wutschitz, Boris Köpf, Andrew Paverd, Saravan Rajmohan, Ahmed Salem, Shruti Tople, Santiago Zanella Béguelin, Menglin Xia, Victor Rühle:
Rethinking Privacy in Machine Learning Pipelines from an Information Flow Control Perspective. CoRR abs/2311.15792 (2023)
- [i4] Mohammad Mahdi Derakhshani, Menglin Xia, Harkirat Behl, Cees G. M. Snoek, Victor Rühle:
Unlocking Spatial Comprehension in Text-to-Image Diffusion Models. CoRR abs/2311.17937 (2023)
- 2022
- [i3] Santiago Zanella Béguelin, Lukas Wutschitz, Shruti Tople, Ahmed Salem, Victor Rühle, Andrew Paverd, Mohammad Naseri, Boris Köpf, Daniel Jones:
Bayesian Estimation of Differential Privacy. CoRR abs/2206.05199 (2022)
- 2021
- [i2] Huseyin A. Inan, Osman Ramadan, Lukas Wutschitz, Daniel Jones, Victor Rühle, James Withers, Robert Sim:
Privacy Analysis in Language Models via Training Data Leakage Report. CoRR abs/2101.05405 (2021)
- [i1] Fatemehsadat Mireshghallah, Huseyin A. Inan, Marcello Hasegawa, Victor Rühle, Taylor Berg-Kirkpatrick, Robert Sim:
Privacy Regularization: Joint Privacy-Utility Optimization in Language Models. CoRR abs/2103.07567 (2021)
last updated on 2025-01-09 12:56 CET by the dblp team
all metadata released as open data under CC0 1.0 license