default search action
"Paella: Low-latency Model Serving with Software-defined GPU Scheduling."
Kelvin K. W. Ng, Henri Maxime Demoulin, Vincent Liu (2023)
- Kelvin K. W. Ng, Henri Maxime Demoulin, Vincent Liu:
Paella: Low-latency Model Serving with Software-defined GPU Scheduling. SOSP 2023: 595-610
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.