Changes

Jump to navigation Jump to search
148 bytes added ,  7 October
m
ParallelGPUOS: A Concurrent OS-level GPU Checkpoint and Restore System using Validated Speculation
Line 14: Line 14:  
* 2024-07-25, [https://doi.org/10.1186/s13677-024-00687-9 MDB-KCP: persistence framework of in-memory database with CRIU-based container checkpoint in Kubernetes]
 
* 2024-07-25, [https://doi.org/10.1186/s13677-024-00687-9 MDB-KCP: persistence framework of in-memory database with CRIU-based container checkpoint in Kubernetes]
 
* 2024-06-03, [https://dl.acm.org/doi/10.1145/3660319.3660330 Live Migration of Multi-Container Kubernetes Pods in Multi-Cluster Serverless Edge Systems]
 
* 2024-06-03, [https://dl.acm.org/doi/10.1145/3660319.3660330 Live Migration of Multi-Container Kubernetes Pods in Multi-Cluster Serverless Edge Systems]
 +
* 2024-05-20, [https://arxiv.org/abs/2405.12079 ParallelGPUOS: A Concurrent OS-level GPU Checkpoint and Restore System using Validated Speculation]
 
* 2024-04-22, [https://dl.acm.org/doi/abs/10.1145/3627703.3650085 Just-In-Time Checkpointing: Low Cost Error Recovery from Deep Learning Training Failures]
 
* 2024-04-22, [https://dl.acm.org/doi/abs/10.1145/3627703.3650085 Just-In-Time Checkpointing: Low Cost Error Recovery from Deep Learning Training Failures]
 
* 2024-04-22, [https://www.dpss.inesc-id.pt/~rbruno/papers/skohli-eurosys24.pdf Pronghorn: Effective Checkpoint Orchestration for Serverless Hot-Starts]
 
* 2024-04-22, [https://www.dpss.inesc-id.pt/~rbruno/papers/skohli-eurosys24.pdf Pronghorn: Effective Checkpoint Orchestration for Serverless Hot-Starts]
 
* 2024-01-29, [https://www.sciencedirect.com/science/article/pii/S0167739X24000190 Prebaking runtime environments to improve the FaaS cold start latency]
 
* 2024-01-29, [https://www.sciencedirect.com/science/article/pii/S0167739X24000190 Prebaking runtime environments to improve the FaaS cold start latency]
* 2023-11-12, [https://dl.acm.org/doi/10.1145/3624062.3624254 Checkpoint/Restart for CUDA Kernels]
   
<!------------------------------------------------
 
<!------------------------------------------------
 
   This is to cut the rest of it for Main Page,
 
   This is to cut the rest of it for Main Page,
Line 27: Line 27:  
     the below stuff is now shown on the Main Page
 
     the below stuff is now shown on the Main Page
 
-------------------------------------------------->
 
-------------------------------------------------->
 +
* 2023-11-12, [https://dl.acm.org/doi/10.1145/3624062.3624254 Checkpoint/Restart for CUDA Kernels]
 
* 2023-10-23, [https://dl.acm.org/doi/10.1145/3605181.3626289 Evicting for the greater good: The Case for Reactive Checkpointing in Serverless Computing]
 
* 2023-10-23, [https://dl.acm.org/doi/10.1145/3605181.3626289 Evicting for the greater good: The Case for Reactive Checkpointing in Serverless Computing]
 
* 2023-04-20, [https://www.mdpi.com/2504-446X/7/5/286 A Dynamic Checkpoint Interval Decision Algorithm for Live Migration-Based Drone-Recovery System]
 
* 2023-04-20, [https://www.mdpi.com/2504-446X/7/5/286 A Dynamic Checkpoint Interval Decision Algorithm for Live Migration-Based Drone-Recovery System]
332

edits

Navigation menu