Changes

m
Just-In-Time Checkpointing: Low Cost Error Recovery from Deep Learning Training Failures
Line 9: Line 9:  
-->
 
-->
 
</noinclude>
 
</noinclude>
 +
* 2024-04-22, [https://dl.acm.org/doi/abs/10.1145/3627703.3650085 Just-In-Time Checkpointing: Low Cost Error Recovery from Deep Learning Training Failures]
 
* 2024-04-22, [https://www.dpss.inesc-id.pt/~rbruno/papers/skohli-eurosys24.pdf Pronghorn: Effective Checkpoint Orchestration for Serverless Hot-Starts]
 
 
* 2023-11-12, [https://dl.acm.org/doi/10.1145/3624062.3624254 Checkpoint/Restart for CUDA Kernels]
 
Line 17: Line 18:  
* 2022-12-05, [https://kubernetes.io/blog/2022/12/05/forensic-container-checkpointing-alpha/ Forensic container checkpointing in Kubernetes]
 
 
* 2022-11-13, [https://dl.acm.org/doi/abs/10.5555/3571885.3572000 Out of hypervisor (OoH): efficient dirty page tracking in userspace using hardware virtualization feature]
 
* 2022-08-07, [https://www.sciencedirect.com/science/article/pii/S1084804522001369 iContainer: Consecutive Checkpointing with Rapid Resilience for Immortal Container-based Services]
   
<!------------------------------------------------
 
<!------------------------------------------------
 
   This is to cut the rest of it for Main Page,
 
   This is to cut the rest of it for Main Page,
Line 27: Line 27:  
     the below stuff is now shown on the Main Page
 
     the below stuff is now shown on the Main Page
 
-------------------------------------------------->
 
-------------------------------------------------->
 +
* 2022-08-07, [https://www.sciencedirect.com/science/article/pii/S1084804522001369 iContainer: Consecutive Checkpointing with Rapid Resilience for Immortal Container-based Services]
 
* 2022-08-03, [https://ieeexplore.ieee.org/document/9844071 Demonstration of Containerized Central Unit Live Migration in 5G Radio Access Network]
 
 
* 2022-07-11, [https://www.usenix.org/conference/atc22/presentation/zhou-diyu RRC: Responsive Replicated Containers]
 