Changes

2,876 bytes added, 13 November
m
no edit summary
Line 9: Line 9:  
-->
 
-->
 
</noinclude>
 
</noinclude>
* 2025-02-23, [https://arxiv.org/abs/2502.16631 CRIUgpu: Transparent Checkpointing of GPU-Accelerated Workloads]
+
* 2025-11-17, [https://radostin.io/files/stoyanov-canopie-hpc-2025.pdf Engine-Agnostic Model Hot-Swapping for Cost-Effective LLM Inference]
* 2024-11-14, [https://dl.acm.org/doi/10.1145/3698038.3698510 On-demand and Parallel Checkpoint/Restore for GPU Applications]
+
* 2025-08-13, [https://www.usenix.org/conference/usenixsecurity25/presentation/li-ao Software Availability Protection in Cyber-Physical Systems]
* 2024-09-06, [https://dl.acm.org/doi/10.1145/3660319.3660330 Live Migration of Multi-Container Kubernetes Pods in Multi-Cluster Serverless Edge Systems]
+
* 2025-07-14, [https://arxiv.org/abs/2405.12079 PhoenixOS: Concurrent OS-level GPU Checkpoint and Restore with Validated Speculation]
* 2024-09-04, [https://dl.acm.org/doi/10.1145/3678015.3680477 Towards Efficient End-to-End Encryption for Container Checkpointing Systems]
+
* 2025-06-18, [https://is.muni.cz/th/ekn8q/ Improving Checkpoint/Restore Functionality in Kubernetes]
* 2024-08-21, [https://ieeexplore.ieee.org/abstract/document/10628207 The State of Container Checkpointing with CRIU: A Multi-Case Experience Report]
+
* 2025-06-17, LWN.net: [https://lwn.net/Articles/1024747/ A parallel path for GPU restore in CRIU]
* 2024-08-04, [https://dl.acm.org/doi/abs/10.1145/3672197.3673432 Custom Page Fault Handling With eBPF]
+
* 2025-06-13, [https://radostin.io/files/vspisakova-jsspp25.pdf Kubernetes Scheduling with Checkpoint/Restore: Challenges and Open Problems]
* 2024-08-03, [https://dl.acm.org/doi/10.1145/3663408.3663416 Software-based Live Migration for Containerized RDMA]
  −
* 2024-07-30, [https://ieeexplore.ieee.org/abstract/document/10606135 Packet Buffering to Minimize Service Downtime and Packet Loss During Redundancy Switchover]
  −
* 2024-07-30, [https://dl.acm.org/doi/abs/10.1145/3664476.3670895 Don't, Stop, Drop, Pause: Forensics of CONtainer CheckPOINTs (ConPoint)]
   
<!------------------------------------------------
   This is to cut the rest of it for Main Page,
Line 27: Line 24:  
     the below stuff is now shown on the Main Page
-------------------------------------------------->
 +
* 2025-06-12, [https://doi.org/10.1109/TNSM.2025.3579051 MOSE: A Novel Orchestration Framework for Stateful Microservice Migration at the Edge]
 +
* 2025-05-14, [https://doi.org/10.1145/3672608.3707723 Elastic Vertical Memory Management for Container-based Stateful Applications in Kubernetes]
 +
* 2025-05-02, [https://doi.org/10.1007/s42514-025-00227-0 Practice and Observation: Live Migration for MPI Workload]
 +
* 2025-04-28, [https://www.usenix.org/conference/nsdi25/presentation/segarra GRANNY: Granular Management of Compute-Intensive Applications in the Cloud]
 +
* 2025-04-28, [https://ieeexplore.ieee.org/abstract/document/10979504 KubeSPT: Stateful Pod Teleportation for Service Resilience with Live Migration]
 +
* 2025-04-04, [https://doi.org/10.1109/ICIN64016.2025.10942720 Optimizing Stateful Microservice Migration in Kubernetes with MS2M and Forensic Checkpointing]
 +
* 2025-03-30, [https://doi.org/10.1145/3676641.3715988 CXLfork: Fast Remote Fork over CXL Fabrics]
 +
* 2025-02-23, [https://arxiv.org/abs/2502.16631 CRIUgpu: Transparent Checkpointing of GPU-Accelerated Workloads]
 +
* 2025-02-05, [https://doi.org/10.1007/s00607-025-01447-6 A Comprehensive Performance Evaluation of Container Migration Strategies]
 +
* 2024-11-21, [https://dl.acm.org/doi/10.1145/3698038.3698510 On-demand and Parallel Checkpoint/Restore for GPU Applications]
 +
* 2024-11-20, [https://dl.acm.org/doi/10.1145/3698038.3698513 Snapipeline: Accelerating Snapshot Startup for FaaS Containers]
 +
* 2024-09-06, [https://dl.acm.org/doi/10.1145/3660319.3660330 Live Migration of Multi-Container Kubernetes Pods in Multi-Cluster Serverless Edge Systems]
 +
* 2024-09-04, [https://dl.acm.org/doi/10.1145/3678015.3680477 Towards Efficient End-to-End Encryption for Container Checkpointing Systems]
 +
* 2024-09-02, [https://doi.org/10.1016/j.future.2024.107495 CSMD: Container state management for deployment in cloud data centers]
 +
* 2024-08-21, [https://ieeexplore.ieee.org/abstract/document/10628207 The State of Container Checkpointing with CRIU: A Multi-Case Experience Report]
 +
* 2024-08-04, [https://dl.acm.org/doi/abs/10.1145/3672197.3673432 Custom Page Fault Handling With eBPF]
 +
* 2024-08-03, [https://dl.acm.org/doi/10.1145/3663408.3663416 Software-based Live Migration for Containerized RDMA]
 +
* 2024-07-30, [https://ieeexplore.ieee.org/abstract/document/10606135 Packet Buffering to Minimize Service Downtime and Packet Loss During Redundancy Switchover]
 +
* 2024-07-30, [https://dl.acm.org/doi/abs/10.1145/3664476.3670895 Don't, Stop, Drop, Pause: Forensics of CONtainer CheckPOINTs (ConPoint)]
 
* 2024-07-25, [https://doi.org/10.1186/s13677-024-00687-9 MDB-KCP: persistence framework of in-memory database with CRIU-based container checkpoint in Kubernetes]
 
* 2024-07-23, [https://ieeexplore.ieee.org/abstract/document/10631042 Dapper: A Lightweight and Extensible Framework for Live Program State Rewriting]
 
Line 33: Line 49:  
* 2024-06-19, [https://arxiv.org/abs/2406.13856 Kishu: Time-Traveling for Computational Notebooks]
 
* 2024-06-09, [https://dl.acm.org/doi/abs/10.1145/3626246.3654752 Demonstration of ElasticNotebook: Migrating Live Computational Notebook States]
 
* 2024-05-20, [https://arxiv.org/abs/2405.12079 ParallelGPUOS: A Concurrent OS-level GPU Checkpoint and Restore System using Validated Speculation]
+
* 2024-05-30, [https://is.muni.cz/th/tadf0/phd-thesis-proposal-digital.pdf In the Container Era: A Coup in Reliable Computing Over Unreliable Infrastructure]
 +
* 2024-05-20, [https://arxiv.org/abs/2405.12079v1 ParallelGPUOS: A Concurrent OS-level GPU Checkpoint and Restore System using Validated Speculation]
 
* 2024-05-09, [https://www.sciencedirect.com/science/article/pii/S1383762124000948 Practicable live container migrations in high performance computing clouds: Diskless, iterative, and connection-persistent]
 
* 2024-05-06, [https://ieeexplore.ieee.org/abstract/document/10701375 Workload-Aware Live Migratable Cloud Instance Detector]
 
Line 40: Line 57:  
* 2024-04-22, [https://dl.acm.org/doi/abs/10.1145/3627703.3650085 Just-In-Time Checkpointing: Low Cost Error Recovery from Deep Learning Training Failures]
 
* 2024-04-22, [https://dl.acm.org/doi/10.1145/3627703.3629556 Pronghorn: Effective Checkpoint Orchestration for Serverless Hot-Starts]
 
 +
* 2024-02-09, [https://ejournal.unitomo.ac.id/index.php/inform/article/view/7498/3738 Forensic Analysis of Podman Container Towards Metasploit Backdoor Using Checkpointctl]
 
* 2024-01-29, [https://www.sciencedirect.com/science/article/pii/S0167739X24000190 Prebaking runtime environments to improve the FaaS cold start latency]
 
* 2023-11-27, [https://dl.acm.org/doi/abs/10.1145/3590140.3629121 DynaCut: A Framework for Dynamic and Adaptive Program Customization]
 
Line 61: Line 79:  
* 2022-07-11, [https://www.usenix.org/conference/atc22/presentation/zhou-diyu RRC: Responsive Replicated Containers]
 
* 2022-05-25, [https://hal.inria.fr/hal-03587358/ Good Shepherds Care For Their Cattle: Seamless Pod Migration in Geo-Distributed Kubernetes]
 
 +
* 2022-05-06, [https://doi.org/10.1145/3477314.3507221 An architecture proposal for checkpoint/restore on stateful containers]
 
* 2022-04-24, [https://www.ndss-symposium.org/ndss-paper/auto-draft-295/ FitM: Binary-Only Coverage-Guided Fuzzing for Stateful Network Protocols]
 
* 2022-03-01, [https://systex22.github.io/papers/systex22-final71.pdf Transparent, Cross-ISA Enclave Offloading]
 
* 2022-02-25, [https://dl.acm.org/doi/abs/10.1145/3516807.3516817 Portkey: Hypervisor-Assisted Container Migration in Nested Cloud Environments]
 
* 2022-02-16, [https://arxiv.org/abs/2202.07848 Singularity: Planet-Scale, Preemptible and Elastic Scheduling of AI Workloads]
 
 +
* 2022-02-08, [https://doi.org/10.48550/arXiv.2202.03643 SNPSFuzzer: A Fast Greybox Fuzzer for Stateful Network Protocols using Snapshots]
 
* 2021-12-17, [https://hal.archives-ouvertes.fr/hal-03487607/document Standard-compliant parallel SystemC simulation of loosely-timed transaction level models: From baremetal to Linux-based applications support]
 
* 2021-07-14, [https://www.usenix.org/conference/atc21/presentation/planeta MigrOS: Transparent Live-Migration Support for Containerised RDMA Applications]
 
Line 128: Line 148:  
* 2015-04-22, TuxDiary [http://tuxdiary.com/2015/04/22/dump-debug-resume-process-criu/ Dump, debug, resume process with criu]
 
* 2014-12-12, Symposium on Information and Communication Systems (SInCom 2014) [https://lisas.de/~adrian/proceedingsSInCom2014.pdf Checkpoint/Restore in User-Space with Open MPI]
 
 +
* 2014-11-03, [https://dl.acm.org/doi/10.1145/2660267.2660329 From Patches to Honey-Patches: Lightweight Attacker Misdirection, Deception, and Disinformation]
 
* 2014-09-31, [http://www.reuters.com/article/wa-parallels-idUSnBw035202a+100+BSW20141103 Parallels Surpasses One Million Deployed Virtual Containers]
 
* 2014-08-01, ADMIN magazine: [http://www.admin-magazine.com/Archive/2014/22/Save-and-Restore-Linux-Processes-with-CRIU Save and Restore Linux Processes with CRIU]
 