| Line 9: | Line 9: |
| | --> | | --> |
| | </noinclude> | | </noinclude> |
| | + | * 2025-11-17, [https://radostin.io/files/stoyanov-canopie-hpc-2025.pdf Engine-Agnostic Model Hot-Swapping for Cost-Effective LLM Inference] |
| | + | * 2025-08-13, [https://www.usenix.org/conference/usenixsecurity25/presentation/li-ao Software Availability Protection in Cyber-Physical Systems] |
| | + | * 2025-07-14, [https://arxiv.org/abs/2405.12079 PhoenixOS: Concurrent OS-level GPU Checkpoint and Restore with Validated Speculation] |
| | + | * 2025-06-18, [https://is.muni.cz/th/ekn8q/ Improving Checkpoint/Restore Functionality in Kubernetes] |
| | + | * 2025-06-17, LWN.net: [https://lwn.net/Articles/1024747/ A parallel path for GPU restore in CRIU] |
| | + | * 2025-06-13, [https://radostin.io/files/vspisakova-jsspp25.pdf Kubernetes Scheduling with Checkpoint/Restore: Challenges and Open Problems] |
| | + | <!------------------------------------------------ |
| | + | This is to cut the rest of it for Main Page, |
| | + | adding the More... link instead. |
| | + | Make sure to move this whole block up from time to time. |
| | + | --> |
| | + | <includeonly>: '''[[Articles|More external articles...]]'''</includeonly><noinclude> |
| | + | <!-- |
| | + | the below stuff is now shown on the Main Page |
| | + | --------------------------------------------------> |
| | + | * 2025-06-12, [https://doi.org/10.1109/TNSM.2025.3579051 MOSE: A Novel Orchestration Framework for Stateful Microservice Migration at the Edge] |
| | + | * 2025-05-14, [https://doi.org/10.1145/3672608.3707723 Elastic Vertical Memory Management for Container-based Stateful Applications in Kubernetes] |
| | + | * 2025-05-02, [https://doi.org/10.1007/s42514-025-00227-0 Practice and Observation: Live Migration for MPI Workload] |
| | + | * 2025-04-28, [https://www.usenix.org/conference/nsdi25/presentation/segarra GRANNY: Granular Management of Compute-Intensive Applications in the Cloud] |
| | + | * 2025-04-28, [https://ieeexplore.ieee.org/abstract/document/10979504 KubeSPT: Stateful Pod Teleportation for Service Resilience with Live Migration] |
| | + | * 2025-04-04, [https://doi.org/10.1109/ICIN64016.2025.10942720 Optimizing Stateful Microservice Migration in Kubernetes with MS2M and Forensic Checkpointing] |
| | + | * 2025-03-30, [https://doi.org/10.1145/3676641.3715988 CXLfork: Fast Remote Fork over CXL Fabrics] |
| | + | * 2025-02-23, [https://arxiv.org/abs/2502.16631 CRIUgpu: Transparent Checkpointing of GPU-Accelerated Workloads] |
| | + | * 2025-02-05, [https://doi.org/10.1007/s00607-025-01447-6 A Comprehensive Performance Evaluation of Container Migration Strategies] |
| | + | * 2024-11-21, [https://dl.acm.org/doi/10.1145/3698038.3698510 On-demand and Parallel Checkpoint/Restore for GPU Applications] |
| | + | * 2024-11-20, [https://dl.acm.org/doi/10.1145/3698038.3698513 Snapipeline: Accelerating Snapshot Startup for FaaS Containers] |
| | * 2024-09-06, [https://dl.acm.org/doi/10.1145/3660319.3660330 Live Migration of Multi-Container Kubernetes Pods in Multi-Cluster Serverless Edge Systems] | | * 2024-09-06, [https://dl.acm.org/doi/10.1145/3660319.3660330 Live Migration of Multi-Container Kubernetes Pods in Multi-Cluster Serverless Edge Systems] |
| | * 2024-09-04, [https://dl.acm.org/doi/10.1145/3678015.3680477 Towards Efficient End-to-End Encryption for Container Checkpointing Systems] | | * 2024-09-04, [https://dl.acm.org/doi/10.1145/3678015.3680477 Towards Efficient End-to-End Encryption for Container Checkpointing Systems] |
| | + | * 2024-09-02, [https://doi.org/10.1016/j.future.2024.107495 CSMD: Container state management for deployment in cloud data centers] |
| | + | * 2024-08-21, [https://ieeexplore.ieee.org/abstract/document/10628207 The State of Container Checkpointing with CRIU: A Multi-Case Experience Report] |
| | * 2024-08-04, [https://dl.acm.org/doi/abs/10.1145/3672197.3673432 Custom Page Fault Handling With eBPF] | | * 2024-08-04, [https://dl.acm.org/doi/abs/10.1145/3672197.3673432 Custom Page Fault Handling With eBPF] |
| | * 2024-08-03, [https://dl.acm.org/doi/10.1145/3663408.3663416 Software-based Live Migration for Containerized RDMA] | | * 2024-08-03, [https://dl.acm.org/doi/10.1145/3663408.3663416 Software-based Live Migration for Containerized RDMA] |
| Line 18: | Line 46: |
| | * 2024-07-23, [https://ieeexplore.ieee.org/abstract/document/10631042 Dapper: A Lightweight and Extensible Framework for Live Program State Rewriting] | | * 2024-07-23, [https://ieeexplore.ieee.org/abstract/document/10631042 Dapper: A Lightweight and Extensible Framework for Live Program State Rewriting] |
| | * 2024-07-07, [https://ieeexplore.ieee.org/abstract/document/10643902 FastMig: Leveraging FastFreeze to Establish Robust Service Liquidity in Cloud 2.0] | | * 2024-07-07, [https://ieeexplore.ieee.org/abstract/document/10643902 FastMig: Leveraging FastFreeze to Establish Robust Service Liquidity in Cloud 2.0] |
| − | <!------------------------------------------------ | + | * 2024-07-02, [https://developer.nvidia.com/blog/checkpointing-cuda-applications-with-criu/ Checkpointing CUDA Applications with CRIU] |
| − | This is to cut the rest of it for Main Page, | |
| − | adding the More... link instead. | |
| − | Make sure to move this whole block up from time to time. | |
| − | --> | |
| − | <includeonly>: '''[[Articles|More external articles...]]'''</includeonly><noinclude> | |
| − | <!-- | |
| − | the below stuff is now shown on the Main Page | |
| − | --------------------------------------------------> | |
| | * 2024-06-19, [https://arxiv.org/abs/2406.13856 Kishu: Time-Traveling for Computational Notebooks] | | * 2024-06-19, [https://arxiv.org/abs/2406.13856 Kishu: Time-Traveling for Computational Notebooks] |
| − | * 2024-05-20, [https://arxiv.org/abs/2405.12079 ParallelGPUOS: A Concurrent OS-level GPU Checkpoint and Restore System using Validated Speculation] | + | * 2024-06-09, [https://dl.acm.org/doi/abs/10.1145/3626246.3654752 Demonstration of ElasticNotebook: Migrating Live Computational Notebook States] |
| | + | * 2024-05-30, [https://is.muni.cz/th/tadf0/phd-thesis-proposal-digital.pdf In the Container Era: A Coup in Reliable Computing Over Unreliable Infrastructure] |
| | + | * 2024-05-20, [https://arxiv.org/abs/2405.12079v1 ParallelGPUOS: A Concurrent OS-level GPU Checkpoint and Restore System using Validated Speculation] |
| | * 2024-05-09, [https://www.sciencedirect.com/science/article/pii/S1383762124000948 Practicable live container migrations in high performance computing clouds: Diskless, iterative, and connection-persistent] | | * 2024-05-09, [https://www.sciencedirect.com/science/article/pii/S1383762124000948 Practicable live container migrations in high performance computing clouds: Diskless, iterative, and connection-persistent] |
| | * 2024-05-06, [https://ieeexplore.ieee.org/abstract/document/10701375 Workload-Aware Live Migratable Cloud Instance Detector] | | * 2024-05-06, [https://ieeexplore.ieee.org/abstract/document/10701375 Workload-Aware Live Migratable Cloud Instance Detector] |
| | * 2024-05-06, [https://ieeexplore.ieee.org/abstract/document/10707218 Migration of Isolated Application Across Heterogeneous Edge Systems] | | * 2024-05-06, [https://ieeexplore.ieee.org/abstract/document/10707218 Migration of Isolated Application Across Heterogeneous Edge Systems] |
| | + | * 2024-04-26, [https://fis.tu-dresden.de/portal/files/53673228/planeta_bearb_pref2b_20240912193924.pdf Fine-grained OS Control over High-performance Networking] |
| | * 2024-04-22, [https://dl.acm.org/doi/abs/10.1145/3627703.3650085 Just-In-Time Checkpointing: Low Cost Error Recovery from Deep Learning Training Failures] | | * 2024-04-22, [https://dl.acm.org/doi/abs/10.1145/3627703.3650085 Just-In-Time Checkpointing: Low Cost Error Recovery from Deep Learning Training Failures] |
| − | * 2024-04-22, [https://www.dpss.inesc-id.pt/~rbruno/papers/skohli-eurosys24.pdf Pronghorn: Effective Checkpoint Orchestration for Serverless Hot-Starts] | + | * 2024-04-22, [https://dl.acm.org/doi/10.1145/3627703.3629556 Pronghorn: Effective Checkpoint Orchestration for Serverless Hot-Starts] |
| | + | * 2024-02-09, [https://ejournal.unitomo.ac.id/index.php/inform/article/view/7498/3738 Forensic Analysis of Podman Container Towards Metasploit Backdoor Using Checkpointctl] |
| | * 2024-01-29, [https://www.sciencedirect.com/science/article/pii/S0167739X24000190 Prebaking runtime environments to improve the FaaS cold start latency] | | * 2024-01-29, [https://www.sciencedirect.com/science/article/pii/S0167739X24000190 Prebaking runtime environments to improve the FaaS cold start latency] |
| | * 2023-11-27, [https://dl.acm.org/doi/abs/10.1145/3590140.3629121 DynaCut: A Framework for Dynamic and Adaptive Program Customization] | | * 2023-11-27, [https://dl.acm.org/doi/abs/10.1145/3590140.3629121 DynaCut: A Framework for Dynamic and Adaptive Program Customization] |
| Line 39: | Line 63: |
| | * 2023-11-10, [https://ieeexplore.ieee.org/abstract/document/10314806 Design, Modeling, and Implementation of Robust Migration of Stateful Edge Microservices] | | * 2023-11-10, [https://ieeexplore.ieee.org/abstract/document/10314806 Design, Modeling, and Implementation of Robust Migration of Stateful Edge Microservices] |
| | * 2023-10-23, [https://dl.acm.org/doi/10.1145/3605181.3626289 Evicting for the greater good: The Case for Reactive Checkpointing in Serverless Computing] | | * 2023-10-23, [https://dl.acm.org/doi/10.1145/3605181.3626289 Evicting for the greater good: The Case for Reactive Checkpointing in Serverless Computing] |
| | + | * 2023-10-01, [https://dl.acm.org/doi/10.14778/3626292.3626296 ElasticNotebook: Enabling Live Migration for Computational Notebooks] |
| | * 2023-09-25, [https://ieeexplore.ieee.org/abstract/document/10419298 Transparent Fault Tolerance for Stateful Applications in Kubernetes with Checkpoint/Restore] | | * 2023-09-25, [https://ieeexplore.ieee.org/abstract/document/10419298 Transparent Fault Tolerance for Stateful Applications in Kubernetes with Checkpoint/Restore] |
| | * 2023-07-21, [https://vtechworks.lib.vt.edu/items/20cd28e6-1dba-4c21-b221-59f5f345205f CRIU-RTX: Remote Thread eXecution using Checkpoint/Restore in Userspace] | | * 2023-07-21, [https://vtechworks.lib.vt.edu/items/20cd28e6-1dba-4c21-b221-59f5f345205f CRIU-RTX: Remote Thread eXecution using Checkpoint/Restore in Userspace] |
| | + | * 2023-07-10, [https://www.usenix.org/conference/osdi23/presentation/wei-rdma No Provisioned Concurrency: Fast RDMA-codesigned Remote Fork for Serverless Computing] |
| | * 2023-07-06, [https://ieeexplore.ieee.org/abstract/document/10207336 Microservice Debugging with Checkpoint-Restart] | | * 2023-07-06, [https://ieeexplore.ieee.org/abstract/document/10207336 Microservice Debugging with Checkpoint-Restart] |
| | * 2023-05-28, [https://ieeexplore.ieee.org/abstract/document/10278877 Processing-Aware Migration Model for Stateful Edge Microservices] | | * 2023-05-28, [https://ieeexplore.ieee.org/abstract/document/10278877 Processing-Aware Migration Model for Stateful Edge Microservices] |
| | * 2023-04-20, [https://www.mdpi.com/2504-446X/7/5/286 A Dynamic Checkpoint Interval Decision Algorithm for Live Migration-Based Drone-Recovery System] | | * 2023-04-20, [https://www.mdpi.com/2504-446X/7/5/286 A Dynamic Checkpoint Interval Decision Algorithm for Live Migration-Based Drone-Recovery System] |
| | * 2023-03-10, [https://kubernetes.io/blog/2023/03/10/forensic-container-analysis/ Forensic Container Analysis] | | * 2023-03-10, [https://kubernetes.io/blog/2023/03/10/forensic-container-analysis/ Forensic Container Analysis] |
| | + | * 2023-01-31, [https://vtechworks.lib.vt.edu/items/ba974ad9-eac9-4306-b3fc-5f0411b89b99 HetMigrate: Secure and Efficient Cross-architecture Process Live Migration] |
| | * 2023-01-14, [https://arxiv.org/abs/2301.05861 Async-fork: Mitigating Query Latency Spikes Incurred by the Fork-based Snapshot Mechanism from the OS Level] | | * 2023-01-14, [https://arxiv.org/abs/2301.05861 Async-fork: Mitigating Query Latency Spikes Incurred by the Fork-based Snapshot Mechanism from the OS Level] |
| | * 2023-01-10, [https://ieeexplore.ieee.org/abstract/document/10077919 A Container Pre-copy Migration Method Based on Dirty Page Prediction and Compression] | | * 2023-01-10, [https://ieeexplore.ieee.org/abstract/document/10077919 A Container Pre-copy Migration Method Based on Dirty Page Prediction and Compression] |
| − | * 2022-12-05, [https://kubernetes.io/blog/2022/12/05/forensic-container-checkpointing-alpha/ Forensic container checkpointing in Kubernetes] | |
| | * 2022-11-13, [https://dl.acm.org/doi/abs/10.5555/3571885.3572000 Out of hypervisor (OoH): efficient dirty page tracking in userspace using hardware virtualization feature] | | * 2022-11-13, [https://dl.acm.org/doi/abs/10.5555/3571885.3572000 Out of hypervisor (OoH): efficient dirty page tracking in userspace using hardware virtualization feature] |
| | * 2022-08-07, [https://www.sciencedirect.com/science/article/pii/S1084804522001369 iContainer: Consecutive Checkpointing with Rapid Resilience for Immortal Container-based Services] | | * 2022-08-07, [https://www.sciencedirect.com/science/article/pii/S1084804522001369 iContainer: Consecutive Checkpointing with Rapid Resilience for Immortal Container-based Services] |
| Line 53: | Line 79: |
| | * 2022-07-11, [https://www.usenix.org/conference/atc22/presentation/zhou-diyu RRC: Responsive Replicated Containers] | | * 2022-07-11, [https://www.usenix.org/conference/atc22/presentation/zhou-diyu RRC: Responsive Replicated Containers] |
| | * 2022-05-25, [https://hal.inria.fr/hal-03587358/ Good Shepherds Care For Their Cattle: Seamless Pod Migration in Geo-Distributed Kubernetes] | | * 2022-05-25, [https://hal.inria.fr/hal-03587358/ Good Shepherds Care For Their Cattle: Seamless Pod Migration in Geo-Distributed Kubernetes] |
| | + | * 2022-05-06, [https://doi.org/10.1145/3477314.3507221 An architecture proposal for checkpoint/restore on stateful containers] |
| | * 2022-04-24, [https://www.ndss-symposium.org/ndss-paper/auto-draft-295/ FitM: Binary-Only Coverage-Guided Fuzzing for Stateful Network Protocols] | | * 2022-04-24, [https://www.ndss-symposium.org/ndss-paper/auto-draft-295/ FitM: Binary-Only Coverage-Guided Fuzzing for Stateful Network Protocols] |
| | + | * 2022-03-01, [https://systex22.github.io/papers/systex22-final71.pdf Transparent, Cross-ISA Enclave Offloading] |
| | * 2022-02-25, [https://dl.acm.org/doi/abs/10.1145/3516807.3516817 Portkey: Hypervisor-Assisted Container Migration in Nested Cloud Environments] | | * 2022-02-25, [https://dl.acm.org/doi/abs/10.1145/3516807.3516817 Portkey: Hypervisor-Assisted Container Migration in Nested Cloud Environments] |
| | * 2022-02-16, [https://arxiv.org/abs/2202.07848 Singularity: Planet-Scale, Preemptible and Elastic Scheduling of AI Workloads] | | * 2022-02-16, [https://arxiv.org/abs/2202.07848 Singularity: Planet-Scale, Preemptible and Elastic Scheduling of AI Workloads] |
| | + | * 2022-02-08, [https://doi.org/10.48550/arXiv.2202.03643 SNPSFuzzer: A Fast Greybox Fuzzer for Stateful Network Protocols using Snapshots] |
| | * 2021-12-17, [https://hal.archives-ouvertes.fr/hal-03487607/document Standard-compliant parallel SystemC simulation of loosely-timed transaction level models: From baremetal to Linux-based applications support] | | * 2021-12-17, [https://hal.archives-ouvertes.fr/hal-03487607/document Standard-compliant parallel SystemC simulation of loosely-timed transaction level models: From baremetal to Linux-based applications support] |
| − | * 2021-08-13, [https://doi.org/10.11591/eei.v10i2.2742 Live migration using checkpoint and restore in userspace (CRIU): Usage analysis of network, memory and CPU] | |
| | * 2021-07-14, [https://www.usenix.org/conference/atc21/presentation/planeta MigrOS: Transparent Live-Migration Support for Containerised RDMA Applications] | | * 2021-07-14, [https://www.usenix.org/conference/atc21/presentation/planeta MigrOS: Transparent Live-Migration Support for Containerised RDMA Applications] |
| | * 2021-07-06, [https://onlinelibrary.wiley.com/doi/10.1002/cpe.6474 Cricket: A virtualization layer for distributed execution of CUDA applications with checkpoint/restart support] | | * 2021-07-06, [https://onlinelibrary.wiley.com/doi/10.1002/cpe.6474 Cricket: A virtualization layer for distributed execution of CUDA applications with checkpoint/restart support] |
| Line 90: | Line 118: |
| | * 2018-10-13, [https://dl.acm.org/citation.cfm?id=3290626 Linux Process Tree Reconstruction Using The Attributed Grammar-Based Tree Transformation Model] | | * 2018-10-13, [https://dl.acm.org/citation.cfm?id=3290626 Linux Process Tree Reconstruction Using The Attributed Grammar-Based Tree Transformation Model] |
| | * 2018-10-10, [https://podman.io/blogs/2018/10/10/checkpoint-restore.html Adding checkpoint/restore support to Podman] | | * 2018-10-10, [https://podman.io/blogs/2018/10/10/checkpoint-restore.html Adding checkpoint/restore support to Podman] |
| | + | * 2018-10-08, [https://www.usenix.org/conference/osdi18/presentation/xiao Gandiva: Introspective Cluster Scheduling for Deep Learning] |
| | * 2018-09-15, [https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=8539562 Stateful Container Migration employing Checkpoint-based Restoration for Orchestrated Container Clusters] | | * 2018-09-15, [https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=8539562 Stateful Container Migration employing Checkpoint-based Restoration for Orchestrated Container Clusters] |
| | * 2018-09-07, [https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=8502659 Container Live Migration for Latency Critical Industrial Applications on Edge Computing] | | * 2018-09-07, [https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=8502659 Container Live Migration for Latency Critical Industrial Applications on Edge Computing] |
| | * 2018-08-15, University of Maryland: [https://drum.lib.umd.edu/bitstream/handle/1903/20499/CS-TR-5056.pdf Fast and Service-preserving Recovery from Malware Infections Using CRIU] | | * 2018-08-15, University of Maryland: [https://drum.lib.umd.edu/bitstream/handle/1903/20499/CS-TR-5056.pdf Fast and Service-preserving Recovery from Malware Infections Using CRIU] |
| | * 2018-07-31, [https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6131214/ Hot-starting software containers for STAR aligner] | | * 2018-07-31, [https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6131214/ Hot-starting software containers for STAR aligner] |
| − | * 2018-07-07, Moscow Institute of Physics and Technology: [https://pdfs.semanticscholar.org/9ac4/f8ab4fd0492bfdc503831f60a5ce3d1d50a5.pdf?_ga=2.17262585.1140385641.1554239661-2109847679.1554239661 Using CRIU with HPC Containers: Field Experience] | |
| | * 2018-06-28, University of Aberdeen: [https://link.springer.com/chapter/10.1007/978-3-030-02465-9_13 Efficient Live Migration of Linux Containers] | | * 2018-06-28, University of Aberdeen: [https://link.springer.com/chapter/10.1007/978-3-030-02465-9_13 Efficient Live Migration of Linux Containers] |
| | * 2018-03-24, [https://www.smitechow.com/2018/03/compile-criu-on-centos-6.html Compile CRIU on CentOS 6] | | * 2018-03-24, [https://www.smitechow.com/2018/03/compile-criu-on-centos-6.html Compile CRIU on CentOS 6] |
| Line 120: | Line 148: |
| | * 2015-04-22, TuxDiary [http://tuxdiary.com/2015/04/22/dump-debug-resume-process-criu/ Dump, debug, resume process with criu] | | * 2015-04-22, TuxDiary [http://tuxdiary.com/2015/04/22/dump-debug-resume-process-criu/ Dump, debug, resume process with criu] |
| | * 2014-12-12, Symposium on Information and Communication Systems (SInCom 2014) [https://lisas.de/~adrian/proceedingsSInCom2014.pdf Checkpoint/Restore in User-Space with Open MPI] | | * 2014-12-12, Symposium on Information and Communication Systems (SInCom 2014) [https://lisas.de/~adrian/proceedingsSInCom2014.pdf Checkpoint/Restore in User-Space with Open MPI] |
| | + | * 2014-11-03, [https://dl.acm.org/doi/10.1145/2660267.2660329 From Patches to Honey-Patches: Lightweight Attacker Misdirection, Deception, and Disinformation] |
| | * 2014-09-31, [http://www.reuters.com/article/wa-parallels-idUSnBw035202a+100+BSW20141103 Parallels Surpasses One Million Deployed Virtual Containers] | | * 2014-09-31, [http://www.reuters.com/article/wa-parallels-idUSnBw035202a+100+BSW20141103 Parallels Surpasses One Million Deployed Virtual Containers] |
| | * 2014-08-01, ADMIN magazine: [http://www.admin-magazine.com/Archive/2014/22/Save-and-Restore-Linux-Processes-with-CRIU Save and Restore Linux Processes with CRIU] | | * 2014-08-01, ADMIN magazine: [http://www.admin-magazine.com/Archive/2014/22/Save-and-Restore-Linux-Processes-with-CRIU Save and Restore Linux Processes with CRIU] |