Difference between revisions of "Articles"

From CRIU
Jump to navigation Jump to search
m (DynaCut: A Framework for Dynamic and Adaptive Program Customization)
m
 
(38 intermediate revisions by 2 users not shown)
Line 9: Line 9:
 
-->
 
-->
 
</noinclude>
 
</noinclude>
 +
* 2025-11-17, [https://radostin.io/files/stoyanov-canopie-hpc-2025.pdf Engine-Agnostic Model Hot-Swapping for Cost-Effective LLM Inference]
 +
* 2025-08-13, [https://www.usenix.org/conference/usenixsecurity25/presentation/li-ao Software Availability Protection in Cyber-Physical Systems]
 +
* 2025-07-14, [https://arxiv.org/abs/2405.12079 PhoenixOS: Concurrent OS-level GPU Checkpoint and Restore with Validated Speculation]
 +
* 2025-06-18, [https://is.muni.cz/th/ekn8q/ Improving Checkpoint/Restore Functionality in Kubernetes]
 +
* 2025-06-17, LWN.net: [https://lwn.net/Articles/1024747/ A parallel path for GPU restore in CRIU]
 +
* 2025-06-13, [https://radostin.io/files/vspisakova-jsspp25.pdf Kubernetes Scheduling with Checkpoint/Restore: Challenges and Open Problems]
 +
<!------------------------------------------------
 +
  This is to cut the rest of it for Main Page,
 +
  adding the More... link instead.
 +
  Make sure to move this whole block up from time to time.
 +
-->
 +
<includeonly>: '''[[Articles|More external articles...]]'''</includeonly><noinclude>
 +
<!--
 +
    the below stuff is now shown on the Main Page
 +
-------------------------------------------------->
 +
* 2025-06-12, [https://doi.org/10.1109/TNSM.2025.3579051 MOSE: A Novel Orchestration Framework for Stateful Microservice Migration at the Edge]
 +
* 2025-05-14, [https://doi.org/10.1145/3672608.3707723 Elastic Vertical Memory Management for Container-based Stateful Applications in Kubernetes]
 +
* 2025-05-02, [https://doi.org/10.1007/s42514-025-00227-0 Practice and Observation: Live Migration for MPI Workload]
 +
* 2025-04-28, [https://www.usenix.org/conference/nsdi25/presentation/segarra GRANNY: Granular Management of Compute-Intensive Applications in the Cloud]
 +
* 2025-04-28, [https://ieeexplore.ieee.org/abstract/document/10979504 KubeSPT: Stateful Pod Teleportation for Service Resilience with Live Migration]
 +
* 2025-04-04, [https://doi.org/10.1109/ICIN64016.2025.10942720 Optimizing Stateful Microservice Migration in Kubernetes with MS2M and Forensic Checkpointing]
 +
* 2025-03-30, [https://doi.org/10.1145/3676641.3715988 CXLfork: Fast Remote Fork over CXL Fabrics]
 +
* 2025-02-23, [https://arxiv.org/abs/2502.16631 CRIUgpu: Transparent Checkpointing of GPU-Accelerated Workloads]
 +
* 2025-02-05, [https://doi.org/10.1007/s00607-025-01447-6 A Comprehensive Performance Evaluation of Container Migration Strategies]
 +
* 2024-11-21, [https://dl.acm.org/doi/10.1145/3698038.3698510 On-demand and Parallel Checkpoint/Restore for GPU Applications]
 +
* 2024-11-20, [https://dl.acm.org/doi/10.1145/3698038.3698513 Snapipeline: Accelerating Snapshot Startup for FaaS Containers]
 
* 2024-09-06, [https://dl.acm.org/doi/10.1145/3660319.3660330 Live Migration of Multi-Container Kubernetes Pods in Multi-Cluster Serverless Edge Systems]
 
* 2024-09-06, [https://dl.acm.org/doi/10.1145/3660319.3660330 Live Migration of Multi-Container Kubernetes Pods in Multi-Cluster Serverless Edge Systems]
 
* 2024-09-04, [https://dl.acm.org/doi/10.1145/3678015.3680477 Towards Efficient End-to-End Encryption for Container Checkpointing Systems]
 
* 2024-09-04, [https://dl.acm.org/doi/10.1145/3678015.3680477 Towards Efficient End-to-End Encryption for Container Checkpointing Systems]
 +
* 2024-09-02, [https://doi.org/10.1016/j.future.2024.107495 CSMD: Container state management for deployment in cloud data centers]
 +
* 2024-08-21, [https://ieeexplore.ieee.org/abstract/document/10628207 The State of Container Checkpointing with CRIU: A Multi-Case Experience Report]
 
* 2024-08-04, [https://dl.acm.org/doi/abs/10.1145/3672197.3673432 Custom Page Fault Handling With eBPF]
 
* 2024-08-04, [https://dl.acm.org/doi/abs/10.1145/3672197.3673432 Custom Page Fault Handling With eBPF]
 
* 2024-08-03, [https://dl.acm.org/doi/10.1145/3663408.3663416 Software-based Live Migration for Containerized RDMA]
 
* 2024-08-03, [https://dl.acm.org/doi/10.1145/3663408.3663416 Software-based Live Migration for Containerized RDMA]
Line 18: Line 46:
 
* 2024-07-23, [https://ieeexplore.ieee.org/abstract/document/10631042 Dapper: A Lightweight and Extensible Framework for Live Program State Rewriting]
 
* 2024-07-23, [https://ieeexplore.ieee.org/abstract/document/10631042 Dapper: A Lightweight and Extensible Framework for Live Program State Rewriting]
 
* 2024-07-07, [https://ieeexplore.ieee.org/abstract/document/10643902 FastMig: Leveraging FastFreeze to Establish Robust Service Liquidity in Cloud 2.0]
 
* 2024-07-07, [https://ieeexplore.ieee.org/abstract/document/10643902 FastMig: Leveraging FastFreeze to Establish Robust Service Liquidity in Cloud 2.0]
<!------------------------------------------------
+
* 2024-07-02, [https://developer.nvidia.com/blog/checkpointing-cuda-applications-with-criu/ Checkpointing CUDA Applications with CRIU]
  This is to cut the rest of it for Main Page,
+
* 2024-06-19, [https://arxiv.org/abs/2406.13856 Kishu: Time-Traveling for Computational Notebooks]
  adding the More... link instead.
+
* 2024-06-09, [https://dl.acm.org/doi/abs/10.1145/3626246.3654752 Demonstration of ElasticNotebook: Migrating Live Computational Notebook States]
  Make sure to move this whole block up from time to time.
+
* 2024-05-30, [https://is.muni.cz/th/tadf0/phd-thesis-proposal-digital.pdf In the Container Era: A Coup in Reliable Computing Over Unreliable Infrastructure]
-->
+
* 2024-05-20, [https://arxiv.org/abs/2405.12079v1 ParallelGPUOS: A Concurrent OS-level GPU Checkpoint and Restore System using Validated Speculation]
<includeonly>: '''[[Articles|More external articles...]]'''</includeonly><noinclude>
 
<!--
 
    the below stuff is now shown on the Main Page
 
-------------------------------------------------->
 
* 2024-05-20, [https://arxiv.org/abs/2405.12079 ParallelGPUOS: A Concurrent OS-level GPU Checkpoint and Restore System using Validated Speculation]
 
 
* 2024-05-09, [https://www.sciencedirect.com/science/article/pii/S1383762124000948 Practicable live container migrations in high performance computing clouds: Diskless, iterative, and connection-persistent]
 
* 2024-05-09, [https://www.sciencedirect.com/science/article/pii/S1383762124000948 Practicable live container migrations in high performance computing clouds: Diskless, iterative, and connection-persistent]
 
* 2024-05-06, [https://ieeexplore.ieee.org/abstract/document/10701375 Workload-Aware Live Migratable Cloud Instance Detector]
 
* 2024-05-06, [https://ieeexplore.ieee.org/abstract/document/10701375 Workload-Aware Live Migratable Cloud Instance Detector]
 
* 2024-05-06, [https://ieeexplore.ieee.org/abstract/document/10707218 Migration of Isolated Application Across Heterogeneous Edge Systems]
 
* 2024-05-06, [https://ieeexplore.ieee.org/abstract/document/10707218 Migration of Isolated Application Across Heterogeneous Edge Systems]
 +
* 2024-04-26, [https://fis.tu-dresden.de/portal/files/53673228/planeta_bearb_pref2b_20240912193924.pdf Fine-grained OS Control over High-performance Networking]
 
* 2024-04-22, [https://dl.acm.org/doi/abs/10.1145/3627703.3650085 Just-In-Time Checkpointing: Low Cost Error Recovery from Deep Learning Training Failures]
 
* 2024-04-22, [https://dl.acm.org/doi/abs/10.1145/3627703.3650085 Just-In-Time Checkpointing: Low Cost Error Recovery from Deep Learning Training Failures]
* 2024-04-22, [https://www.dpss.inesc-id.pt/~rbruno/papers/skohli-eurosys24.pdf Pronghorn: Effective Checkpoint Orchestration for Serverless Hot-Starts]
+
* 2024-04-22, [https://dl.acm.org/doi/10.1145/3627703.3629556 Pronghorn: Effective Checkpoint Orchestration for Serverless Hot-Starts]
 +
* 2024-02-09, [https://ejournal.unitomo.ac.id/index.php/inform/article/view/7498/3738 Forensic Analysis of Podman Container Towards Metasploit Backdoor Using Checkpointctl]
 
* 2024-01-29, [https://www.sciencedirect.com/science/article/pii/S0167739X24000190 Prebaking runtime environments to improve the FaaS cold start latency]
 
* 2024-01-29, [https://www.sciencedirect.com/science/article/pii/S0167739X24000190 Prebaking runtime environments to improve the FaaS cold start latency]
 
* 2023-11-27, [https://dl.acm.org/doi/abs/10.1145/3590140.3629121 DynaCut: A Framework for Dynamic and Adaptive Program Customization]
 
* 2023-11-27, [https://dl.acm.org/doi/abs/10.1145/3590140.3629121 DynaCut: A Framework for Dynamic and Adaptive Program Customization]
Line 38: Line 63:
 
* 2023-11-10, [https://ieeexplore.ieee.org/abstract/document/10314806 Design, Modeling, and Implementation of Robust Migration of Stateful Edge Microservices]
 
* 2023-11-10, [https://ieeexplore.ieee.org/abstract/document/10314806 Design, Modeling, and Implementation of Robust Migration of Stateful Edge Microservices]
 
* 2023-10-23, [https://dl.acm.org/doi/10.1145/3605181.3626289 Evicting for the greater good: The Case for Reactive Checkpointing in Serverless Computing]
 
* 2023-10-23, [https://dl.acm.org/doi/10.1145/3605181.3626289 Evicting for the greater good: The Case for Reactive Checkpointing in Serverless Computing]
 +
* 2023-10-01, [https://dl.acm.org/doi/10.14778/3626292.3626296 ElasticNotebook: Enabling Live Migration for Computational Notebooks]
 
* 2023-09-25, [https://ieeexplore.ieee.org/abstract/document/10419298 Transparent Fault Tolerance for Stateful Applications in Kubernetes with Checkpoint/Restore]
 
* 2023-09-25, [https://ieeexplore.ieee.org/abstract/document/10419298 Transparent Fault Tolerance for Stateful Applications in Kubernetes with Checkpoint/Restore]
 
* 2023-07-21, [https://vtechworks.lib.vt.edu/items/20cd28e6-1dba-4c21-b221-59f5f345205f CRIU-RTX: Remote Thread eXecution using Checkpoint/Restore in Userspace]
 
* 2023-07-21, [https://vtechworks.lib.vt.edu/items/20cd28e6-1dba-4c21-b221-59f5f345205f CRIU-RTX: Remote Thread eXecution using Checkpoint/Restore in Userspace]
 +
* 2023-07-10, [https://www.usenix.org/conference/osdi23/presentation/wei-rdma No Provisioned Concurrency: Fast RDMA-codesigned Remote Fork for Serverless Computing]
 
* 2023-07-06, [https://ieeexplore.ieee.org/abstract/document/10207336 Microservice Debugging with Checkpoint-Restart]
 
* 2023-07-06, [https://ieeexplore.ieee.org/abstract/document/10207336 Microservice Debugging with Checkpoint-Restart]
 
* 2023-05-28, [https://ieeexplore.ieee.org/abstract/document/10278877 Processing-Aware Migration Model for Stateful Edge Microservices]
 
* 2023-05-28, [https://ieeexplore.ieee.org/abstract/document/10278877 Processing-Aware Migration Model for Stateful Edge Microservices]
 
* 2023-04-20, [https://www.mdpi.com/2504-446X/7/5/286 A Dynamic Checkpoint Interval Decision Algorithm for Live Migration-Based Drone-Recovery System]
 
* 2023-04-20, [https://www.mdpi.com/2504-446X/7/5/286 A Dynamic Checkpoint Interval Decision Algorithm for Live Migration-Based Drone-Recovery System]
 
* 2023-03-10, [https://kubernetes.io/blog/2023/03/10/forensic-container-analysis/ Forensic Container Analysis]
 
* 2023-03-10, [https://kubernetes.io/blog/2023/03/10/forensic-container-analysis/ Forensic Container Analysis]
 +
* 2023-01-31, [https://vtechworks.lib.vt.edu/items/ba974ad9-eac9-4306-b3fc-5f0411b89b99 HetMigrate: Secure and Efficient Cross-architecture Process Live Migration]
 
* 2023-01-14, [https://arxiv.org/abs/2301.05861 Async-fork: Mitigating Query Latency Spikes Incurred by the Fork-based Snapshot Mechanism from the OS Level]
 
* 2023-01-14, [https://arxiv.org/abs/2301.05861 Async-fork: Mitigating Query Latency Spikes Incurred by the Fork-based Snapshot Mechanism from the OS Level]
 
* 2023-01-10, [https://ieeexplore.ieee.org/abstract/document/10077919 A Container Pre-copy Migration Method Based on Dirty Page Prediction and Compression]
 
* 2023-01-10, [https://ieeexplore.ieee.org/abstract/document/10077919 A Container Pre-copy Migration Method Based on Dirty Page Prediction and Compression]
* 2022-12-05, [https://kubernetes.io/blog/2022/12/05/forensic-container-checkpointing-alpha/ Forensic container checkpointing in Kubernetes]
 
 
* 2022-11-13, [https://dl.acm.org/doi/abs/10.5555/3571885.3572000 Out of hypervisor (OoH): efficient dirty page tracking in userspace using hardware virtualization feature]
 
* 2022-11-13, [https://dl.acm.org/doi/abs/10.5555/3571885.3572000 Out of hypervisor (OoH): efficient dirty page tracking in userspace using hardware virtualization feature]
 
* 2022-08-07, [https://www.sciencedirect.com/science/article/pii/S1084804522001369 iContainer: Consecutive Checkpointing with Rapid Resilience for Immortal Container-based Services]
 
* 2022-08-07, [https://www.sciencedirect.com/science/article/pii/S1084804522001369 iContainer: Consecutive Checkpointing with Rapid Resilience for Immortal Container-based Services]
Line 52: Line 79:
 
* 2022-07-11, [https://www.usenix.org/conference/atc22/presentation/zhou-diyu RRC: Responsive Replicated Containers]
 
* 2022-07-11, [https://www.usenix.org/conference/atc22/presentation/zhou-diyu RRC: Responsive Replicated Containers]
 
* 2022-05-25, [https://hal.inria.fr/hal-03587358/ Good Shepherds Care For Their Cattle: Seamless Pod Migration in Geo-Distributed Kubernetes]
 
* 2022-05-25, [https://hal.inria.fr/hal-03587358/ Good Shepherds Care For Their Cattle: Seamless Pod Migration in Geo-Distributed Kubernetes]
 +
* 2022-05-06, [https://doi.org/10.1145/3477314.3507221 An architecture proposal for checkpoint/restore on stateful containers]
 
* 2022-04-24, [https://www.ndss-symposium.org/ndss-paper/auto-draft-295/ FitM: Binary-Only Coverage-Guided Fuzzing for Stateful Network Protocols]
 
* 2022-04-24, [https://www.ndss-symposium.org/ndss-paper/auto-draft-295/ FitM: Binary-Only Coverage-Guided Fuzzing for Stateful Network Protocols]
 +
* 2022-03-01, [https://systex22.github.io/papers/systex22-final71.pdf Transparent, Cross-ISA Enclave Offloading]
 
* 2022-02-25, [https://dl.acm.org/doi/abs/10.1145/3516807.3516817 Portkey: Hypervisor-Assisted Container Migration in Nested Cloud Environments]
 
* 2022-02-25, [https://dl.acm.org/doi/abs/10.1145/3516807.3516817 Portkey: Hypervisor-Assisted Container Migration in Nested Cloud Environments]
 
* 2022-02-16, [https://arxiv.org/abs/2202.07848 Singularity: Planet-Scale, Preemptible and Elastic Scheduling of AI Workloads]
 
* 2022-02-16, [https://arxiv.org/abs/2202.07848 Singularity: Planet-Scale, Preemptible and Elastic Scheduling of AI Workloads]
 +
* 2022-02-08, [https://doi.org/10.48550/arXiv.2202.03643 SNPSFuzzer: A Fast Greybox Fuzzer for Stateful Network Protocols using Snapshots]
 
* 2021-12-17, [https://hal.archives-ouvertes.fr/hal-03487607/document Standard-compliant parallel SystemC simulation of loosely-timed transaction level models: From baremetal to Linux-based applications support]
 
* 2021-12-17, [https://hal.archives-ouvertes.fr/hal-03487607/document Standard-compliant parallel SystemC simulation of loosely-timed transaction level models: From baremetal to Linux-based applications support]
* 2021-08-13, [https://doi.org/10.11591/eei.v10i2.2742 Live migration using checkpoint and restore in userspace (CRIU): Usage analysis of network, memory and CPU]
 
 
* 2021-07-14, [https://www.usenix.org/conference/atc21/presentation/planeta MigrOS: Transparent Live-Migration Support for Containerised RDMA Applications]
 
* 2021-07-14, [https://www.usenix.org/conference/atc21/presentation/planeta MigrOS: Transparent Live-Migration Support for Containerised RDMA Applications]
 
* 2021-07-06, [https://onlinelibrary.wiley.com/doi/10.1002/cpe.6474 Cricket: A virtualization layer for distributed execution of CUDA applications with checkpoint/restart support]
 
* 2021-07-06, [https://onlinelibrary.wiley.com/doi/10.1002/cpe.6474 Cricket: A virtualization layer for distributed execution of CUDA applications with checkpoint/restart support]
Line 89: Line 118:
 
* 2018-10-13, [https://dl.acm.org/citation.cfm?id=3290626 Linux Process Tree Reconstruction Using The Attributed Grammar-Based Tree Transformation Model]
 
* 2018-10-13, [https://dl.acm.org/citation.cfm?id=3290626 Linux Process Tree Reconstruction Using The Attributed Grammar-Based Tree Transformation Model]
 
* 2018-10-10, [https://podman.io/blogs/2018/10/10/checkpoint-restore.html Adding checkpoint/restore support to Podman]
 
* 2018-10-10, [https://podman.io/blogs/2018/10/10/checkpoint-restore.html Adding checkpoint/restore support to Podman]
 +
* 2018-10-08, [https://www.usenix.org/conference/osdi18/presentation/xiao Gandiva: Introspective Cluster Scheduling for Deep Learning]
 
* 2018-09-15, [https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=8539562 Stateful Container Migration employing Checkpoint-based Restoration for Orchestrated Container Clusters]
 
* 2018-09-15, [https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=8539562 Stateful Container Migration employing Checkpoint-based Restoration for Orchestrated Container Clusters]
 
* 2018-09-07, [https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=8502659 Container Live Migration for Latency Critical Industrial Applications on Edge Computing]
 
* 2018-09-07, [https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=8502659 Container Live Migration for Latency Critical Industrial Applications on Edge Computing]
 
* 2018-08-15, University of Maryland: [https://drum.lib.umd.edu/bitstream/handle/1903/20499/CS-TR-5056.pdf Fast and Service-preserving Recovery from Malware Infections Using CRIU]
 
* 2018-08-15, University of Maryland: [https://drum.lib.umd.edu/bitstream/handle/1903/20499/CS-TR-5056.pdf Fast and Service-preserving Recovery from Malware Infections Using CRIU]
 
* 2018-07-31, [https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6131214/ Hot-starting software containers for STAR aligner]
 
* 2018-07-31, [https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6131214/ Hot-starting software containers for STAR aligner]
* 2018-07-07, Moscow Institute of Physics and Technology: [https://pdfs.semanticscholar.org/9ac4/f8ab4fd0492bfdc503831f60a5ce3d1d50a5.pdf?_ga=2.17262585.1140385641.1554239661-2109847679.1554239661 Using CRIU with HPC Containers: Field Experience]
 
 
* 2018-06-28, University of Aberdeen: [https://link.springer.com/chapter/10.1007/978-3-030-02465-9_13 Efficient Live Migration of Linux Containers]
 
* 2018-06-28, University of Aberdeen: [https://link.springer.com/chapter/10.1007/978-3-030-02465-9_13 Efficient Live Migration of Linux Containers]
 
* 2018-03-24, [https://www.smitechow.com/2018/03/compile-criu-on-centos-6.html Compile CRIU on CentOS 6]
 
* 2018-03-24, [https://www.smitechow.com/2018/03/compile-criu-on-centos-6.html Compile CRIU on CentOS 6]
Line 119: Line 148:
 
* 2015-04-22, TuxDiary [http://tuxdiary.com/2015/04/22/dump-debug-resume-process-criu/ Dump, debug, resume process with criu]
 
* 2015-04-22, TuxDiary [http://tuxdiary.com/2015/04/22/dump-debug-resume-process-criu/ Dump, debug, resume process with criu]
 
* 2014-12-12, Symposium on Information and Communication Systems (SInCom 2014) [https://lisas.de/~adrian/proceedingsSInCom2014.pdf Checkpoint/Restore in User-Space with Open MPI]
 
* 2014-12-12, Symposium on Information and Communication Systems (SInCom 2014) [https://lisas.de/~adrian/proceedingsSInCom2014.pdf Checkpoint/Restore in User-Space with Open MPI]
 +
* 2014-11-03, [https://dl.acm.org/doi/10.1145/2660267.2660329 From Patches to Honey-Patches: Lightweight Attacker Misdirection, Deception, and Disinformation]
 
* 2014-09-31, [http://www.reuters.com/article/wa-parallels-idUSnBw035202a+100+BSW20141103 Parallels Surpasses One Million Deployed Virtual Containers]
 
* 2014-09-31, [http://www.reuters.com/article/wa-parallels-idUSnBw035202a+100+BSW20141103 Parallels Surpasses One Million Deployed Virtual Containers]
 
* 2014-08-01, ADMIN magazine: [http://www.admin-magazine.com/Archive/2014/22/Save-and-Restore-Linux-Processes-with-CRIU Save and Restore Linux Processes with CRIU]
 
* 2014-08-01, ADMIN magazine: [http://www.admin-magazine.com/Archive/2014/22/Save-and-Restore-Linux-Processes-with-CRIU Save and Restore Linux Processes with CRIU]

Latest revision as of 21:46, 13 November 2025

This is a collection of external articles regarding the CRIU project, sorted by date.

In Russian[edit]