Difference between revisions of "User namespace"

From CRIU
m (Kir moved page UserNamespace to User namespace: fix title according to mediawiki standards)
 
(9 intermediate revisions by 3 users not shown)

Latest revision as of 15:48, 6 April 2015

Problems

Implementing user namespace support in CRIU requires a few changes on the kernel side. First of all, when a new user namespace is created, tasks inside it no longer hold capabilities relative to the initial user namespace, so any kernel path guarded by capable() may fail. In particular, we found that the following things do not work when we restore tasks running in their own user namespace:

  • mknod() requires CAP_MKNOD
  • Lots of IPCNS and UTSNS sysctls require CAP_SYS_ADMIN

The list will be updated over time. To resolve these problems we need to address each issue and modify the kernel. For the first bullet, a patch is already circulating on the kernel mailing list.

Usernsd

This daemon solves most of the problems we've found. Since many restricted operations involve a file descriptor, we can ask a privileged process to perform the operation on our behalf and exchange the relevant FD with it.
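
The FD exchange described above can be illustrated with a minimal sketch. It is not CRIU's usernsd code; it only assumes that the two processes share a Unix domain socket and pass descriptors as SCM_RIGHTS ancillary data. The names send_fd, recv_fd and helper_open are hypothetical and chosen for illustration.

```python
import array
import os
import socket


def send_fd(sock: socket.socket, fd: int) -> None:
    """Send one open file descriptor over a Unix socket via SCM_RIGHTS."""
    fds = array.array("i", [fd])
    # At least one byte of ordinary data must accompany the ancillary message.
    sock.sendmsg([b"F"], [(socket.SOL_SOCKET, socket.SCM_RIGHTS, fds)])


def recv_fd(sock: socket.socket) -> int:
    """Receive one file descriptor that was sent with send_fd()."""
    fds = array.array("i")
    _, ancdata, _, _ = sock.recvmsg(1, socket.CMSG_LEN(fds.itemsize))
    for level, ctype, data in ancdata:
        if level == socket.SOL_SOCKET and ctype == socket.SCM_RIGHTS:
            # Ignore any truncated trailing bytes in the control message.
            usable = len(data) - (len(data) % fds.itemsize)
            fds.frombytes(data[:usable])
            return fds[0]
    raise RuntimeError("no file descriptor received")


def helper_open(sock: socket.socket, path: str) -> None:
    """Hypothetical privileged side: open a path and hand the FD back."""
    fd = os.open(path, os.O_RDONLY)
    try:
        send_fd(sock, fd)
    finally:
        # The receiver gets its own reference, so ours can be closed.
        os.close(fd)
```

In this sketch the unprivileged task would call recv_fd() on its end of the socket and use the returned descriptor directly. The same channel works in the other direction: the task can send an FD it already owns (a socket, for instance) so that the privileged side performs the restricted call on it.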