qemu-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH] virtiofsd: Show submounts


From: Vivek Goyal
Subject: Re: [PATCH] virtiofsd: Show submounts
Date: Thu, 30 Apr 2020 11:41:05 -0400

On Thu, Apr 30, 2020 at 03:47:04PM +0100, Daniel P. Berrangé wrote:
> On Thu, Apr 30, 2020 at 10:41:16AM -0400, Vivek Goyal wrote:
> > On Thu, Apr 30, 2020 at 03:34:25PM +0100, Daniel P. Berrangé wrote:
> > > On Thu, Apr 30, 2020 at 03:20:13PM +0100, Dr. David Alan Gilbert wrote:
> > > > * Vivek Goyal (address@hidden) wrote:
> > > > > On Thu, Apr 30, 2020 at 09:58:12AM +0100, Dr. David Alan Gilbert 
> > > > > wrote:
> > > > > [..]
> > > > > > > > Even without this patch, the SLAVE stuff worked so if you start 
> > > > > > > > the
> > > > > > > > daemon and *then* mount under the shared directory, the guest 
> > > > > > > > sees it
> > > > > > > > with or without this patch.
> > > > > > > 
> > > > > > > Hm, I don’t.  Do you really?
> > > > > > 
> > > > > > Yes! With your patch reverted:
> > > > > > 
> > > > > > Start virtiofsd, mount in the guest:
> > > > > > 
> > > > > > host:
> > > > > > # ./virtiofsd --socket-path=/tmp/vhostqemu -o 
> > > > > > source=/home/dgilbert/virtio-fs/fs  -o log_level=warn -o 
> > > > > > no_writeback
> > > > > > 
> > > > > > guest:
> > > > > > # mount -t virtiofs myfs /sysroot
> > > > > > 
> > > > > > host:
> > > > > > # findmnt -o +PROPAGATION -N 6100
> > > > > > TARGET SOURCE                                                       
> > > > > >        FSTYPE OPTIONS                                               
> > > > > >        PROPAGATION
> > > > > > /      
> > > > > > /dev/mapper/fedora_dgilbert--t580-root[/home/dgilbert/virtio-fs/fs] 
> > > > > > xfs    rw,relatime,seclabel,attr2,inode64,logbufs=8,logbsize=32k,no 
> > > > > > private,slave
> > > > > > # mount -t tmpfs /dev/null /home/dgilbert/virtio-fs/fs/tmp
> > > > > > # findmnt -o +PROPAGATION -N 6100
> > > > > > TARGET SOURCE                                                       
> > > > > >        FSTYPE OPTIONS                                               
> > > > > >        PROPAGATION
> > > > > > /      
> > > > > > /dev/mapper/fedora_dgilbert--t580-root[/home/dgilbert/virtio-fs/fs] 
> > > > > > xfs    rw,relatime,seclabel,attr2,inode64,logbufs=8,logbsize=32k,no 
> > > > > > private,slave
> > > > > > └─/tmp /dev/null                                                    
> > > > > >        tmpfs  rw,relatime,seclabel                                  
> > > > > >        private,slave
> > > > > 
> > > > > Why is it showing a mount point at "/tmp". If mount point propagated, 
> > > > > then
> > > > > inside guest we should see a mount point at /sysroot/tmp?
> > > > 
> > > > That findmnt is on the host.
> > > > 
> > > > > So there are two things.
> > > > > 
> > > > > A. Propagation of mount from host to virtiofsd.
> > > > > B. Visibility of that mount inside guest over fuse protocol (submount
> > > > >   functionality).
> > > > > 
> > > > > I think A works for me without any patches. But don't think B is 
> > > > > working
> > > > > for me. I don't see the submount inside guest. 
> > > > > 
> > > > > > # touch /home/dgilbert/virtio-fs/fs/tmp/hello
> > > > > > 
> > > > > > guest:
> > > > > > # ls -l /sysroot/tmp
> > > > > > total 0
> > > > > > -rw-r--r-- 1 root root 0 Apr 30 08:50 hello
> > > > > 
> > > > > Do a "findmnt /sysroot/tmp" inside guest and see what do you see.
> > > > > 
> > > > > You will be able to see "hello" as long as virtiofsd sees the new
> > > > > mount point, I think. And guest does not have to see that mount point
> > > > > for this simple test to work.
> > > > 
> > > > Right, the guest just sees:
> > > > 
> > > > `-/sysroot                            myfs       virtiof rw,relatime
> > > 
> > > That is a good thing surely ? If I'm exporting "/sysroot" from the host,
> > > I want the content in "/sysroot/some/sub/mount" to be visible to the
> > > guest, but I don't want the guest to see "/sysroot/some/sub/mount"
> > > as an actual mount point. That would be leaking information about the
> > > host storage setup into the guest. The host admin should be free to
> > > re-arrange submounts in the host OS, to bring more storage space online,
> > > and have this be transparent to the guest OS.
> > 
> > If we don't see mount inside guest, we run into the possibility of inode
> > number collision. On host two files in shared dir can have same inode
> > number (if they are on two different filesystem with different device
> > numbers). But inside guest, we will show device number of virtiofs,
> > and it will look as if two files in this filesystem have same inode
> > number, breaking some workloads.
> > 
> > By propagating mounts (submounts), we can assign a unique device number
> > to these submounts and hence <dev,inode> number pair will become unique.
> 
> Ah, yes, that's true.  In 9pfs there was recent changes precisely
> because of this clash possibility:
> 
> commit 1a6ed33cc56997479bbe5b48337ff8da44585bd4
> Author: Antonios Motakis <address@hidden>
> Date:   Thu Oct 10 11:36:05 2019 +0200
> 
>     9p: Added virtfs option 'multidevs=remap|forbid|warn'
>     
>     'warn' (default): Only log an error message (once) on host if more than 
> one
>     device is shared by same export, except of that just ignore this config
>     error though. This is the default behaviour for not breaking existing
>     installations implying that they really know what they are doing.
>     
>     'forbid': Like 'warn', but except of just logging an error this
>     also denies access of guest to additional devices.
>     
>     'remap': Allows to share more than one device per export by remapping
>     inodes from host to guest appropriately. To support multiple devices on 
> the
>     9p share, and avoid qid path collisions we take the device id as input to
>     generate a unique QID path. The lowest 48 bits of the path will be set
>     equal to the file inode, and the top bits will be uniquely assigned based
>     on the top 16 bits of the inode and the device id.
> 
> 
> Perhaps we should try to support the same options in virtio-fs. At least
> the "forbid" and "remap" options make sense I think. "warn" was only
> really there for backcompat.  If we can expose it to the guest, then a
> further "expose" option would be viable.

Providing options in virtiofsd daemon so that use can tweak these settings
makes sense.

Given we are already allowing accessing multiple device inside guest, we
also probably need a mode to continue to allow that so that not to break
existing setups.

"forbid", "remap" make sense. And I like the option "expose" too.

"remap" will require little thought that how do we do that. We briefly
discussed encoding device number in higher inode bits and Miklos was
not too keen on it. What if filesystem is using all 64bits of inode. We
also need to do encoding in such a way so that inode number is persistent.

Thanks
Vivek




reply via email to

[Prev in Thread] Current Thread [Next in Thread]