Re: [Qemu-devel] [PATCH 3/3] block: Catch !bs->drv in bdrv

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-devel] [PATCH 3/3] block: Catch !bs->drv in bdrv_check()

From:	Max Reitz
Subject:	Re: [Qemu-devel] [PATCH 3/3] block: Catch !bs->drv in bdrv_check()
Date:	Sat, 09 Aug 2014 00:53:18 +0200
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.0

On 08.08.2014 23:11, Max Reitz wrote:

On 08.08.2014 11:15, Kevin Wolf wrote:
Am 07.08.2014 um 22:47 hat Max Reitz geschrieben:
qemu-img check calls bdrv_check() twice if the first run repaired some
inconsistencies. If the first run however again triggered corruption
prevention (on qcow2) due to very bad inconsistencies, bs->drv may be
NULL afterwards. Thus, bdrv_check() should check whether bs->drv isset.
Signed-off-by: Max Reitz <address@hidden>
I suppose there was a real case of this happening? I think bdrv_check()
triggering corruption prevention is a rather bad sign. The most
important point for image repair should be that it doesn't make the
situation any worse. Smells like a follow-up patch to the qcow2 code.
Yes, as I wrote in the cover letter, using the image provided inhttps://bugs.launchpad.net/qemu/+bug/1353456 and setting the refblockoffset to 0 (the reftable entry) results in a segmentation fault.
A simple way to trigger corruption during bdrv_check() is creating animage, setting the first (and only) reftable entry to 0 and runningqemu-img check -r all. bdrv_check() will try to allocate a refblock,but since the first clusters are unallocated, it will allocate themthere which would obviously overwrite the image header and/or L1 tableand/or reftable.
The only way I can imagine to fix this is to completely disregard theon-disk refcount information during bdrv_check() and instead only usethe calculated refcounts. This would require own allocation functionswhich may probably be rather simple, but in any case we'd need towrite them.
I think I should have some time, so I'll have a look into it.

Okay, after thinking about the situation (which involved looking throughthe other bug reports by Maria), I think there is only one way to trulydo the repair operation correctly. The general problem is that a damagedrefcount structure may lead to a new reftable or new refblocks beingallocated during the repair process. However, since the refcounts arenot accurate, these new clusters may collide with existing allocations.We could fix this by replicating all the refcount operations forin-memory refcounts (which qcow2_check_refcounts() creates), but I thinkthis to be a rather bad idea.

Instead, I'd rather create completely new refcount structures inqcow2_check_refcounts() when so much as a single referenced cluster withrefcount=0 is encountered. If there is any cluster which is indeedreferenced but for which the refcount structures say it's free, any newallocation may break things. Since changing refcounts may result in newcluster allocations, we should not update the existing refcountstructures at all.

Alternatively, we can rewrite the refcount update functions to take anin-memory refcount table to know which clusters to avoid, butconsidering that those functions are complicated enough already, I'drather refrain from that.

Max

[Prev in Thread]

Current Thread

[Next in Thread]

[Qemu-devel] [PATCH 0/3] qcow2: Prevent corruption-related crashes, Max Reitz, 2014/08/07
- [Qemu-devel] [PATCH 1/3] qcow2: Catch !*host_offset for data allocation, Max Reitz, 2014/08/07
- [Qemu-devel] [PATCH 2/3] iotests: Add test for image header overlap, Max Reitz, 2014/08/07
- [Qemu-devel] [PATCH 3/3] block: Catch !bs->drv in bdrv_check(), Max Reitz, 2014/08/07
  - Re: [Qemu-devel] [PATCH 3/3] block: Catch !bs->drv in bdrv_check(), Kevin Wolf, 2014/08/08
    - Re: [Qemu-devel] [PATCH 3/3] block: Catch !bs->drv in bdrv_check(), Max Reitz, 2014/08/08
    - Re: [Qemu-devel] [PATCH 3/3] block: Catch !bs->drv in bdrv_check(), Max Reitz <=
- Re: [Qemu-devel] [PATCH 0/3] qcow2: Prevent corruption-related crashes, Eric Blake, 2014/08/07
- Re: [Qemu-devel] [PATCH 0/3] qcow2: Prevent corruption-related crashes, Kevin Wolf, 2014/08/08

Prev by Date: [Qemu-devel] [Bug 1354529] Re: qemu-io: Assert failure on the fuzzed qcow2 image
Next by Date: [Qemu-devel] [PATCH] qemu-nbd: NULL nbd export pointer dereference after kill (TERMINATE)
Previous by thread: Re: [Qemu-devel] [PATCH 3/3] block: Catch !bs->drv in bdrv_check()
Next by thread: Re: [Qemu-devel] [PATCH 0/3] qcow2: Prevent corruption-related crashes
Index(es):
- Date
- Thread