Re: [PATCH v3 06/10] qcow2-refcount: check_refcounts_l2(): check l2

qemu-devel

From:	Hanna Reitz
Subject:	Re: [PATCH v3 06/10] qcow2-refcount: check_refcounts_l2(): check l2_bitmap
Date:	Tue, 14 Sep 2021 13:46:14 +0200
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.11.0

On 14.09.21 13:22, Vladimir Sementsov-Ogievskiy wrote:

14.09.2021 11:54, Hanna Reitz wrote:
On 24.05.21 16:20, Vladimir Sementsov-Ogievskiy wrote:
Check subcluster bitmap of the l2 entry for different types of
clusters:

  - for compressed it must be zero
  - for allocated check consistency of two parts of the bitmap
  - for unallocated all subclusters should be unallocated
    (or zero-plain)

For unallocated clusters we can safely fix the entry by making it
zero-plain.
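
The three rules above can be sketched as a standalone check. This is a simplified model under stated assumptions, not the code from the patch: `bitmap_is_consistent()`, the `model_cluster_kind` enum, and the `MODEL_` masks are hypothetical names. It assumes the low 32 bits of the bitmap are per-subcluster allocation bits and the high 32 bits are per-subcluster reads-as-zero bits, and that having both bits set for the same subcluster is what "inconsistent" means for an allocated cluster.

```c
#include <stdbool.h>
#include <stdint.h>

/* Assumed layout of the 64-bit subcluster bitmap (extended L2 entries). */
#define MODEL_BITMAP_ALL_ALLOC  0x00000000ffffffffULL  /* bits 0-31  */
#define MODEL_BITMAP_ALL_ZEROES 0xffffffff00000000ULL  /* bits 32-63 */

/* Hypothetical, reduced set of cluster kinds for this illustration. */
enum model_cluster_kind {
    MODEL_COMPRESSED,
    MODEL_ALLOCATED,
    MODEL_UNALLOCATED,
};

/* Returns true if the bitmap is acceptable for the given cluster kind. */
static bool bitmap_is_consistent(enum model_cluster_kind kind, uint64_t bitmap)
{
    switch (kind) {
    case MODEL_COMPRESSED:
        /* Compressed clusters must not carry any subcluster bits. */
        return bitmap == 0;
    case MODEL_ALLOCATED:
        /* No subcluster may be both "allocated" and "reads as zero". */
        return ((bitmap >> 32) & bitmap) == 0;
    case MODEL_UNALLOCATED:
        /* Only reads-as-zero bits are allowed; no allocation bits. */
        return (bitmap & MODEL_BITMAP_ALL_ALLOC) == 0;
    }
    return false;
}
```

For example, `bitmap_is_consistent(MODEL_UNALLOCATED, 0x1)` is false in this model, which corresponds to the case the patch reports as a corruption.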

Signed-off-by: Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>
Reviewed-by: Eric Blake <eblake@redhat.com>
Tested-by: Kirill Tkhai <ktkhai@virtuozzo.com>
---
  block/qcow2-refcount.c | 30 +++++++++++++++++++++++++++++-
  1 file changed, 29 insertions(+), 1 deletion(-)

diff --git a/block/qcow2-refcount.c b/block/qcow2-refcount.c
index f48c5e1b5d..062ec48a15 100644
--- a/block/qcow2-refcount.c
+++ b/block/qcow2-refcount.c
@@ -1681,6 +1681,7 @@ static int check_refcounts_l2(BlockDriverState *bs, BdrvCheckResult *res,
          uint64_t coffset;
          int csize;
          l2_entry = get_l2_entry(s, l2_table, i);
+        uint64_t l2_bitmap = get_l2_bitmap(s, l2_table, i);
This is a declaration after a statement. (Easily fixable by moving the l2_entry declaration here, though. Or by putting the l2_bitmap declaration where l2_entry is declared.)
The latter seems nicer.
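
As an illustration of the preferred option, here is a minimal standalone sketch in which both variables are declared together at the top of the block and assigned afterwards. The helpers `read_entry()` and `read_bitmap()` and the two-words-per-entry table layout are hypothetical stand-ins for `get_l2_entry()` and `get_l2_bitmap()`, chosen only so that the example compiles on its own.

```c
#include <stdint.h>

/* Hypothetical stand-ins: each table entry is two 64-bit words. */
static uint64_t read_entry(const uint64_t *table, int i)
{
    return table[2 * i];
}

static uint64_t read_bitmap(const uint64_t *table, int i)
{
    return table[2 * i + 1];
}

/* Counts entries that are empty but still carry bitmap bits. */
static int count_empty_with_bitmap(const uint64_t *table, int n)
{
    int i;
    int count = 0;

    for (i = 0; i < n; i++) {
        /* Declarations first, next to each other ... */
        uint64_t l2_entry;
        uint64_t l2_bitmap;

        /* ... statements afterwards. */
        l2_entry = read_entry(table, i);
        l2_bitmap = read_bitmap(table, i);

        if (l2_entry == 0 && l2_bitmap != 0) {
            count++;
        }
    }
    return count;
}
```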
[...]
@@ -1800,6 +1815,19 @@ static int check_refcounts_l2(BlockDriverState *bs, BdrvCheckResult *res,
          case QCOW2_CLUSTER_ZERO_PLAIN:
          case QCOW2_CLUSTER_UNALLOCATED:
+            if (l2_bitmap & QCOW_L2_BITMAP_ALL_ALLOC) {
+                res->corruptions++;
+                fprintf(stderr, "%s: Unallocated "
+                        "cluster has non-zero subcluster allocation map\n",
+                        fix & BDRV_FIX_ERRORS ? "Repairing" : "ERROR");
+                if (fix & BDRV_FIX_ERRORS) {
+                    ret = fix_l2_entry_by_zero(bs, res, l2_offset, l2_table, i,
+                                               active, &metadata_overlap);
I believe this is indeed the correct repair method for QCOW2_CLUSTER_ZERO_PLAIN, but I’m not so sure for QCOW2_CLUSTER_UNALLOCATED. As far as I can tell, qcow2_get_subcluster_type() will return QCOW2_SUBCLUSTER_INVALID for this case, and so trying to read from such clusters will produce I/O errors. But still, shouldn’t we rather make such a cluster unallocated rather than zero then?
And as for QCOW2_CLUSTER_ZERO_PLAIN, I believe qcow2_get_cluster_type() will never return it when subclusters are enabled. So this repair path will never happen with a cluster type of ZERO_PLAIN, but only for UNALLOCATED.
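
The claim about ZERO_PLAIN can be made concrete with a simplified model of the cluster-type classification. This is a sketch, not a copy of `qcow2_get_cluster_type()`: the `MODEL_` names and `model_cluster_type()` are hypothetical, the flag positions are assumptions about the L2 entry layout, and external data files are ignored. The key assumption, taken from the statement above, is that the cluster-level zero flag is only consulted when subclusters are disabled.

```c
#include <stdbool.h>
#include <stdint.h>

/* Assumed L2 entry layout for this illustration. */
#define MODEL_OFLAG_COMPRESSED (1ULL << 62)
#define MODEL_OFLAG_ZERO       (1ULL << 0)
#define MODEL_OFFSET_MASK      0x00fffffffffffe00ULL

enum model_cluster_type {
    MODEL_TYPE_UNALLOCATED,
    MODEL_TYPE_ZERO_PLAIN,
    MODEL_TYPE_ZERO_ALLOC,
    MODEL_TYPE_NORMAL,
    MODEL_TYPE_COMPRESSED,
};

static enum model_cluster_type
model_cluster_type(uint64_t l2_entry, bool has_subclusters)
{
    if (l2_entry & MODEL_OFLAG_COMPRESSED) {
        return MODEL_TYPE_COMPRESSED;
    }
    /* The zero flag only matters without subclusters (assumption). */
    if ((l2_entry & MODEL_OFLAG_ZERO) && !has_subclusters) {
        return (l2_entry & MODEL_OFFSET_MASK) ? MODEL_TYPE_ZERO_ALLOC
                                              : MODEL_TYPE_ZERO_PLAIN;
    }
    /* No host offset: the cluster counts as unallocated. */
    if (!(l2_entry & MODEL_OFFSET_MASK)) {
        return MODEL_TYPE_UNALLOCATED;
    }
    return MODEL_TYPE_NORMAL;
}
```

In this model, no input with `has_subclusters == true` can produce `MODEL_TYPE_ZERO_PLAIN`, so a `case` label for it in a subcluster-aware check is reachable only through the label it shares with UNALLOCATED.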
Agree about ZERO_PLAIN, that it's impossible here.
But for UNALLOCATED, I'm not sure. If we make all wrongly "allocated" subclusters unallocated, the underlying backing layer will become available. Could that be considered a security violation?

I don’t think so, because the image has to be corrupted first, which I hope guests cannot trigger.

On the other hand, when a user has to fix format corruptions, nothing is guaranteed and the aim is to make data available as far as possible. So maybe making the wrong subclusters "unallocated" is the correct thing.

We could also consider refusing to repair this case for images that have backing files.

In any case, I don’t think we should force ourselves to make some cluster zero just because there’s no better choice. For example, we also don’t make unallocated data clusters zero, because it would just be wrong.

(Though technically there is no right or wrong here, because we just refuse to read from such clusters. Doing anything to the cluster would kind of be an improvement, whether it is making it zero or making it really unallocated... If there was any important data here, it’s lost anyway.)
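
To make the two repair options concrete, here is a hedged sketch of what each would do to the subcluster bitmap of an unallocated cluster, plus the "refuse when there is a backing file" idea. None of this is what the patch or QEMU implements: all `sketch_` names are hypothetical, the bitmap layout (low 32 bits allocation, high 32 bits reads-as-zero) is an assumption, and "make unallocated" is interpreted here as clearing only the stray allocation bits.

```c
#include <stdbool.h>
#include <stdint.h>

#define SKETCH_BITMAP_ALL_ALLOC  0x00000000ffffffffULL
#define SKETCH_BITMAP_ALL_ZEROES 0xffffffff00000000ULL

enum sketch_repair {
    SKETCH_REFUSE,            /* leave the entry alone, report only */
    SKETCH_MAKE_ZERO,         /* every subcluster reads as zero */
    SKETCH_MAKE_UNALLOCATED,  /* drop the stray allocation bits */
};

/* Hypothetical policy: do not expose backing data through a repair. */
static enum sketch_repair sketch_choose_repair(bool has_backing_file)
{
    return has_backing_file ? SKETCH_REFUSE : SKETCH_MAKE_UNALLOCATED;
}

/* Returns the bitmap that would be written back for each option. */
static uint64_t sketch_repair_bitmap(enum sketch_repair repair,
                                     uint64_t bitmap)
{
    switch (repair) {
    case SKETCH_MAKE_ZERO:
        return SKETCH_BITMAP_ALL_ZEROES;
    case SKETCH_MAKE_UNALLOCATED:
        /* Existing reads-as-zero bits are kept as they are. */
        return bitmap & ~SKETCH_BITMAP_ALL_ALLOC;
    case SKETCH_REFUSE:
        break;
    }
    return bitmap;
}
```

The difference that matters for the discussion is visible in the results: after `SKETCH_MAKE_ZERO` no subcluster falls through to a backing file, while after `SKETCH_MAKE_UNALLOCATED` every subcluster without a reads-as-zero bit does.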

Perhaps we should have a truly destructive repair mode where all unreadable data is made 0. But OTOH, if users have an image that’s so broken, then it’s probably not wrong to tell them it’s unrepairable and they need to convert it to a fresh image (with --salvage).


Hanna
