Re: [Qemu-devel] Storing code caching

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-devel] Storing code caching

From:	Martin Williams
Subject:	Re: [Qemu-devel] Storing code caching
Date:	Thu, 8 Jul 2004 18:57:22 +0100

Hmm,

My initial idea was that once code is loaded (as I understand it -please correct me if I am utterly wrong) is that when a program loads,the memory is uses starts at a certain location, and that all codeinside the program is consistently located at a relative address fromthat location. (Could someone who really knows about this sort of thingplease advise?!?)

My idea is to write a program that caches individual files code (ratherthan everything) - based around the idea that when a block is startedexecuting, the cache would be accessed and address minus the baseaddress (in other words the offset of the block) would be used to findit within the cache (some algorithm is needed for an efficient methodof storing and locating these blocks as they will not be the same sizeas the originals). The basic idea would then be that once qemu detectsa self modifying piece of code, (by a write to a memory address), itwould then black list the block in which the write happened (is thispossible?).

The program I would write would basically use the qemu core to processan entire executable, creating the blocks that are executable on thehost machine, and store them. Then start work on modifying qemu torecognise the existense of the cache file and use the blocks. Then dealwith the self-modyfing code issue as above ...


Martin

PS - I'm a CS undergrad, but I'm game for it anyway :)

On 8 Jul 2004, at 18:05, John R. Hogerhuis wrote:

On Thu, 2004-07-08 at 05:26, Martin Williams wrote:

Has anyone thought about trying to store the code caching on disk?

Are you talking about "save machine state" essentially"suspend/resume?"

That is certainly possible and I believe it has been discussed on the
list.

The other possibility, that you wish to permanently associate
untranslated code with translated code by having a big cache available

on disk is in the general case "the halting problem" and there can beno

algorithm for that. So you've been warned: There Be Dragons Here...

However this is real life so there are probably some things you can do.

Some things to understand:

1. Basic blocks of code in the cache are found by their addresses in

memory, not their content. You can imagine that from one run to thenext

code would load in different spots in memory. I suppose you could come
up with a set of heuristics for recognizing a basic block:
a) the location is not permanent but it might be a good clue. Perhaps
though with virtual address space programs always locate to the same
place in a virtual map though they will be different spots in physical
map?

b) the length of the block never changes. That could be a goodheuristicc) A checksum of the code with consideration for absolute addressesthat

have been "fixed up" in the code. These addresses may be different from
run-to-run. Remember though adding in a checksum is an efficiency
tradeoff. It may not be worth it.
d) self modifying code, self modifying code, self modifying code...

In coming up with heuristics for recognizing already translated code
available in the cache, remember you are trading off against just
retranslating. Depending on the complexity/resource intensivity of
computations for your heuristic it may not be worth it to do the
computations.

If you think hard about it there are probably some things you could do
efficiently to reuse basic blocks from previous runs. "User mode" QEMU
is probably an easier case than the general one of running an entire OS
image. And maybe you would want to look at load time... When given a
program to run you check your on disk cache to see if you have loaded

this program before. Checksum it once to see if you have already saveda

cache image for this program. If so, load it up. Encountering
dynamically translated (invalidated cache) portions of the code will
result in "dead areas" which should never be cached.

Anyway an interesting problem for a grad student, I'd say... you have

some prototyping/analysis to do in order to come up with someheuristics

for matching up real code with cached code.

-- John.



_______________________________________________
Qemu-devel mailing list
address@hidden
http://lists.nongnu.org/mailman/listinfo/qemu-devel

[Prev in Thread]

Current Thread

[Next in Thread]

[Qemu-devel] Storing code caching, Martin Williams, 2004/07/08
- Re: [Qemu-devel] Storing code caching, John R. Hogerhuis, 2004/07/08
  - Re: [Qemu-devel] Storing code caching, Antony T Curtis, 2004/07/08
    - Re: [spam score 1/10 -pobox] Re: [Qemu-devel] Storing code caching, John R. Hogerhuis, 2004/07/08
    - Re[2]: [Qemu-devel] Storing code caching, Igor Shmukler, 2004/07/08
  - Re: [Qemu-devel] Storing code caching, Martin Williams <=
    - Re: [Qemu-devel] Storing code caching, John R. Hogerhuis, 2004/07/08
    - Re: [Qemu-devel] Storing code caching, Julian Seward, 2004/07/08
    - Re: [Qemu-devel] Storing code caching, John R. Hogerhuis, 2004/07/08

Prev by Date: Re: [Qemu-devel] Storing code caching
Next by Date: [Qemu-devel] RFC for new features
Previous by thread: Re[2]: [Qemu-devel] Storing code caching
Next by thread: Re: [Qemu-devel] Storing code caching
Index(es):
- Date
- Thread