Re: [PATCH 1/1] scripts/performance: Add bisect.py script

qemu-devel

From:	Ahmed Karaman
Subject:	Re: [PATCH 1/1] scripts/performance: Add bisect.py script
Date:	Sat, 25 Jul 2020 22:48:30 +0200
On Sat, Jul 25, 2020 at 9:48 PM Aleksandar Markovic
<aleksandar.qemu.devel@gmail.com> wrote:
>
>
>
> On Saturday, July 25, 2020, Ahmed Karaman <ahmedkhaledkaraman@gmail.com> 
> wrote:
>>
>> On Sat, Jul 25, 2020 at 2:31 PM Aleksandar Markovic 
>> <aleksandar.qemu.devel@gmail.com> wrote:
>>>
>>>
>>> Hi, Ahmed.
>>>
>>> Yes, somewhat related to John's hints on these comments, it is customary to 
>>> have just a brief description before "Copyright" lines. This means one 
>>> sentence, or a short paragraph (3-4 sentences max). The lengthy syntax 
>>> comment should be, in my opinion, moved after the license preamble, just 
>>> before the start of real Python code.
>>
>>
>> Thanks Mr. John and Aleksandar for your feedback. I will update the script 
>> accordingly.
>>
>>>
>>>
>>> One question:
>>>
>>> What is the behavior when the executable architecture and the "target" 
>>> command line option do not match (for example, one specifies the m68k 
>>> target but passes an hppa executable)? Would that be detected before the 
>>> bisect search, or would the bisect procedure be applied even though such 
>>> cases do not make sense?
>>
>>
>> The script will exit with an error along the lines of "Invalid 
>> ELF image for this architecture".
>> This is done before starting "bisect" and after the initial "configure" and 
>> "make".
>>
>
> This is good enough (the moment of detection). However, are all cleanups 
> done? Is temporary directory deleted?

I missed this. I will add a clean_up() function to be
called before any exit.
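
A rough sketch of what that could look like (not the final code: the use
of atexit and the helper register_clean_up() are assumptions on my part).
Handlers registered with atexit also run when the script leaves through
sys.exit(), so the error paths would be covered without touching every
exit call:

```python
import atexit
import shutil


def clean_up(directories):
    """Delete every directory in the list; missing ones are ignored."""
    for directory in directories:
        shutil.rmtree(directory, ignore_errors=True)


def register_clean_up(directories):
    """Run clean_up() at interpreter exit, including exits via sys.exit()."""
    atexit.register(clean_up, directories)
```

In the script this would be called once, right after the build directory
is created, e.g. register_clean_up([qemu_build_path]). Running "git bisect
reset" on exit is left out of the sketch.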

>
> The same questions apply to the scenario where the user specifies a 
> nonexistent commit ID as the start or the end commit.
>

The script will exit with a message from "git" saying that this ID
doesn't exist. This will be done during the initial measurements of
the two boundary commits which is also before the bisect process.

> Does the script work if the user specifies a tag instead of a commit ID? I 
> think it should work. For example, can the user specify v3.1.0 as the start 
> commit and v4.2.0 as the end commit, in order to detect 
> degradation/improvement between QEMU 3.1 and QEMU 4.2? Please test whether 
> such a scenario works. If it does, I think you should write "commit ID or tag 
> ID" instead of just "commit" in the commit message and applicable code 
> comments (including the user-visible help output on "-h").

Yes, tags also work. Basically, anything that works with "git bisect"
as "start" and "end" values works with the script.
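
For illustration, whether a value (commit ID or tag) names a commit can
also be checked up front. This is only a sketch and not part of the posted
patch; resolve_revision() is a hypothetical helper built on
"git rev-parse --verify":

```python
import subprocess


def resolve_revision(qemu_path, revision):
    """Return the full commit hash for a commit ID or tag, or None if unknown."""
    result = subprocess.run(
        # "^{commit}" peels tags so that annotated tags resolve to a commit.
        ["git", "rev-parse", "--verify", "--quiet", revision + "^{commit}"],
        cwd=qemu_path,
        stdout=subprocess.PIPE,
        stderr=subprocess.DEVNULL)
    if result.returncode:
        return None
    return result.stdout.decode("utf-8").strip()
```

With this, the script could exit with its own message when
resolve_revision() returns None, before any build is started.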

>
> Lastly, what happens if the specified start and end commits exist, but are in 
> the wrong order (end is "before" start)?

The script will also exit with an error before starting the bisect
process. The error would say:
"Some slow revs are not ancestors of the fast rev.
git bisect cannot work properly in this case.
Maybe you mistook slow and fast revs?"
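
The same condition could also be tested explicitly before bisecting. A
sketch, assuming a hypothetical helper is_ancestor() built on
"git merge-base --is-ancestor", which exits with status 0 when the first
revision is an ancestor of the second (this is not in the posted patch):

```python
import subprocess


def is_ancestor(qemu_path, older, newer):
    """Return True if revision `older` is an ancestor of revision `newer`."""
    result = subprocess.run(
        ["git", "merge-base", "--is-ancestor", older, newer],
        cwd=qemu_path,
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL)
    # Exit status 0 means "is an ancestor"; anything else is treated as "no".
    return result.returncode == 0
```

Note that any non-zero status, including the one for an unknown revision,
is reported as False here.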


>
> Thanks,
> Aleksandar
>
>
>
>
>>>
>>>
>>> Yours, Aleksandar
>>>
>>>
>>>>
>>>> +#  This program is free software: you can redistribute it and/or modify
>>>> +#  it under the terms of the GNU General Public License as published by
>>>> +#  the Free Software Foundation, either version 2 of the License, or
>>>> +#  (at your option) any later version.
>>>> +#
>>>> +#  This program is distributed in the hope that it will be useful,
>>>> +#  but WITHOUT ANY WARRANTY; without even the implied warranty of
>>>> +#  MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
>>>> +#  GNU General Public License for more details.
>>>> +#
>>>> +#  You should have received a copy of the GNU General Public License
>>>> +#  along with this program. If not, see <https://www.gnu.org/licenses/>.
>>>> +
>>>> +import argparse
>>>> +import multiprocessing
>>>> +import tempfile
>>>> +import os
>>>> +import shutil
>>>> +import subprocess
>>>> +import sys
>>>> +
>>>> +
>>>> +############################ GIT WRAPPERS ############################
>>>> +def git_bisect(qemu_path, command, args=None):
>>>> +    """
>>>> +    Wrapper function for running git bisect.
>>>> +
>>>> +    Parameters:
>>>> +    qemu_path (str): QEMU path.
>>>> +    command (str):   bisect command (start|fast|slow|reset).
>>>> +    args (list):     Optional arguments.
>>>> +
>>>> +    Returns:
>>>> +    (str):           git bisect stdout.
>>>> +    """
>>>> +    process = ["git", "bisect", command]
>>>> +    if args:
>>>> +        process += args
>>>> +    bisect = subprocess.run(process,
>>>> +                            cwd=qemu_path,
>>>> +                            stdout=subprocess.PIPE,
>>>> +                            stderr=subprocess.PIPE)
>>>> +    if bisect.returncode:
>>>> +        sys.exit(bisect.stderr.decode("utf-8"))
>>>> +    return bisect.stdout.decode("utf-8")
>>>> +
>>>> +
>>>> +def git_checkout(commit, qemu_path):
>>>> +    """
>>>> +    Wrapper function for checking out a given git commit.
>>>> +
>>>> +    Parameters:
>>>> +    commit (str):    Commit hash of a git commit.
>>>> +    qemu_path (str): QEMU path.
>>>> +    """
>>>> +    checkout_commit = subprocess.run(["git",
>>>> +                                      "checkout",
>>>> +                                      commit],
>>>> +                                     cwd=qemu_path,
>>>> +                                     stdout=subprocess.DEVNULL,
>>>> +                                     stderr=subprocess.PIPE)
>>>> +    if checkout_commit.returncode:
>>>> +        sys.exit(checkout_commit.stderr.decode("utf-8"))
>>>> +
>>>> +
>>>> +def git_clone(qemu_path):
>>>> +    """
>>>> +    Wrapper function for cloning QEMU git repo from GitHub.
>>>> +
>>>> +    Parameters:
>>>> +    qemu_path (str): Path to clone the QEMU repo to.
>>>> +    """
>>>> +    clone_qemu = subprocess.run(["git",
>>>> +                                 "clone",
>>>> +                                 "https://github.com/qemu/qemu.git",
>>>> +                                 qemu_path],
>>>> +                                stderr=subprocess.STDOUT)
>>>> +    if clone_qemu.returncode:
>>>> +        sys.exit("Failed to clone QEMU!")
>>>> +######################################################################
>>>> +
>>>> +
>>>> +def check_requirements(tool):
>>>> +    """
>>>> +    Verify that all script requirements are installed (perf|callgrind & git).
>>>> +
>>>> +    Parameters:
>>>> +    tool (str): Tool used for the measurement (perf or callgrind).
>>>> +    """
>>>> +    if tool == "perf":
>>>> +        check_perf_installation = subprocess.run(["which", "perf"],
>>>> +                                                 stdout=subprocess.DEVNULL)
>>>> +        if check_perf_installation.returncode:
>>>> +            sys.exit("Please install perf before running the script.")
>>>> +
>>>> +        # Ensure user has privilege to run perf
>>>> +        check_perf_executability = subprocess.run(["perf", "stat", "ls", "/"],
>>>> +                                                  stdout=subprocess.DEVNULL,
>>>> +                                                  stderr=subprocess.DEVNULL)
>>>> +        if check_perf_executability.returncode:
>>>> +            sys.exit("""
>>>> +        Error:
>>>> +        You may not have permission to collect stats.
>>>> +        Consider tweaking /proc/sys/kernel/perf_event_paranoid,
>>>> +        which controls use of the performance events system by
>>>> +        unprivileged users (without CAP_SYS_ADMIN).
>>>> +        -1: Allow use of (almost) all events by all users
>>>> +            Ignore mlock limit after perf_event_mlock_kb without CAP_IPC_LOCK
>>>> +        0: Disallow ftrace function tracepoint by users without CAP_SYS_ADMIN
>>>> +            Disallow raw tracepoint access by users without CAP_SYS_ADMIN
>>>> +        1: Disallow CPU event access by users without CAP_SYS_ADMIN
>>>> +        2: Disallow kernel profiling by users without CAP_SYS_ADMIN
>>>> +        To make this setting permanent, edit /etc/sysctl.conf too, e.g.:
>>>> +        kernel.perf_event_paranoid = -1
>>>> +
>>>> +        *Alternatively, you can run this script under sudo privileges.
>>>> +        """)
>>>> +    elif tool == "callgrind":
>>>> +        check_valgrind_installation = subprocess.run(["which", "valgrind"],
>>>> +                                                     stdout=subprocess.DEVNULL)
>>>> +        if check_valgrind_installation.returncode:
>>>> +            sys.exit("Please install valgrind before running the script.")
>>>> +
>>>> +    # Ensure that git is installed
>>>> +    check_git_installation = subprocess.run(["which", "git"],
>>>> +                                            stdout=subprocess.DEVNULL)
>>>> +    if check_git_installation.returncode:
>>>> +        sys.exit("Please install git before running the script.")
>>>> +
>>>> +
>>>> +def make(qemu_build_path):
>>>> +    """
>>>> +    Build QEMU by running the Makefile.
>>>> +
>>>> +    Parameters:
>>>> +    qemu_build_path (str): Path to the build directory with configuration files.
>>>> +    """
>>>> +    run_make = subprocess.run(["make",
>>>> +                               "-j",
>>>> +                               str(multiprocessing.cpu_count())],
>>>> +                              cwd=qemu_build_path,
>>>> +                              stdout=subprocess.DEVNULL,
>>>> +                              stderr=subprocess.PIPE)
>>>> +    if run_make.returncode:
>>>> +        sys.exit(run_make.stderr.decode("utf-8"))
>>>> +
>>>> +
>>>> +def measure_instructions(tool, qemu_exe_path, command):
>>>> +    """
>>>> +    Measure the number of instructions when running a program with QEMU.
>>>> +
>>>> +    Parameters:
>>>> +    tool (str):          Tool used for the measurement (perf|callgrind).
>>>> +    qemu_exe_path (str): Path to the QEMU executable of the equivalent target.
>>>> +    command (list):      Program path and arguments.
>>>> +
>>>> +    Returns:
>>>> +    (int):               Number of instructions.
>>>> +    """
>>>> +    if tool == "perf":
>>>> +        run_perf = subprocess.run((["perf",
>>>> +                                    "stat",
>>>> +                                    "-x",
>>>> +                                    " ",
>>>> +                                    "-e",
>>>> +                                    "instructions",
>>>> +                                    qemu_exe_path]
>>>> +                                   + command),
>>>> +                                  stdout=subprocess.DEVNULL,
>>>> +                                  stderr=subprocess.PIPE)
>>>> +        if run_perf.returncode:
>>>> +            sys.exit(run_perf.stderr.decode("utf-8"))
>>>> +        else:
>>>> +            perf_output = run_perf.stderr.decode("utf-8").split(" ")
>>>> +            return int(perf_output[0])
>>>> +
>>>> +    elif tool == "callgrind":
>>>> +        with tempfile.NamedTemporaryFile() as tmpfile:
>>>> +            run_callgrind = subprocess.run((["valgrind",
>>>> +                                             "--tool=callgrind",
>>>> +                                             "--callgrind-out-file={}".format(
>>>> +                                                 tmpfile.name),
>>>> +                                             qemu_exe_path]
>>>> +                                            + command),
>>>> +                                           stdout=subprocess.DEVNULL,
>>>> +                                           stderr=subprocess.PIPE)
>>>> +        if run_callgrind.returncode:
>>>> +            sys.exit(run_callgrind.stderr.decode("utf-8"))
>>>> +        else:
>>>> +            callgrind_output = run_callgrind.stderr.decode("utf-8").split("\n")
>>>> +            return int(callgrind_output[8].split(" ")[-1])
>>>> +
>>>> +
>>>> +def main():
>>>> +    # Parse the command line arguments
>>>> +    parser = argparse.ArgumentParser(
>>>> +        usage="bisect.py [-h] -s,--start START [-e,--end END] [-q,--qemu QEMU] "
>>>> +        "--target TARGET --tool {perf,callgrind} -- "
>>>> +        "<target executable> [<target executable options>]")
>>>> +
>>>> +    parser.add_argument("-s", "--start", dest="start", type=str, required=True,
>>>> +                        help="First commit hash in the search range")
>>>> +    parser.add_argument("-e", "--end", dest="end", type=str, default="master",
>>>> +                        help="Last commit hash in the search range")
>>>> +    parser.add_argument("-q", "--qemu", dest="qemu", type=str, default="",
>>>> +                        help="QEMU path")
>>>> +    parser.add_argument("--target", dest="target", type=str, required=True,
>>>> +                        help="QEMU target")
>>>> +    parser.add_argument("--tool", dest="tool", choices=["perf", "callgrind"],
>>>> +                        required=True, help="Tool used for measurements")
>>>> +
>>>> +    parser.add_argument("command", type=str, nargs="+", help=argparse.SUPPRESS)
>>>> +
>>>> +    args = parser.parse_args()
>>>> +
>>>> +    # Extract the needed variables from the args
>>>> +    start_commit = args.start
>>>> +    end_commit = args.end
>>>> +    qemu = args.qemu
>>>> +    target = args.target
>>>> +    tool = args.tool
>>>> +    command = args.command
>>>> +
>>>> +    # Set QEMU path
>>>> +    if qemu == "":
>>>> +        # Create a temp directory for cloning QEMU
>>>> +        tmpdir = tempfile.TemporaryDirectory()
>>>> +        qemu_path = os.path.join(tmpdir.name, "qemu")
>>>> +
>>>> +        # Clone QEMU into the temporary directory
>>>> +        print("Fetching QEMU: ", end="", flush=True)
>>>> +        git_clone(qemu_path)
>>>> +        print()
>>>> +    else:
>>>> +        qemu_path = qemu
>>>> +
>>>> +    # Create the build directory
>>>> +    qemu_build_path = os.path.join(qemu_path, "tmp-build-gcc")
>>>> +    if not os.path.exists(qemu_build_path):
>>>> +        os.mkdir(qemu_build_path)
>>>> +    else:
>>>> +        sys.exit("A build directory with the same name (tmp-build-gcc) used in "
>>>> +                 "the script is already in the provided QEMU path.")
>>>> +
>>>> +    qemu_exe_path = os.path.join(qemu_build_path,
>>>> +                                 "{}-linux-user".format(target),
>>>> +                                 "qemu-{}".format(target))
>>>> +
>>>> +    # Configure QEMU
>>>> +    configure = subprocess.run(["../configure",
>>>> +                                "--target-list={}-linux-user".format(target)],
>>>> +                               cwd=qemu_build_path,
>>>> +                               stdout=subprocess.DEVNULL,
>>>> +                               stderr=subprocess.PIPE)
>>>> +    if configure.returncode:
>>>> +        sys.exit(configure.stderr.decode("utf-8"))
>>>> +
>>>> +    # Do performance measurements for the start commit
>>>> +    git_checkout(start_commit, qemu_path)
>>>> +    make(qemu_build_path)
>>>> +    start_commit_instructions = measure_instructions(tool,
>>>> +                                                     qemu_exe_path,
>>>> +                                                     command)
>>>> +    print("{:<30} {}".format("Start Commit Instructions:",
>>>> +                             format(start_commit_instructions, ",")))
>>>> +
>>>> +    # Do performance measurements for the end commit
>>>> +    git_checkout(end_commit, qemu_path)
>>>> +    make(qemu_build_path)
>>>> +    end_commit_instructions = measure_instructions(tool,
>>>> +                                                   qemu_exe_path,
>>>> +                                                   command)
>>>> +    print("{:<30} {}".format("End Commit Instructions:",
>>>> +                             format(end_commit_instructions, ",")))
>>>> +
>>>> +    # Calculate performance difference between start and end commits
>>>> +    performance_difference = \
>>>> +        (start_commit_instructions - end_commit_instructions) / \
>>>> +        max(end_commit_instructions, start_commit_instructions) * 100
>>>> +    performance_change = "+" if performance_difference > 0 else "-"
>>>> +    print("{:<30} {}".format("Performance Change:",
>>>> +                             performance_change +
>>>> +                             str(round(abs(performance_difference), 3))+"%"))
>>>> +
>>>> +    # Set the custom terms used for progressing in "git bisect"
>>>> +    term_old = "fast" if performance_difference < 0 else "slow"
>>>> +    term_new = "slow" if term_old == "fast" else "fast"
>>>> +
>>>> +    # Start git bisect
>>>> +    git_bisect(qemu_path, "start", [
>>>> +               "--term-old", term_old, "--term-new", term_new])
>>>> +    # Set start commit state
>>>> +    git_bisect(qemu_path, term_old, [start_commit])
>>>> +    # Set end commit state
>>>> +    bisect_output = git_bisect(qemu_path, term_new, [end_commit])
>>>> +    # Print estimated bisect steps
>>>> +    print("\n{:<30} {}\n".format(
>>>> +        "Estimated Number of Steps:", bisect_output.split()[9]))
>>>> +
>>>> +    # Initialize bisect_count to track the number of performed bisect steps
>>>> +    bisect_count = 1
>>>> +
>>>> +    while True:
>>>> +        print("**************BISECT STEP {}**************".format(bisect_count))
>>>> +
>>>> +        make(qemu_build_path)
>>>> +
>>>> +        instructions = measure_instructions(tool, qemu_exe_path, command)
>>>> +        # Find the difference between the current instructions and start/end
>>>> +        # instructions.
>>>> +        diff_end = abs(instructions - end_commit_instructions)
>>>> +        diff_start = abs(instructions - start_commit_instructions)
>>>> +
>>>> +        # If current number of instructions is closer to that of start,
>>>> +        # set current commit as term_old.
>>>> +        # Else, set current commit as term_new.
>>>> +        if diff_end > diff_start:
>>>> +            bisect_command = term_old
>>>> +        else:
>>>> +            bisect_command = term_new
>>>> +
>>>> +        print("{:<20} {}".format("Instructions:", format(instructions, ",")))
>>>> +        print("{:<20} {}".format("Status:", "{} commit".format(bisect_command)))
>>>> +
>>>> +        bisect_output = git_bisect(qemu_path, bisect_command)
>>>> +
>>>> +        # Continue if still bisecting,
>>>> +        # else, print result and break.
>>>> +        if not bisect_output.split(" ")[0] == "Bisecting:":
>>>> +            print("\n*****************BISECT RESULT*****************")
>>>> +            commit_message_start = bisect_output.find("commit\n") + 7
>>>> +            commit_message_end = bisect_output.find(":040000") - 1
>>>> +            print(bisect_output[commit_message_start:commit_message_end])
>>>> +            break
>>>> +
>>>> +        bisect_count += 1
>>>> +
>>>> +    # Reset git bisect
>>>> +    git_bisect(qemu_path, "reset")
>>>> +
>>>> +    # Delete temp build directory
>>>> +    shutil.rmtree(qemu_build_path)
>>>> +
>>>> +
>>>> +if __name__ == "__main__":
>>>> +    main()
>>>> --
>>>> 2.17.1
>>>>

Best regards,
Ahmed Karaman