Issue 2759263002: Add image-first-tests flag to run-webkit-tests

Gleb Lanbin

glebl@chromium.org changed reviewers: + dpranke@chromium.org, eae@chromium.org

3 years, 9 months ago (2017-03-20 20:10:35 UTC) #1

eae

I like this change and think it makes a lot of sense for many of ...

3 years, 9 months ago (2017-03-20 20:30:35 UTC) #3

Dirk Pranke

dpranke@chromium.org changed reviewers: + jeffcarp@chromium.org, qyearsley@chromium.org

3 years, 9 months ago (2017-03-20 23:22:27 UTC) #4

Dirk Pranke

3 years, 9 months ago (2017-03-20 23:22:27 UTC) #5

+qyearsley, +jeffcarp who are the real owners of this code now.

I discussed this approach w/ Gleb at some point and it seems fine.

I will point out, of course, that if there's visible text in the test it'll
still need to be in the right places, but this will obviate the need to match
the layout tree, which I'm assuming is pointless in this case anyway.

https://codereview.chromium.org/2759263002/diff/1/third_party/WebKit/Tools/Sc...
File
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/single_test_runner.py
(right):

https://codereview.chromium.org/2759263002/diff/1/third_party/WebKit/Tools/Sc...
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/single_test_runner.py:30:
from collections import namedtuple
Nit: this line goes between lines 32 and 34.

https://codereview.chromium.org/2759263002/diff/1/third_party/WebKit/Tools/Sc...
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/single_test_runner.py:290:
function = namedtuple('Function', 'call args')
Nit: I would do this slightly differently. I'd just use unnamed 3-tuples here.

https://codereview.chromium.org/2759263002/diff/1/third_party/WebKit/Tools/Sc...
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/single_test_runner.py:310:
for compare_function in compare_functions:
And change this "for" line to:

  for func, first_arg, second_arg in compare_functions:
    failures.extend(func(first_arg, second_arg))

https://codereview.chromium.org/2759263002/diff/1/third_party/WebKit/Tools/Sc...
File third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/run_webkit_tests.py
(right):

https://codereview.chromium.org/2759263002/diff/1/third_party/WebKit/Tools/Sc...
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/run_webkit_tests.py:246:
'multiple times)'),
Is there a reason to make this a command line argument rather than a hard-coded
list somewhere in the code or in a config file? 

Where and when would we expect these flags to be passed?

eae

> I will point out, of course, that if there's visible text in the test ...

3 years, 9 months ago (2017-03-20 23:25:33 UTC) #6

eae

> I will point out, of course, that if there's visible text in the test ...

3 years, 9 months ago (2017-03-20 23:25:33 UTC) #7

Gleb Lanbin

3 years, 9 months ago (2017-03-21 17:31:52 UTC) #8

https://codereview.chromium.org/2759263002/diff/1/third_party/WebKit/Tools/Sc...
File
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/single_test_runner.py
(right):

https://codereview.chromium.org/2759263002/diff/1/third_party/WebKit/Tools/Sc...
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/single_test_runner.py:30:
from collections import namedtuple
On 2017/03/20 23:22:27, Dirk Pranke wrote:
> Nit: this line goes between lines 32 and 34.

Done.

https://codereview.chromium.org/2759263002/diff/1/third_party/WebKit/Tools/Sc...
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/single_test_runner.py:290:
function = namedtuple('Function', 'call args')
On 2017/03/20 23:22:27, Dirk Pranke wrote:
> Nit: I would do this slightly differently. I'd just use unnamed 3-tuples here.

Done.

https://codereview.chromium.org/2759263002/diff/1/third_party/WebKit/Tools/Sc...
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/single_test_runner.py:310:
for compare_function in compare_functions:
On 2017/03/20 23:22:27, Dirk Pranke wrote:
> And change this "for" line to:
> 
>   for func, first_arg, second_arg in compare_functions:
>     failures.extend(func(first_arg, second_arg))

Partially done. I switched to unnamed tuples, but I use 2-item tuples of
(function, args) here rather than 3-tuples, because I don't want to limit the
number of parameters that a compare function can take.
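
A minimal sketch of the 2-item tuple approach described above, for
illustration only. The comparator names, their arguments, and the failure
strings are hypothetical and are not taken from single_test_runner.py; the
point is only that each entry pairs a function with an argument tuple of any
length.

```python
def compare_text(expected, actual):
    """Hypothetical comparator: returns a list of failure descriptions."""
    return [] if expected == actual else ['text mismatch']


def compare_image(expected, actual, tolerance):
    """Hypothetical comparator that needs a third argument."""
    return [] if abs(expected - actual) <= tolerance else ['image mismatch']


def run_comparisons(compare_functions):
    """Calls each (function, args) pair and collects all failures."""
    failures = []
    for func, args in compare_functions:
        # Unpacking args lets each comparator take a different number
        # of parameters, which a fixed 3-tuple would not allow.
        failures.extend(func(*args))
    return failures


compare_functions = [
    (compare_text, ('abc', 'abc')),
    (compare_image, (10, 14, 2)),
]
failures = run_comparisons(compare_functions)
```

With fixed 3-tuples the loop body would be func(first_arg, second_arg), which
is simpler but ties every comparator to exactly two arguments.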

https://codereview.chromium.org/2759263002/diff/1/third_party/WebKit/Tools/Sc...
File third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/run_webkit_tests.py
(right):

https://codereview.chromium.org/2759263002/diff/1/third_party/WebKit/Tools/Sc...
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/run_webkit_tests.py:246:
'multiple times)'),
On 2017/03/20 23:22:27, Dirk Pranke wrote:
> Is there a reason to make this a command line argument rather than a
> hard-coded list somewhere in the code or in a config file?
>
> Where and when would we expect these flags to be passed?

The initial plan was to add a flag that can be used locally for testing and with
trybots (do we need to update the trybot JSON recipe for this?).

So we want to start with a limited set of folders, experiment with it a little,
and if there are no complaints, send an announcement to blink-dev and make this
testing approach the default. After that we can remove this flag.

If you think that updating the trybot JSON configuration would be problematic,
then we can use the "default" option at line 241 to set all the hardcoded
values. I can add a TODO to the flag's description and attach it to the bug so
we can delete this flag later. What do you think?

Dirk Pranke

LGTM. On 2017/03/21 17:31:52, Gleb Lanbin wrote: > On 2017/03/20 23:22:27, Dirk Pranke wrote: > ...

3 years, 9 months ago (2017-03-21 22:10:53 UTC) #9

qyearsley

3 years, 9 months ago (2017-03-21 22:44:10 UTC) #10

LGTM :-)

Should there be an issue filed for (possibly, later) switching over all pixel
tests to operate in this mode (ignoring text render tree baselines if an image
baseline is available, and using the text render tree baseline as a backup if no
image baseline exists)?
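
A rough sketch of the mode described above, assuming baselines and results are
plain dicts with optional 'image' and 'text' entries. The function name, the
dict layout, and the failure strings are all hypothetical; the real runner
works with driver output objects rather than dicts.

```python
def compare_image_first(expected, actual):
    """Compares against the image baseline if present, else the text baseline.

    Both arguments are hypothetical dicts with optional 'image' and 'text'
    keys. Returns a list of failure descriptions (empty on success).
    """
    if expected.get('image') is not None:
        # An image baseline exists, so the text baseline is ignored entirely.
        if expected['image'] == actual.get('image'):
            return []
        return ['image mismatch']
    if expected.get('text') is not None:
        # No image baseline: fall back to the text render tree baseline.
        if expected['text'] == actual.get('text'):
            return []
        return ['text mismatch']
    # Assumption: having no baseline at all is reported as a failure.
    return ['missing baseline']
```

Note that in this sketch a text difference is not reported when an image
baseline exists and matches, which is the behavior the question above asks
about.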

https://codereview.chromium.org/2759263002/diff/20001/third_party/WebKit/Tool...
File
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/single_test_runner.py
(right):

https://codereview.chromium.org/2759263002/diff/20001/third_party/WebKit/Tool...
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/single_test_runner.py:295:
expected_driver_output.audio, driver_output.audio))
Formatting note: Technically, the line length limit in Tools/Scripts/webkitpy is
132 so you could use longer lines if you want to.

https://codereview.chromium.org/2759263002/diff/20001/third_party/WebKit/Tool...
File third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/run_webkit_tests.py
(right):

https://codereview.chromium.org/2759263002/diff/20001/third_party/WebKit/Tool...
third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/run_webkit_tests.py:246:
'multiple times)'),
This help text could probably be made a bit clearer. Does it sound correct to
say:
"A directory (or test) where the test result will only be compared with the
image baseline if an image baseline is available, and fall back to comparison
with the text baseline when image baselines are missing. Specify multiple times
to add multiple directories." 

Since this argument is sort of parallel to --pixel-test-directory, you could
also change the option name and dest etc. to be more like
--pixel-test-directory.

            optparse.make_option(
                '--image-first-directory',
                action='append',
                default=[],
                dest='image_first_directories',
                help=('A directory (or test) where the test result will '
                      'only be compared with the image baseline if an image '
                      'baseline is available, and fall back to comparison '
                      'with the text baseline when image baselines are missing. '
                      'Specify multiple times to add multiple directories.')),
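
For reference, a small standalone sketch of how an option with action='append'
behaves when it is passed more than once. It uses a bare optparse.OptionParser
rather than the run_webkit_tests.py option groups, and the directory and test
names on the example command line are made up.

```python
import optparse

parser = optparse.OptionParser()
parser.add_option(
    '--image-first-directory',
    action='append',
    default=[],
    dest='image_first_directories',
    help='Specify multiple times to add multiple directories.')

# Hypothetical command line: two image-first directories and one test path.
options, args = parser.parse_args([
    '--image-first-directory', 'fast/css',
    '--image-first-directory', 'svg',
    'some/test.html',
])

# Each occurrence of the flag appends one value to the same list.
print(options.image_first_directories)
print(args)
```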

Gleb Lanbin

3 years, 9 months ago (2017-03-22 00:12:17 UTC) #11

On 2017/03/21 22:44:10, qyearsley wrote:
> LGTM :-)
> 
> Should there be an issue filed for (possibly, later) switching over all pixel
> tests to operate in this mode (ignoring text render tree baselines if an image
> baseline is available, and using the text render tree baseline as a backup if
> no image baseline exists)?

Done, thanks.
http://crbug.com/703899


>
> https://codereview.chromium.org/2759263002/diff/20001/third_party/WebKit/Tool...
> File
> third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/single_test_runner.py
> (right):
>
> https://codereview.chromium.org/2759263002/diff/20001/third_party/WebKit/Tool...
> third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/controllers/single_test_runner.py:295:
> expected_driver_output.audio, driver_output.audio))
> Formatting note: Technically, the line length limit in Tools/Scripts/webkitpy
> is 132 so you could use longer lines if you want to.
>
> https://codereview.chromium.org/2759263002/diff/20001/third_party/WebKit/Tool...
> File third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/run_webkit_tests.py
> (right):
>
> https://codereview.chromium.org/2759263002/diff/20001/third_party/WebKit/Tool...
> third_party/WebKit/Tools/Scripts/webkitpy/layout_tests/run_webkit_tests.py:246:
> 'multiple times)'),
> This help text could probably be made a bit clearer. Does it sound correct to
> say:
> "A directory (or test) where the test result will only be compared with the
> image baseline if an image baseline is available, and fall back to comparison
> with the text baseline when image baselines are missing. Specify multiple
> times to add multiple directories."
>
> Since this argument is sort of parallel to --pixel-test-directory, you could
> also change the option name and dest etc. to be more like
> --pixel-test-directory.
>
>             optparse.make_option(
>                 '--image-first-directory',
>                 action='append',
>                 default=[],
>                 dest='image_first_directories',
>                 help=('A directory (or test) where the test result will '
>                       'only be compared with the image baseline if an image '
>                       'baseline is available, and fall back to comparison '
>                       'with the text baseline when image baselines are missing. '
>                       'Specify multiple times to add multiple directories.')),


Thanks for the review.

Gleb Lanbin

The CQ bit was checked by glebl@chromium.org to run a CQ dry run

3 years, 9 months ago (2017-03-22 00:12:33 UTC) #12