Issue 10441103: Make sharding_supervisor.py run InProcessBrowserTest.Empty

Issue 10441103: Make sharding_supervisor.py run InProcessBrowserTest.Empty (Closed)

Created:
8 years, 6 months ago by mmenke

Modified:
8 years, 5 months ago

Reviewers:
Nicolas Sylvain, jam, Paweł Hajdan Jr.

CC:
chromium-reviews, joi+watch-content_chromium.org, pam+watch_chromium.org, jochen+watch-content_chromium.org, darin-cc_chromium.org, cmp

Base URL:
svn://svn.chromium.org/chrome/trunk/src/

Visibility:
Public.

More Reviews

Description

Make sharding_supervisor.py run InProcessBrowserTest.Empty before other tests, with a longer timeout. This is aimed at fixing the issue where the first 4 tests on the XP buildbots take much longer than other tests, often timing out. BUG=124260 Committed: https://src.chromium.org/viewvc/chrome?view=rev&revision=144760

Patch Set 1 #

Patch Set 2 : reorganize #

Patch Set 3 : Oops #

Patch Set 4 : #

Total comments: 4

Patch Set 5 : Response to comments #

Patch Set 6 : Remove comment about log parsing, as requested #

Patch Set 7 : Use --warmup (And sync) #

Total comments: 4

Patch Set 8 : Response to comments #

Created: 8 years, 5 months ago

Download [raw] [tar.bz2]

	Unified diffs	Side-by-side diffs	Delta from patch set	Stats (+27 lines, -5 lines)			Patch
M	content/public/test/test_launcher.h	View	1 2 3 4 5 6	1 chunk	+3 lines, -0 lines	0 comments	Download
M	content/test/test_launcher.cc	View	1 2 3 4 5 6 7	2 chunks	+7 lines, -3 lines	0 comments	Download
M	tools/sharding_supervisor/sharding_supervisor.py	View	1 2 3 4 5 6 7	2 chunks	+17 lines, -2 lines	0 comments	Download

Messages

Total messages: 49 (0 generated)

Expand Messages | Collapse Messages

Paweł Hajdan Jr.

This looks quite hacky. I don't think we should shard browser_test unless a bug with ...

8 years, 6 months ago (2012-06-01 14:48:23 UTC) #2

mmenke

On 2012/06/01 14:48:23, Paweł Hajdan Jr. wrote: > This looks quite hacky. I don't think ...

8 years, 6 months ago (2012-06-01 14:50:16 UTC) #3

mmenke

On 2012/06/01 14:50:16, Matt Menke wrote: > On 2012/06/01 14:48:23, Paweł Hajdan Jr. wrote: > ...

8 years, 6 months ago (2012-06-01 14:52:25 UTC) #4

mmenke

On 2012/06/01 14:52:25, Matt Menke wrote: > On 2012/06/01 14:50:16, Matt Menke wrote: > > ...

8 years, 6 months ago (2012-06-01 14:58:07 UTC) #5

Paweł Hajdan Jr.

My point is that: 1. This change is hacky (one place undoes change made in ...

8 years, 6 months ago (2012-06-01 15:09:43 UTC) #6

mmenke

On 2012/06/01 15:09:43, Paweł Hajdan Jr. wrote: > My point is that: > > 1. ...

8 years, 6 months ago (2012-06-01 15:22:22 UTC) #7

nsylvain

I am not sure I understand the "mixed output" bug thing. Sharding supervisor for browser ...

8 years, 6 months ago (2012-06-01 18:43:28 UTC) #8

Paweł Hajdan Jr.

The mixed-up output I was referring to is https://groups.google.com/a/chromium.org/forum/#!topic/chromium-dev/EwHAn-WtWOQ/discussion Also adding John, since what I'm ...

8 years, 6 months ago (2012-06-04 10:28:16 UTC) #9

mmenke

This CL is now just a partial revert of John's earlier CL (https://chromiumcodereview.appspot.com/9662011). http://codereview.chromium.org/10441103/diff/8004/content/test/test_launcher.cc File ...

8 years, 6 months ago (2012-06-04 14:38:11 UTC) #10

mmenke

http://codereview.chromium.org/10441103/diff/8004/content/test/test_launcher.cc File content/test/test_launcher.cc (right): http://codereview.chromium.org/10441103/diff/8004/content/test/test_launcher.cc#newcode640 content/test/test_launcher.cc:640: if (!should_shard && !command_line->HasSwitch(kGTestFilterFlag)) { On 2012/06/04 14:38:11, Matt ...

8 years, 6 months ago (2012-06-04 14:57:19 UTC) #11

mmenke

8 years, 6 months ago (2012-06-04 14:59:41 UTC) #12

jam

This means that each try run will take a few minutes more, because of the ...

8 years, 6 months ago (2012-06-04 15:39:13 UTC) #13

mmenke

On 2012/06/04 15:39:13, John Abd-El-Malek wrote: > This means that each try run will take ...

8 years, 6 months ago (2012-06-04 15:40:51 UTC) #14

Paweł Hajdan Jr.

On 2012/06/04 15:39:13, John Abd-El-Malek wrote: > This means that each try run will take ...

8 years, 6 months ago (2012-06-04 16:37:48 UTC) #15

mmenke

On 2012/06/04 16:37:48, Paweł Hajdan Jr. wrote: > On 2012/06/04 15:39:13, John Abd-El-Malek wrote: > ...

8 years, 6 months ago (2012-06-04 16:41:49 UTC) #16

jam

On 2012/06/04 16:41:49, Matt Menke wrote: > On 2012/06/04 16:37:48, Paweł Hajdan Jr. wrote: > ...

8 years, 6 months ago (2012-06-04 16:48:11 UTC) #17

jam

On 2012/06/04 15:40:51, Matt Menke wrote: > On 2012/06/04 15:39:13, John Abd-El-Malek wrote: > > ...

8 years, 6 months ago (2012-06-04 16:48:31 UTC) #18

Paweł Hajdan Jr.

LGTM In my previous comment, "login" should have been "logic", indeed it wouldn't make sense ...

8 years, 6 months ago (2012-06-04 17:00:09 UTC) #19

mmenke

On 2012/06/04 17:00:09, Paweł Hajdan Jr. wrote: > LGTM > > In my previous comment, ...

8 years, 6 months ago (2012-06-04 17:25:35 UTC) #20

jam

On 2012/06/04 17:25:35, Matt Menke wrote: > On 2012/06/04 17:00:09, Paweł Hajdan Jr. wrote: > ...

8 years, 6 months ago (2012-06-04 18:02:45 UTC) #21

On 2012/06/04 17:25:35, Matt Menke wrote:
> On 2012/06/04 17:00:09, Paweł Hajdan Jr. wrote:
> > LGTM
> > 
> > In my previous comment, "login" should have been "logic", indeed it wouldn't
> > make sense otherwise.
> 
> Great.  Once we reach some sort of consensus on patch set 4 vs patch set 6,
I'll
> land.  I feel that if I landed either set now, it would be over someone's
> objections.  I'm a bit concerned about this ending up in a stalemate.  I favor
> 4, but I could happily live with either solution.
> 
> While 4 might not be the most elegant solution, it does solve the problem
> without running the empty test a bunch of times.  It's not the most elegant
> solution, but it does solve a problem that's going on now without slowing
things
> down much.  While moving the logic into test_launcher.cc may be a better long
> term solution, we have tests failing now, and patch set 4 fixes the issue.

Changing how we do sharding is outside the scope of this, and has other
dependencies as well (i.e. gtest seems to know about sharding).

> 
> Patch set 6 also fixes the issue, with the downside being it slows test runs
> down by ~2 minutes (according to jam).  Skimming over a couple random test
runs
> of the slowest XP bots, the tests take about 30 minutes total.

The overhead is lower on the _build bots_ because browser_tests is sharded
across multiple machines. But on _try bots_, I had seen this empty test run 200
times.

>  Including
> reruns, looks like there are 20 shards run, not including re-run shards (Which
> we won't be running the Empty test again on, in either case).  Assuming 4
> seconds per test, that's 20n = 80 seconds wasted.  However, since we're
sharding
> those tests across 4 instances, that should end up being closer to 80/4 = 20
> seconds longer.  We get 120/4 = 30 seconds, if the empty test were run on
re-run
> shards, which may be where jam's two minutes comes from.  So test runs may
take
> about 1.1% longer (20/(30*60)).  This doesn't seem like a huge penalty, though
> it does all add up.  There may also be some advantage to set 4 not sharding
the
> warmup time, or there may not.

mmenke

On 2012/06/04 18:02:45, John Abd-El-Malek wrote: > On 2012/06/04 17:25:35, Matt Menke wrote: > > ...

8 years, 6 months ago (2012-06-04 18:23:03 UTC) #22

jam

On Mon, Jun 4, 2012 at 11:23 AM, <mmenke@chromium.org> wrote: > On 2012/06/04 18:02:45, John ...

8 years, 6 months ago (2012-06-04 18:24:26 UTC) #23

On Mon, Jun 4, 2012 at 11:23 AM, <mmenke@chromium.org> wrote:

> On 2012/06/04 18:02:45, John Abd-El-Malek wrote:
>
>> On 2012/06/04 17:25:35, Matt Menke wrote:
>> > On 2012/06/04 17:00:09, Paweł Hajdan Jr. wrote:
>> > > LGTM
>> > >
>> > > In my previous comment, "login" should have been "logic", indeed it
>>
> wouldn't
>
>> > > make sense otherwise.
>> >
>> > Great.  Once we reach some sort of consensus on patch set 4 vs patch
>> set 6,
>> I'll
>> > land.  I feel that if I landed either set now, it would be over
>> someone's
>> > objections.  I'm a bit concerned about this ending up in a stalemate.  I
>>
> favor
>
>> > 4, but I could happily live with either solution.
>> >
>> > While 4 might not be the most elegant solution, it does solve the
>> problem
>> > without running the empty test a bunch of times.  It's not the most
>> elegant
>> > solution, but it does solve a problem that's going on now without
>> slowing
>> things
>> > down much.  While moving the logic into test_launcher.cc may be a better
>>
> long
>
>> > term solution, we have tests failing now, and patch set 4 fixes the
>> issue.
>>
>
>  Changing how we do sharding is outside the scope of this, and has other
>> dependencies as well (i.e. gtest seems to know about sharding).
>>
>
>  >
>> > Patch set 6 also fixes the issue, with the downside being it slows test
>> runs
>> > down by ~2 minutes (according to jam).  Skimming over a couple random
>> test
>> runs
>> > of the slowest XP bots, the tests take about 30 minutes total.
>>
>
>  The overhead is lower on the _build bots_ because browser_tests is sharded
>> across multiple machines. But on _try bots_, I had seen this empty test
>> run
>>
> 200
>
>> times.
>>
>
> Ahh...  You're right.  On my test run, it ran at least 82 times.  About a
> second
> each, on average, but that is a lot of runs, and it's easy to see how that
> could
> get even more out of hand.
>

yeah, the next generation of trybot hardware has 2x as many cores, so
there'll be even more sharding :)


>
>
http://codereview.chromium.**org/10441103/<http://codereview.chromium.org/104...
>

Paweł Hajdan Jr.

I think that flaky test timeouts are a more serious problem than possibly wasting some ...

8 years, 6 months ago (2012-06-06 10:35:38 UTC) #24

jam

On 2012/06/06 10:35:38, Paweł Hajdan Jr. wrote: > I think that flaky test timeouts are ...

8 years, 6 months ago (2012-06-06 16:56:33 UTC) #25

Paweł Hajdan Jr.

On 2012/06/06 16:56:33, John Abd-El-Malek wrote: > I don't understand why you're suggesting this, what's ...

8 years, 6 months ago (2012-06-18 15:36:57 UTC) #26

mmenke

On 2012/06/18 15:36:57, Paweł Hajdan Jr. wrote: > On 2012/06/06 16:56:33, John Abd-El-Malek wrote: > ...

8 years, 6 months ago (2012-06-18 16:46:03 UTC) #27

jam

On 2012/06/18 15:36:57, Paweł Hajdan Jr. wrote: > On 2012/06/06 16:56:33, John Abd-El-Malek wrote: > ...

8 years, 6 months ago (2012-06-18 17:30:29 UTC) #28

jam

And to avoid another two week round-trip, here's why I don't think it's hacky. test_launcher.cc ...

8 years, 6 months ago (2012-06-18 17:47:07 UTC) #29

And to avoid another two week round-trip, here's why I don't think it's
hacky.

test_launcher.cc is run for every shard. It's not the right place to run
this initial test because otherwise it would have to know which set of
shards are running initially, and then to run the empty test in each of
them. I don't know if the first part of this is possible. Even if it was,
that means that the empty test would have to be run n times. So you might
think your proposed solution is less hacky, but for these reasons I don't
agree. The logic that Matt added is minimal, so it's not much of a
duplication. I also think that running it in the sharding supervisor is the
right place, again because it's the serial path that runs initially before
sharding begins. We really want this empty test to just be run _once_.

If you look at my reviews, you'll see that I never advocate for a hacky
shortcut. I always advocate for doing the right thing. So just because you
don't agree with what I'm arguing for, it's not reasonable to label it as a
hack.

And lastly, if you'd like to block changes from landing, then waiting two
weeks to reply isn't reasonable. i.e. I understand you're busy with other
stuff now, which is understandable/fine of course. But you can't chime in
and say you don't like something and then step away for two weeks which
leaves the original author waiting for you.

On Mon, Jun 18, 2012 at 10:30 AM, <jam@chromium.org> wrote:

> On 2012/06/18 15:36:57, Paweł Hajdan Jr. wrote:
>
>> On 2012/06/06 16:56:33, John Abd-El-Malek wrote:
>> > I don't understand why you're suggesting this, what's wrong with
>> patchset 4?
>> > That solves the initial test problem.
>>
>
>  Code review is not only about whether the patch solves the problem (most
>> of
>>
> the
>
>> time it does).
>>
>
>  It's also about quality, and I have a strong opinion that patchset 4
>> degrades
>> the quality of our test infrastructure, making it less maintainable.
>>
>
>  > Wasting 2 minutes over every try run adds up.
>>
>
>  Agreed. I have suggested possible solutions.
>>
>
>  > I think enough time has been spent on this and we should just commit
>>
> patchset
>
>> 4.
>>
>
>  I can't stop everyone from making hacky changes. If you really want to do
>> it,
>> feel free. Note that "let's just do this quick hack" shouldn't be a
>> convincing
>> argument. See http://dev.chromium.org/**developers/committers-**
>>
responsibility<http://dev.chromium.org/developers/committers-responsibility>:
>>
> "In
>
>> short, do the right thing for the project, not the easiest thing to get
>> code
>> committed, and above all: use your best judgement.".
>>
>
>  I would be surprised if my suggested changes took more than 1-2 days to
>> land.
>>
>
> Just saying something is hacky doesn't make it so. Have you read all this
> thread?
>
> Can you explain exactly is hacky with patchset 4?
>
>
http://codereview.chromium.**org/10441103/<http://codereview.chromium.org/104...
>

Paweł Hajdan Jr.

On 2012/06/18 16:46:03, Matt Menke wrote: > Paweł: I assume your suggestion was either add ...

8 years, 6 months ago (2012-06-19 16:29:25 UTC) #30

mmenke

On 2012/06/19 16:29:25, Paweł Hajdan Jr. wrote: > On 2012/06/18 16:46:03, Matt Menke wrote: > ...

8 years, 6 months ago (2012-06-19 16:42:07 UTC) #31

Paweł Hajdan Jr.

On 2012/06/19 16:42:07, Matt Menke wrote: > The first sentence - "use a gtest_filter for ...

8 years, 6 months ago (2012-06-19 18:51:54 UTC) #32

jam

On 2012/06/19 18:51:54, Paweł Hajdan Jr. wrote: > On 2012/06/19 16:42:07, Matt Menke wrote: > ...

8 years, 6 months ago (2012-06-19 19:19:12 UTC) #33

jam

ping On Tue, Jun 19, 2012 at 12:19 PM, <jam@chromium.org> wrote: > On 2012/06/19 18:51:54, ...

8 years, 6 months ago (2012-06-22 20:26:45 UTC) #34

Paweł Hajdan Jr.

My understanding is that Matt is working on --warmup switch. I think I've explained my ...

8 years, 6 months ago (2012-06-23 07:25:10 UTC) #35

mmenke

On 2012/06/23 07:25:10, Paweł Hajdan Jr. wrote: > My understanding is that Matt is working ...

8 years, 6 months ago (2012-06-25 19:05:15 UTC) #36

mmenke

Paweł: Is this more what you had in mind? Unfortunately, we can't really remove the ...

8 years, 6 months ago (2012-06-26 17:53:21 UTC) #37

Paweł Hajdan Jr.

LGTM This is _exactly_ what I had in mind - thank you for doing the ...

8 years, 5 months ago (2012-06-27 07:48:52 UTC) #38

mmenke

I plan on committing this tomorrow. The code and behavior should suit everyone. If anyone ...

8 years, 5 months ago (2012-06-27 16:14:48 UTC) #39

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-status.appspot.com/cq/mmenke@chromium.org/10441103/56001

8 years, 5 months ago (2012-06-28 14:21:30 UTC) #40

commit-bot: I haz the power

Try job failure for 10441103-56001 (retry) on win_rel for step "browser_tests". It's a second try, ...

8 years, 5 months ago (2012-06-28 16:45:53 UTC) #41

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-status.appspot.com/cq/mmenke@chromium.org/10441103/56001

8 years, 5 months ago (2012-06-28 16:46:47 UTC) #42

commit-bot: I haz the power

Try job failure for 10441103-56001 (retry) on mac_rel for step "browser_tests". It's a second try, ...

8 years, 5 months ago (2012-06-28 17:50:07 UTC) #43

mmenke

And it turns out not to be working... Looks like the CL to enable incremental ...

8 years, 5 months ago (2012-06-29 17:35:07 UTC) #44

jam

ok, supporting the empty test might be hard since test_launcher.cc is compiled in test_support_common now ...

8 years, 5 months ago (2012-06-29 18:14:42 UTC) #46

mmenke

On 2012/06/29 18:14:42, John Abd-El-Malek wrote: > ok, supporting the empty test might be hard ...

8 years, 5 months ago (2012-06-29 18:18:26 UTC) #47

jam

On 2012/06/29 18:18:26, Matt Menke wrote: > On 2012/06/29 18:14:42, John Abd-El-Malek wrote: > > ...

8 years, 5 months ago (2012-06-29 19:04:52 UTC) #48

jam

8 years, 5 months ago (2012-06-29 19:53:08 UTC) #49

ok, this should work: http://codereview.chromium.org/10692048

Expand Messages | Collapse Messages