Issue 10828165: Refine WinSAT histogram bounds and rescale so the results are integers.

Issue 10828165: Refine WinSAT histogram bounds and rescale so the results are integers. (Closed)

Created:
8 years, 4 months ago by dtu

Modified:
8 years, 4 months ago

Reviewers:
Zhenyao Mo, jar (doing other things), piman

CC:
chromium-reviews, joi+watch-content_chromium.org, darin-cc_chromium.org, jam, apatrick_chromium

Base URL:
svn://svn.chromium.org/chrome/trunk/src

Visibility:
Public.

More Reviews

Description

Refine WinSAT histogram bounds and rescale so the results are integers. BUG=None. TEST=chrome://histograms on Windows show valid results for the WinSAT histograms. Committed: http://src.chromium.org/viewvc/chrome?view=rev&revision=150260

Patch Set 1 #

Total comments: 4

Patch Set 2 : Changes from jar's comments. #

Created: 8 years, 4 months ago

Download [raw] [tar.bz2]

		Unified diffs	Side-by-side diffs	Delta from patch set	Stats (+6 lines, -6 lines)			Patch
	M	content/gpu/gpu_info_collector_win.cc	View	1	1 chunk	+6 lines, -6 lines	0 comments	Download

Messages

Total messages: 15 (0 generated)

Expand Messages | Collapse Messages

piman

OWNERS-wise LGTM, but please have it reviewed by someone who knows the code / expectations ...

8 years, 4 months ago (2012-08-03 22:58:54 UTC) #2

Zhenyao Mo

I think you need to define new histograms because the values changed their meanings (a ...

8 years, 4 months ago (2012-08-03 23:06:07 UTC) #3

dtu

On 2012/08/03 23:06:07, Zhenyao Mo wrote: > I think you need to define new histograms ...

8 years, 4 months ago (2012-08-03 23:10:55 UTC) #4

Zhenyao Mo

On 2012/08/03 23:10:55, Dave Tu wrote: > On 2012/08/03 23:06:07, Zhenyao Mo wrote: > > ...

8 years, 4 months ago (2012-08-03 23:18:02 UTC) #5

jar (doing other things)

https://chromiumcodereview.appspot.com/10828165/diff/1/content/gpu/gpu_info_collector_win.cc File content/gpu/gpu_info_collector_win.cc (right): https://chromiumcodereview.appspot.com/10828165/diff/1/content/gpu/gpu_info_collector_win.cc#newcode139 content/gpu/gpu_info_collector_win.cc:139: UMA_HISTOGRAM_ENUMERATION("GPU.WinSAT.OverallScore", stats.overall * 10, 200); In addition to my ...

8 years, 4 months ago (2012-08-04 02:04:46 UTC) #6

https://chromiumcodereview.appspot.com/10828165/diff/1/content/gpu/gpu_info_c...
File content/gpu/gpu_info_collector_win.cc (right):

https://chromiumcodereview.appspot.com/10828165/diff/1/content/gpu/gpu_info_c...
content/gpu/gpu_info_collector_win.cc:139:
UMA_HISTOGRAM_ENUMERATION("GPU.WinSAT.OverallScore", stats.overall * 10, 200);
In addition to my comment below....

I was surprised first of all to see that you wanted 200 buckets.  That is
generally larger than most folks would find useful.... but I don't fully
understand your data, and your expected distribution.

Looking at the old code, it seemed to suggest that teh interesting values of
"stats.overall" was going to be values between 0 and 50.  In addition, since you
said you had a total of 50 buckets... you got pretty close to one bucket per
unit interval (subtle reason why it was not exact... and you probably had a top
end interval like 48-50 as one bucket).

The new range suggests that you want to see values where stats.overall * 10 is
in the range of 0-200.  Hence you're now only interested in stats.overall in the
range of 0 to 20 (rather than your previous interest up to 50).  ...and in
addition, you wanted that smaller range broken up into a super fine grain list
of 200 separate buckets.

I took a look at the collected data, and can see that it is it is indeed super
rare to even go beyond 7.

As a result, it probably makes sense that you are now looking only at the range
of 0-20 (scaled by a factor of 10).

...however... the "cool" way to do this is not to have a giant linear histogram
(re: using HISTOGRAM_ENUMERATION), but instead let the magic of the exponential
histogram help you.

I'd suggest using:

UMA_HISTOGRAM_CUSTOM_COUNTS("name", overall * 10, 1, 200, 50);

That will give you a LOT of resolution down near the lower end of the range
1-200, but not waste buckets looking at narrow details in the upper range.

YMMV... but 200 buckets is a lot of buckets... and probably not needed (and the
suggestion above will only gather 50 buckets).

https://chromiumcodereview.appspot.com/10828165/diff/1/content/gpu/gpu_info_c...
content/gpu/gpu_info_collector_win.cc:142:
UMA_HISTOGRAM_ENUMERATION("GPU.WinSAT.GamingScore", stats.gaming * 10, 200);
Best practice, when changing the parameters by which you defined a histogram, is
to change the name to a new name.  This way, if you look at "All" results from
all versions, you won't get a confusing conflation of the two histograms.

Common practice is just to add a trailing "2" or such to the "old" name to
define a new name.

You should indeed land the old names in our XML file, so that we can see old
data.  I've temporarily landed GPU.WiinSat_OverallScore so you can now glance at
what data we've already collected.

dtu

On 2012/08/04 02:04:46, jar wrote: > https://chromiumcodereview.appspot.com/10828165/diff/1/content/gpu/gpu_info_collector_win.cc > File content/gpu/gpu_info_collector_win.cc (right): > > https://chromiumcodereview.appspot.com/10828165/diff/1/content/gpu/gpu_info_collector_win.cc#newcode139 > ...

8 years, 4 months ago (2012-08-04 03:17:34 UTC) #7

On 2012/08/04 02:04:46, jar wrote:
>
https://chromiumcodereview.appspot.com/10828165/diff/1/content/gpu/gpu_info_c...
> File content/gpu/gpu_info_collector_win.cc (right):
> 
>
https://chromiumcodereview.appspot.com/10828165/diff/1/content/gpu/gpu_info_c...
> content/gpu/gpu_info_collector_win.cc:139:
> UMA_HISTOGRAM_ENUMERATION("GPU.WinSAT.OverallScore", stats.overall * 10, 200);
> In addition to my comment below....
> 
> I was surprised first of all to see that you wanted 200 buckets.  That is
> generally larger than most folks would find useful.... but I don't fully
> understand your data, and your expected distribution.
> 
> Looking at the old code, it seemed to suggest that teh interesting values of
> "stats.overall" was going to be values between 0 and 50.  In addition, since
you
> said you had a total of 50 buckets... you got pretty close to one bucket per
> unit interval (subtle reason why it was not exact... and you probably had a
top
> end interval like 48-50 as one bucket).
> 
> The new range suggests that you want to see values where stats.overall * 10 is
> in the range of 0-200.  Hence you're now only interested in stats.overall in
the
> range of 0 to 20 (rather than your previous interest up to 50).  ...and in
> addition, you wanted that smaller range broken up into a super fine grain list
> of 200 separate buckets.
> 
> I took a look at the collected data, and can see that it is it is indeed super
> rare to even go beyond 7.
> 
> As a result, it probably makes sense that you are now looking only at the
range
> of 0-20 (scaled by a factor of 10).
> 
> ...however... the "cool" way to do this is not to have a giant linear
histogram
> (re: using HISTOGRAM_ENUMERATION), but instead let the magic of the
exponential
> histogram help you.
> 
> I'd suggest using:
> 
> UMA_HISTOGRAM_CUSTOM_COUNTS("name", overall * 10, 1, 200, 50);
> 
> That will give you a LOT of resolution down near the lower end of the range
> 1-200, but not waste buckets looking at narrow details in the upper range.
> 
> YMMV... but 200 buckets is a lot of buckets... and probably not needed (and
the
> suggestion above will only gather 50 buckets).
> 
>
https://chromiumcodereview.appspot.com/10828165/diff/1/content/gpu/gpu_info_c...
> content/gpu/gpu_info_collector_win.cc:142:
> UMA_HISTOGRAM_ENUMERATION("GPU.WinSAT.GamingScore", stats.gaming * 10, 200);
> Best practice, when changing the parameters by which you defined a histogram,
is
> to change the name to a new name.  This way, if you look at "All" results from
> all versions, you won't get a confusing conflation of the two histograms.
> 
> Common practice is just to add a trailing "2" or such to the "old" name to
> define a new name.
> 
> You should indeed land the old names in our XML file, so that we can see old
> data.  I've temporarily landed GPU.WiinSat_OverallScore so you can now glance
at
> what data we've already collected.

Thanks for adding that temporary histogram to the dashboard! I actually didn't
know what the data would look like, so that's very helpful. Is there a way to
visualize the data without landing a patch?

About the actual distribution of the data: it's a number representing the
performance of the machine, gathered when Chrome is launched. Therefore, it
should slowly go up over time, with no theoretical limit, so some futureproofing
is needed.

There are, however, artificial limits in Windows, that also go up, with each
Windows release. These give a sense of how we expect the numbers to go up over
time. They are 5.9 on Windows VIsta, 7.9 on Windows 7, and 9.9 on Windows 8. I'm
guessing the ones over 9 are people who manually changed their scores, or
employees at Microsoft testing the next Windows release.

The numbers are expressed with one decimal place, so my first thought was to
just make a bucket for each possible value; but you're right; that leads to too
many buckets. Your suggestion is a good one. But it's also possible to get a
sample of 0, if we failed to get the numbers. Should we just ignore those? Note
there is another histogram here that checks for it, GPU.WinSAT.HasResults.

jar (doing other things)

Sample of zero is not a problem. We always have an "underflow" bucket that lists ...

8 years, 4 months ago (2012-08-06 18:37:08 UTC) #8

Sample of zero is not a problem.  We always have an "underflow" bucket that
lists as [0-k), where k is the declared minimum.  In truth, it actually picks up
any negative samples as well (in case your metric is really horked... so that we
don't discard "data."

Jim

On 2012/08/04 03:17:34, Dave Tu wrote:
> On 2012/08/04 02:04:46, jar wrote:
> >
>
https://chromiumcodereview.appspot.com/10828165/diff/1/content/gpu/gpu_info_c...
> > File content/gpu/gpu_info_collector_win.cc (right):
> > 
> >
>
https://chromiumcodereview.appspot.com/10828165/diff/1/content/gpu/gpu_info_c...
> > content/gpu/gpu_info_collector_win.cc:139:
> > UMA_HISTOGRAM_ENUMERATION("GPU.WinSAT.OverallScore", stats.overall * 10,
200);
> > In addition to my comment below....
> > 
> > I was surprised first of all to see that you wanted 200 buckets.  That is
> > generally larger than most folks would find useful.... but I don't fully
> > understand your data, and your expected distribution.
> > 
> > Looking at the old code, it seemed to suggest that teh interesting values of
> > "stats.overall" was going to be values between 0 and 50.  In addition, since
> you
> > said you had a total of 50 buckets... you got pretty close to one bucket per
> > unit interval (subtle reason why it was not exact... and you probably had a
> top
> > end interval like 48-50 as one bucket).
> > 
> > The new range suggests that you want to see values where stats.overall * 10
is
> > in the range of 0-200.  Hence you're now only interested in stats.overall in
> the
> > range of 0 to 20 (rather than your previous interest up to 50).  ...and in
> > addition, you wanted that smaller range broken up into a super fine grain
list
> > of 200 separate buckets.
> > 
> > I took a look at the collected data, and can see that it is it is indeed
super
> > rare to even go beyond 7.
> > 
> > As a result, it probably makes sense that you are now looking only at the
> range
> > of 0-20 (scaled by a factor of 10).
> > 
> > ...however... the "cool" way to do this is not to have a giant linear
> histogram
> > (re: using HISTOGRAM_ENUMERATION), but instead let the magic of the
> exponential
> > histogram help you.
> > 
> > I'd suggest using:
> > 
> > UMA_HISTOGRAM_CUSTOM_COUNTS("name", overall * 10, 1, 200, 50);
> > 
> > That will give you a LOT of resolution down near the lower end of the range
> > 1-200, but not waste buckets looking at narrow details in the upper range.
> > 
> > YMMV... but 200 buckets is a lot of buckets... and probably not needed (and
> the
> > suggestion above will only gather 50 buckets).
> > 
> >
>
https://chromiumcodereview.appspot.com/10828165/diff/1/content/gpu/gpu_info_c...
> > content/gpu/gpu_info_collector_win.cc:142:
> > UMA_HISTOGRAM_ENUMERATION("GPU.WinSAT.GamingScore", stats.gaming * 10, 200);
> > Best practice, when changing the parameters by which you defined a
histogram,
> is
> > to change the name to a new name.  This way, if you look at "All" results
from
> > all versions, you won't get a confusing conflation of the two histograms.
> > 
> > Common practice is just to add a trailing "2" or such to the "old" name to
> > define a new name.
> > 
> > You should indeed land the old names in our XML file, so that we can see old
> > data.  I've temporarily landed GPU.WiinSat_OverallScore so you can now
glance
> at
> > what data we've already collected.
> 
> Thanks for adding that temporary histogram to the dashboard! I actually didn't
> know what the data would look like, so that's very helpful. Is there a way to
> visualize the data without landing a patch?
> 
> About the actual distribution of the data: it's a number representing the
> performance of the machine, gathered when Chrome is launched. Therefore, it
> should slowly go up over time, with no theoretical limit, so some
futureproofing
> is needed.
> 
> There are, however, artificial limits in Windows, that also go up, with each
> Windows release. These give a sense of how we expect the numbers to go up over
> time. They are 5.9 on Windows VIsta, 7.9 on Windows 7, and 9.9 on Windows 8.
I'm
> guessing the ones over 9 are people who manually changed their scores, or
> employees at Microsoft testing the next Windows release.
> 
> The numbers are expressed with one decimal place, so my first thought was to
> just make a bucket for each possible value; but you're right; that leads to
too
> many buckets. Your suggestion is a good one. But it's also possible to get a
> sample of 0, if we failed to get the numbers. Should we just ignore those?
Note
> there is another histogram here that checks for it, GPU.WinSAT.HasResults.

dtu

Cool! SGTM https://chromiumcodereview.appspot.com/10828165/diff/1/content/gpu/gpu_info_collector_win.cc File content/gpu/gpu_info_collector_win.cc (right): https://chromiumcodereview.appspot.com/10828165/diff/1/content/gpu/gpu_info_collector_win.cc#newcode139 content/gpu/gpu_info_collector_win.cc:139: UMA_HISTOGRAM_ENUMERATION("GPU.WinSAT.OverallScore", stats.overall * 10, 200); On 2012/08/04 ...

8 years, 4 months ago (2012-08-06 21:41:47 UTC) #9

Cool! SGTM

https://chromiumcodereview.appspot.com/10828165/diff/1/content/gpu/gpu_info_c...
File content/gpu/gpu_info_collector_win.cc (right):

https://chromiumcodereview.appspot.com/10828165/diff/1/content/gpu/gpu_info_c...
content/gpu/gpu_info_collector_win.cc:139:
UMA_HISTOGRAM_ENUMERATION("GPU.WinSAT.OverallScore", stats.overall * 10, 200);
On 2012/08/04 02:04:46, jar wrote:
> In addition to my comment below....
> 
> I was surprised first of all to see that you wanted 200 buckets.  That is
> generally larger than most folks would find useful.... but I don't fully
> understand your data, and your expected distribution.
> 
> Looking at the old code, it seemed to suggest that teh interesting values of
> "stats.overall" was going to be values between 0 and 50.  In addition, since
you
> said you had a total of 50 buckets... you got pretty close to one bucket per
> unit interval (subtle reason why it was not exact... and you probably had a
top
> end interval like 48-50 as one bucket).
> 
> The new range suggests that you want to see values where stats.overall * 10 is
> in the range of 0-200.  Hence you're now only interested in stats.overall in
the
> range of 0 to 20 (rather than your previous interest up to 50).  ...and in
> addition, you wanted that smaller range broken up into a super fine grain list
> of 200 separate buckets.
> 
> I took a look at the collected data, and can see that it is it is indeed super
> rare to even go beyond 7.
> 
> As a result, it probably makes sense that you are now looking only at the
range
> of 0-20 (scaled by a factor of 10).
> 
> ...however... the "cool" way to do this is not to have a giant linear
histogram
> (re: using HISTOGRAM_ENUMERATION), but instead let the magic of the
exponential
> histogram help you.
> 
> I'd suggest using:
> 
> UMA_HISTOGRAM_CUSTOM_COUNTS("name", overall * 10, 1, 200, 50);
> 
> That will give you a LOT of resolution down near the lower end of the range
> 1-200, but not waste buckets looking at narrow details in the upper range.
> 
> YMMV... but 200 buckets is a lot of buckets... and probably not needed (and
the
> suggestion above will only gather 50 buckets).
> 

Done.

https://chromiumcodereview.appspot.com/10828165/diff/1/content/gpu/gpu_info_c...
content/gpu/gpu_info_collector_win.cc:142:
UMA_HISTOGRAM_ENUMERATION("GPU.WinSAT.GamingScore", stats.gaming * 10, 200);
On 2012/08/04 02:04:46, jar wrote:
> Best practice, when changing the parameters by which you defined a histogram,
is
> to change the name to a new name.  This way, if you look at "All" results from
> all versions, you won't get a confusing conflation of the two histograms.
> 
> Common practice is just to add a trailing "2" or such to the "old" name to
> define a new name.
> 
> You should indeed land the old names in our XML file, so that we can see old
> data.  I've temporarily landed GPU.WiinSat_OverallScore so you can now glance
at
> what data we've already collected.

Done.

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-status.appspot.com/cq/dtu@chromium.org/10828165/6002

8 years, 4 months ago (2012-08-06 22:55:54 UTC) #12

commit-bot: I haz the power

Try job failure for 10828165-6002 (retry) on linux_chromeos for step "compile" (clobber build). It's a ...

8 years, 4 months ago (2012-08-06 23:25:27 UTC) #13

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-status.appspot.com/cq/dtu@chromium.org/10828165/6002

8 years, 4 months ago (2012-08-07 01:18:01 UTC) #14

Change committed as 150260

Expand Messages | Collapse Messages