Chromium Code Reviews
chromiumcodereview-hr@appspot.gserviceaccount.com (chromiumcodereview-hr) | Please choose your nickname with Settings | Help | Chromium Project | Gerrit Changes | Sign out
(84)

Issue 1042053003: Add calculation of derived features for distillable page model (Closed)

Created:
5 years, 9 months ago by cjhopman
Modified:
5 years, 8 months ago
Reviewers:
nyquist, jam
CC:
chromium-reviews
Base URL:
https://chromium.googlesource.com/chromium/src.git@master
Target Ref:
refs/pending/heads/master
Project:
chromium
Visibility:
Public.

Description

Add calculation of derived features for distillable page model We've trained a simple model for detecting non-mobile distillable pages based on a set of features derived from a webpage. This change adds the main part of deriving those features (a bunch of values are calculated based on some core features of the web page). It does not add the extraction of those core features from a page. A test is added that uses json-formatted core features and then json-formatted output of the training pipelines derived feature calculation. It then ensures that the calculation here matches that in the training pipeline. BUG=471439 Committed: https://crrev.com/54caa7821b5434ed5c410fe72161e5aa0302366f Cr-Commit-Position: refs/heads/master@{#323554}

Patch Set 1 #

Patch Set 2 : GN #

Patch Set 3 : Calculate paths correctly #

Total comments: 6

Patch Set 4 : #

Patch Set 5 : Add OWNERS in components/test/data/dom_distiller #

Patch Set 6 : initialize vars #

Patch Set 7 : rebase #

Unified diffs Side-by-side diffs Delta from patch set Stats (+498 lines, -1 line) Patch
M components/components_tests.gyp View 1 2 3 4 5 6 1 chunk +1 line, -0 lines 0 comments Download
M components/dom_distiller.gypi View 1 2 3 4 5 6 2 chunks +3 lines, -0 lines 0 comments Download
M components/dom_distiller/DEPS View 1 chunk +1 line, -0 lines 0 comments Download
M components/dom_distiller/core/BUILD.gn View 1 3 chunks +4 lines, -0 lines 0 comments Download
A components/dom_distiller/core/page_features.h View 1 chunk +38 lines, -0 lines 0 comments Download
A components/dom_distiller/core/page_features.cc View 1 2 3 4 5 1 chunk +165 lines, -0 lines 0 comments Download
A components/dom_distiller/core/page_features_unittest.cc View 1 2 3 1 chunk +91 lines, -0 lines 0 comments Download
A + components/test/data/dom_distiller/OWNERS View 1 2 3 4 1 chunk +0 lines, -1 line 0 comments Download
A components/test/data/dom_distiller/core_features.json View 1 chunk +1 line, -0 lines 0 comments Download
A components/test/data/dom_distiller/derived_features.json View 1 chunk +194 lines, -0 lines 0 comments Download

Messages

Total messages: 26 (12 generated)
cjhopman
nyquist: *
5 years, 9 months ago (2015-03-28 02:19:07 UTC) #2
nyquist
lgtm. I could only find one discrepancy between the C++ and the Python code. https://codereview.chromium.org/1042053003/diff/40001/components/dom_distiller/core/page_features.cc ...
5 years, 8 months ago (2015-04-01 01:19:14 UTC) #3
cjhopman
https://codereview.chromium.org/1042053003/diff/40001/components/dom_distiller/core/page_features.cc File components/dom_distiller/core/page_features.cc (right): https://codereview.chromium.org/1042053003/diff/40001/components/dom_distiller/core/page_features.cc#newcode76 components/dom_distiller/core/page_features.cc:76: features.push_back(Contains(".phpbb", path)); On 2015/04/01 01:19:14, nyquist wrote: > The ...
5 years, 8 months ago (2015-04-01 20:12:14 UTC) #4
commit-bot: I haz the power
Dry run: CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/patch-status/1042053003/80001
5 years, 8 months ago (2015-04-01 20:12:54 UTC) #7
cjhopman
jam@: for OWNERS of third_party/re2 (added to components/dom_distiller/DEPS)
5 years, 8 months ago (2015-04-01 20:13:05 UTC) #9
commit-bot: I haz the power
Try jobs failed on following builders: chromium_presubmit on tryserver.chromium.linux (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.linux/builders/chromium_presubmit/builds/53399)
5 years, 8 months ago (2015-04-01 20:22:58 UTC) #11
commit-bot: I haz the power
Dry run: CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/patch-status/1042053003/100001
5 years, 8 months ago (2015-04-01 22:49:06 UTC) #14
commit-bot: I haz the power
Try jobs failed on following builders: chromium_presubmit on tryserver.chromium.linux (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.linux/builders/chromium_presubmit/builds/53459)
5 years, 8 months ago (2015-04-01 23:01:08 UTC) #16
jam
On 2015/04/01 20:13:05, cjhopman wrote: > jam@: for OWNERS of third_party/re2 (added to components/dom_distiller/DEPS) rubberstamp ...
5 years, 8 months ago (2015-04-02 16:29:09 UTC) #17
commit-bot: I haz the power
CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/patch-status/1042053003/100001
5 years, 8 months ago (2015-04-02 16:30:56 UTC) #19
commit-bot: I haz the power
Try jobs failed on following builders: chromium_presubmit on tryserver.chromium.linux (JOB_FAILED, http://build.chromium.org/p/tryserver.chromium.linux/builders/chromium_presubmit/builds/53712)
5 years, 8 months ago (2015-04-02 16:42:36 UTC) #21
commit-bot: I haz the power
CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/patch-status/1042053003/120001
5 years, 8 months ago (2015-04-02 16:59:43 UTC) #24
commit-bot: I haz the power
Committed patchset #7 (id:120001)
5 years, 8 months ago (2015-04-02 20:27:09 UTC) #25
commit-bot: I haz the power
5 years, 8 months ago (2015-04-03 20:28:42 UTC) #26
Message was sent while issue was closed.
Patchset 7 (id:??) landed as
https://crrev.com/54caa7821b5434ed5c410fe72161e5aa0302366f
Cr-Commit-Position: refs/heads/master@{#323554}

Powered by Google App Engine
This is Rietveld 408576698