Issue 10165018: Apply a policy to the renderer.

Chris Evans

This is working reasonably locally. I'm soliciting suggestions on whether we want to land it ...

8 years, 8 months ago (2012-04-20 22:20:27 UTC) #1

Jorge Lucangeli Obes

I told cevans that if it were up to me I'd let GPU and Flash ...

8 years, 8 months ago (2012-04-20 22:27:45 UTC) #2

Jorge Lucangeli Obes

On 2012/04/20 22:27:45, Jorge Lucangeli Obes wrote: > I told cevans that if it were ...

8 years, 8 months ago (2012-04-20 22:42:54 UTC) #3

jln (very slow on Chromium)

Not specific to this new patch, but I don't like the "undefined behavior" of the ...

8 years, 8 months ago (2012-04-21 02:19:19 UTC) #4

cevans

On Fri, Apr 20, 2012 at 7:19 PM, <jln@chromium.org> wrote: > Not specific to this ...

8 years, 8 months ago (2012-04-21 05:36:57 UTC) #5

jln (very slow on Chromium)

On 2012/04/21 05:36:57, cevans wrote: > On Fri, Apr 20, 2012 at 7:19 PM, <mailto:jln@chromium.org> ...

8 years, 8 months ago (2012-04-23 18:27:11 UTC) #6

Will Drewry

On 2012/04/23 18:27:11, Julien Tinnes wrote: > On 2012/04/21 05:36:57, cevans wrote: > > On ...

8 years, 8 months ago (2012-04-23 18:35:53 UTC) #7

cevans

On Mon, Apr 23, 2012 at 11:27 AM, <jln@chromium.org> wrote: > On 2012/04/21 05:36:57, cevans ...

8 years, 8 months ago (2012-04-23 20:14:04 UTC) #8

cevans

Anyway -- today is the deadline if we want to land this for the next ...

8 years, 8 months ago (2012-04-23 20:15:31 UTC) #9

Anyway -- today is the deadline if we want to land this for the next dev
channel (the final pre-Precise dev channel ;-)

To give it a shot or not, that is the question.

On Mon, Apr 23, 2012 at 1:14 PM, Chris Evans <cevans@google.com> wrote:

> On Mon, Apr 23, 2012 at 11:27 AM, <jln@chromium.org> wrote:
>
>> On 2012/04/21 05:36:57, cevans wrote:
>>
>>  On Fri, Apr 20, 2012 at 7:19 PM, <mailto:jln@chromium.org> wrote:
>>>
>>
>>  > Not specific to this new patch, but I don't like the "undefined
>>> behavior"
>>> > of the
>>> > SIG_SYS handler. Any chance we could limit this to Debug ?
>>> >
>>>
>>
>>  Could you be a little more specific?
>>>
>>
>> I realize that the current SIGSYS_Handler is handy, but I would prefer the
>> current behavior to be limited to DEBUG builds.
>>
>> The current behavior is technically undefined, we write '0' to some
>> computed
>> address, there is no saying what this will do. This could very well tell
>> another
>> thread to immediately start Rick-rolling the user (even before the last
>> part of
>> the function might signal the process).
>>
>> Even the last part of the function is technically undefined. There is no
>> saying
>> what is at address zero (mmap_min_addr is not mandatory).
>> Something a little better for now, perhaps, could be to use a non
>> canonical 64
>> bits address to lead to a certain GPF from the processor. But even that
>> is not
>> ideal because the behavior could change in newer processors.
>>
>> Or, instead of limiting to DEBUG, I wonder if it wouldn't be cleaner to
>> bite the
>> bullet and write some simple assembly here. Set SP to a value that leaks
>> useful
>> information and then cause a GPF.
>>
>
> Right, I think my longer term plan (once we've got near-zero SIGSYS events
> in the dev channel population) is to simply use asm to set rax, rbx, rcx
> etc. to values of interest. And then raise SIGABRT.
>
> I think the short-term value is worth the ugliness of the dereference. I
> know it's undefined but practically, I have faith that will do what I want.
>
>
>>
https://chromiumcodereview.**appspot.com/10165018/<https://chromiumcodereview...
>>
>
>

jln (very slow on Chromium)

On Mon, Apr 23, 2012 at 1:15 PM, Chris Evans <cevans@google.com> wrote: > Anyway -- ...

8 years, 8 months ago (2012-04-23 21:12:05 UTC) #10

cevans

On Mon, Apr 23, 2012 at 2:12 PM, Julien Tinnes <jln@chromium.org> wrote: > On Mon, ...

8 years, 8 months ago (2012-04-23 21:40:51 UTC) #11

On Mon, Apr 23, 2012 at 2:12 PM, Julien Tinnes <jln@chromium.org> wrote:

> On Mon, Apr 23, 2012 at 1:15 PM, Chris Evans <cevans@google.com> wrote:
> > Anyway -- today is the deadline if we want to land this for the next dev
> > channel (the final pre-Precise dev channel ;-)
> >
> > To give it a shot or not, that is the question.
>
> Is there a good way to get this into the Dev channel to get all the
> interesting and useful data, but not merge it to beta / stable ?

Not sure what you mean by "merge", so I'll reply generally:

If we land it now, it's very early in the M20 dev cycle. We'd be looking at
over 2 months before it hit stable. If we didn't have confidence in
stability by then, we could stick the feature behind a new flag
--enable-seccomp-filter-sandbox.

I
> think it would be ideal. We're "lucky" in a way that for now this
> patch will be limited to one distribution, so we'll get a fairly
> consistent set of users and shouldn't expect too many crazy failures.
>

FWIW, I believe our Linux userbase is _dominated_ by Ubuntu users. Isn't it
something like 90%?

We would be limiting ourselves to just the most recent version -- 12.04,
but isn't Ubuntu upgrade / update rate pretty high?

The other bucket is 32-bit vs. 64-bit. We'd only be hitting 64-bit users,
which I believe is about a third of Ubuntu users?

> But fore reasons I outlined elsewhere, I think we should fail more
> gracefully on violations (with errno) and let the program have a
> chance to handle this gracefully (probably log something, cleanup and
> exit()). But perhaps this can be an incremental change.
>

I don't think this is a good idea. If we start errno-failing weird syscalls
in corner-case code, I expect things might limp along but sometimes with
degraded or broken functionality. We'd then be relying on users to notice
and file the broken behaviour. On the contrary, a sad tab is a clear
problem signal and also a signal that we can look for in crash dumps.

jln (very slow on Chromium)

On Mon, Apr 23, 2012 at 2:40 PM, Chris Evans <cevans@google.com> wrote: > On Mon, ...

8 years, 8 months ago (2012-04-23 22:01:19 UTC) #12

On Mon, Apr 23, 2012 at 2:40 PM, Chris Evans <cevans@google.com> wrote:
> On Mon, Apr 23, 2012 at 2:12 PM, Julien Tinnes <jln@chromium.org> wrote:
>>
>> On Mon, Apr 23, 2012 at 1:15 PM, Chris Evans <cevans@google.com> wrote:
>> > Anyway -- today is the deadline if we want to land this for the next dev
>> > channel (the final pre-Precise dev channel ;-)
>> >
>> > To give it a shot or not, that is the question.
>>
>> Is there a good way to get this into the Dev channel to get all the
>> interesting and useful data, but not merge it to beta / stable ?
>
>
> Not sure what you mean by "merge", so I'll reply generally:
>
> If we land it now, it's very early in the M20 dev cycle. We'd be looking at
> over 2 months before it hit stable. If we didn't have confidence in
> stability by then, we could stick the feature behind a new flag
> --enable-seccomp-filter-sandbox.
>
>> I
>> think it would be ideal. We're "lucky" in a way that for now this
>> patch will be limited to one distribution, so we'll get a fairly
>> consistent set of users and shouldn't expect too many crazy failures.
>
>
> FWIW, I believe our Linux userbase is _dominated_ by Ubuntu users. Isn't it
> something like 90%?
>
> We would be limiting ourselves to just the most recent version -- 12.04, but
> isn't Ubuntu upgrade / update rate pretty high?
>
> The other bucket is 32-bit vs. 64-bit. We'd only be hitting 64-bit users,
> which I believe is about a third of Ubuntu users?
>
>>
>> But fore reasons I outlined elsewhere, I think we should fail more
>> gracefully on violations (with errno) and let the program have a
>> chance to handle this gracefully (probably log something, cleanup and
>> exit()). But perhaps this can be an incremental change.
>
>
> I don't think this is a good idea. If we start errno-failing weird syscalls
> in corner-case code, I expect things might limp along but sometimes with
> degraded or broken functionality. We'd then be relying on users to notice
> and file the broken behaviour. On the contrary, a sad tab is a clear problem
> signal and also a signal that we can look for in crash dumps.

Errno is the end result, we can always log in a SIGSYS handler and
then errno in it. Maybe we can think of a way to let users notice and
report those logs to us.

In DEBUG mode we always should log. But for release Chrome on generic
Linux, it's dangerous to just make a process disappear. We should let
well written code downgrade gracefully on denied syscalls. Otherwise
things will start crashing randomly after various libraries upgrades.
There are even people out there who use Debian unstable. We'll have
breakage from glibc upgrades and whatnot.

Think about what we're doing here. We're breaking / changing a well
known API (kernel interface) and instead of returning an error ("nope,
sorry you can't do that"), we're just killing stuff in the hope that
we'll play catch-up and fix everything in time.

We should ship a policy that we can expect to work everywhere,
including when codepaths that weren't exercised before are exercised.
It's an illusion to believe that our testing will be comprehensive
enough that we'll catch every possible sandbox violation fast enough
to write a fix. Graceful downgrade is a necessity if we don't want to
alienate our users.

Julien

cevans

On Mon, Apr 23, 2012 at 3:01 PM, Julien Tinnes <jln@chromium.org> wrote: > On Mon, ...

8 years, 8 months ago (2012-04-23 22:22:52 UTC) #13

On Mon, Apr 23, 2012 at 3:01 PM, Julien Tinnes <jln@chromium.org> wrote:

> On Mon, Apr 23, 2012 at 2:40 PM, Chris Evans <cevans@google.com> wrote:
> > On Mon, Apr 23, 2012 at 2:12 PM, Julien Tinnes <jln@chromium.org> wrote:
> >>
> >> On Mon, Apr 23, 2012 at 1:15 PM, Chris Evans <cevans@google.com> wrote:
> >> > Anyway -- today is the deadline if we want to land this for the next
> dev
> >> > channel (the final pre-Precise dev channel ;-)
> >> >
> >> > To give it a shot or not, that is the question.
> >>
> >> Is there a good way to get this into the Dev channel to get all the
> >> interesting and useful data, but not merge it to beta / stable ?
> >
> >
> > Not sure what you mean by "merge", so I'll reply generally:
> >
> > If we land it now, it's very early in the M20 dev cycle. We'd be looking
> at
> > over 2 months before it hit stable. If we didn't have confidence in
> > stability by then, we could stick the feature behind a new flag
> > --enable-seccomp-filter-sandbox.
> >
> >> I
> >> think it would be ideal. We're "lucky" in a way that for now this
> >> patch will be limited to one distribution, so we'll get a fairly
> >> consistent set of users and shouldn't expect too many crazy failures.
> >
> >
> > FWIW, I believe our Linux userbase is _dominated_ by Ubuntu users. Isn't
> it
> > something like 90%?
> >
> > We would be limiting ourselves to just the most recent version -- 12.04,
> but
> > isn't Ubuntu upgrade / update rate pretty high?
> >
> > The other bucket is 32-bit vs. 64-bit. We'd only be hitting 64-bit users,
> > which I believe is about a third of Ubuntu users?
> >
> >>
> >> But fore reasons I outlined elsewhere, I think we should fail more
> >> gracefully on violations (with errno) and let the program have a
> >> chance to handle this gracefully (probably log something, cleanup and
> >> exit()). But perhaps this can be an incremental change.
> >
> >
> > I don't think this is a good idea. If we start errno-failing weird
> syscalls
> > in corner-case code, I expect things might limp along but sometimes with
> > degraded or broken functionality. We'd then be relying on users to notice
> > and file the broken behaviour. On the contrary, a sad tab is a clear
> problem
> > signal and also a signal that we can look for in crash dumps.
>
> Errno is the end result, we can always log in a SIGSYS handler and
> then errno in it.


It's easy to make off-hand suggestions starting "we can always" but the
complexities are awful:

1) The thought of trying to "log" safely in the async sig handler context
gives me nightmares.
2) We need the stack trace for a useful report. I don't know of a useful
framework to collect / report that outside of the crash handling.

Maybe we can think of a way to let users notice and
> report those logs to us.
>

What did you have in mind? Users will notice a sad tab.


> In DEBUG mode we always should log. But for release Chrome on generic
> Linux, it's dangerous to just make a process disappear.


I believe quite the opposite: it's dangerous to permit the process to
continue. We'd be continuing it in a highly unusual state, including a
failure / errno that the surrounding code might never have encountered
before.

We should let
> well written code downgrade gracefully on denied syscalls.


In an ideal world yes. We live far from an ideal world, including for
example no control over glibc and other system libraries.

Otherwise
> things will start crashing randomly after various libraries upgrades.
> There are even people out there who use Debian unstable. We'll have
> breakage from glibc upgrades and whatnot.
>
> Think about what we're doing here. We're breaking / changing a well
> known API (kernel interface) and instead of returning an error ("nope,
> sorry you can't do that"), we're just killing stuff in the hope that
> we'll play catch-up and fix everything in time.
>
> We should ship a policy that we can expect to work everywhere,
> including when codepaths that weren't exercised before are exercised.
> It's an illusion to believe that our testing will be comprehensive
> enough that we'll catch every possible sandbox violation fast enough
> to write a fix. Graceful downgrade is a necessity if we don't want to
> alienate our users.
>

You make a compelling argument. Perhaps a debug vs. opt build difference
would be worth trying for M21. I believe we want at least one stable
release with the SIGSYS -> SIGSEGV behaviour, to have confidence in broad
testing of corner-case functionality.


> Julien
>

jln (very slow on Chromium)

On Mon, Apr 23, 2012 at 3:22 PM, Chris Evans <cevans@google.com> wrote: >> We should ...

8 years, 8 months ago (2012-04-23 23:11:37 UTC) #14

On Mon, Apr 23, 2012 at 3:22 PM, Chris Evans <cevans@google.com> wrote:

>> We should let
>> well written code downgrade gracefully on denied syscalls.

> In an ideal world yes. We live far from an ideal world, including for
> example no control over glibc and other system libraries.

Indeed. Which is why I think expecting them to use a specific subset
of system calls that will never change is dangerous.

I think we both agree things will break on libraries changes and
updates, don't we?

The discussion in on how to best react to those cases: errno or crash.
Crash makes it easier to get some of the information we want and
notify the user. Errno makes it more likely that things will continue
to work and that we won't annoy users, and with some non trivial
engineering effort could still let us to notify and log.

>> Otherwise
>> things will start crashing randomly after various libraries upgrades.
>> There are even people out there who use Debian unstable. We'll have
>> breakage from glibc upgrades and whatnot.
>>
>> Think about what we're doing here. We're breaking / changing a well
>> known API (kernel interface) and instead of returning an error ("nope,
>> sorry you can't do that"), we're just killing stuff in the hope that
>> we'll play catch-up and fix everything in time.
>>
>> We should ship a policy that we can expect to work everywhere,
>> including when codepaths that weren't exercised before are exercised.
>> It's an illusion to believe that our testing will be comprehensive
>> enough that we'll catch every possible sandbox violation fast enough
>> to write a fix. Graceful downgrade is a necessity if we don't want to
>> alienate our users.
>
>
> You make a compelling argument. Perhaps a debug vs. opt build difference
> would be worth trying for M21. I believe we want at least one stable release
> with the SIGSYS -> SIGSEGV behaviour, to have confidence in broad testing of
> corner-case functionality.

Given that this will only have an effect on Ubuntu 12.04 users for now
(i.e. one well defined Linux distribution that we can test on) and
that we desperately need more data, I agree that lading this today is
the right choice.

But we should work on have a better story in the upcoming months,
either by white listing only a specific Linux distribution for this
feature to be enabled by default, or by allowing better recovery on
(unexpected) denied system calls.

Julien

cevans

On Mon, Apr 23, 2012 at 4:11 PM, Julien Tinnes <jln@chromium.org> wrote: > On Mon, ...

8 years, 8 months ago (2012-04-23 23:16:13 UTC) #15

On Mon, Apr 23, 2012 at 4:11 PM, Julien Tinnes <jln@chromium.org> wrote:

> On Mon, Apr 23, 2012 at 3:22 PM, Chris Evans <cevans@google.com> wrote:
>
> >> We should let
> >> well written code downgrade gracefully on denied syscalls.
>
> > In an ideal world yes. We live far from an ideal world, including for
> > example no control over glibc and other system libraries.
>
> Indeed. Which is why I think expecting them to use a specific subset
> of system calls that will never change is dangerous.
>
> I think we both agree things will break on libraries changes and
> updates, don't we?
>

Yep.


> The discussion in on how to best react to those cases: errno or crash.
> Crash makes it easier to get some of the information we want and
> notify the user. Errno makes it more likely that things will continue
> to work and that we won't annoy users, and with some non trivial
> engineering effort could still let us to notify and log.
>

I am not a fan of non-trivial engineering effort :) Luckily, I think you've
suggested a couple of different trivial suggestions now.


> >> Otherwise
> >> things will start crashing randomly after various libraries upgrades.
> >> There are even people out there who use Debian unstable. We'll have
> >> breakage from glibc upgrades and whatnot.
> >>
> >> Think about what we're doing here. We're breaking / changing a well
> >> known API (kernel interface) and instead of returning an error ("nope,
> >> sorry you can't do that"), we're just killing stuff in the hope that
> >> we'll play catch-up and fix everything in time.
> >>
> >> We should ship a policy that we can expect to work everywhere,
> >> including when codepaths that weren't exercised before are exercised.
> >> It's an illusion to believe that our testing will be comprehensive
> >> enough that we'll catch every possible sandbox violation fast enough
> >> to write a fix. Graceful downgrade is a necessity if we don't want to
> >> alienate our users.
> >
> >
> > You make a compelling argument. Perhaps a debug vs. opt build difference
> > would be worth trying for M21. I believe we want at least one stable
> release
> > with the SIGSYS -> SIGSEGV behaviour, to have confidence in broad
> testing of
> > corner-case functionality.
>
> Given that this will only have an effect on Ubuntu 12.04 users for now
> (i.e. one well defined Linux distribution that we can test on) and
> that we desperately need more data, I agree that lading this today is
> the right choice.
>

Ok. Will / Kees / Jorge?


> But we should work on have a better story in the upcoming months,
> either by white listing only a specific Linux distribution for this
> feature to be enabled by default,


That's an interesting idea. I wonder what distributions we claim to
support? Ubuntu / Fedora? Anything else? We'd probably be looking at
specific versions of specific distributions.

or by allowing better recovery on
> (unexpected) denied system calls.
>
> Julien
>

Kees Cook

On Mon, Apr 23, 2012 at 4:16 PM, Chris Evans <cevans@google.com> wrote: > On Mon, ...

8 years, 8 months ago (2012-04-23 23:21:16 UTC) #16

On Mon, Apr 23, 2012 at 4:16 PM, Chris Evans <cevans@google.com> wrote:
> On Mon, Apr 23, 2012 at 4:11 PM, Julien Tinnes <jln@chromium.org> wrote:
>>
>> On Mon, Apr 23, 2012 at 3:22 PM, Chris Evans <cevans@google.com> wrote:
>>
>> >> We should let
>> >> well written code downgrade gracefully on denied syscalls.
>>
>> > In an ideal world yes. We live far from an ideal world, including for
>> > example no control over glibc and other system libraries.
>>
>> Indeed. Which is why I think expecting them to use a specific subset
>> of system calls that will never change is dangerous.
>>
>> I think we both agree things will break on libraries changes and
>> updates, don't we?
>
>
> Yep.
>
>>
>> The discussion in on how to best react to those cases: errno or crash.
>> Crash makes it easier to get some of the information we want and
>> notify the user. Errno makes it more likely that things will continue
>> to work and that we won't annoy users, and with some non trivial
>> engineering effort could still let us to notify and log.
>
>
> I am not a fan of non-trivial engineering effort :) Luckily, I think you've
> suggested a couple of different trivial suggestions now.
>
>>
>> >> Otherwise
>> >> things will start crashing randomly after various libraries upgrades.
>> >> There are even people out there who use Debian unstable. We'll have
>> >> breakage from glibc upgrades and whatnot.
>> >>
>> >> Think about what we're doing here. We're breaking / changing a well
>> >> known API (kernel interface) and instead of returning an error ("nope,
>> >> sorry you can't do that"), we're just killing stuff in the hope that
>> >> we'll play catch-up and fix everything in time.
>> >>
>> >> We should ship a policy that we can expect to work everywhere,
>> >> including when codepaths that weren't exercised before are exercised.
>> >> It's an illusion to believe that our testing will be comprehensive
>> >> enough that we'll catch every possible sandbox violation fast enough
>> >> to write a fix. Graceful downgrade is a necessity if we don't want to
>> >> alienate our users.
>> >
>> >
>> > You make a compelling argument. Perhaps a debug vs. opt build difference
>> > would be worth trying for M21. I believe we want at least one stable
>> > release
>> > with the SIGSYS -> SIGSEGV behaviour, to have confidence in broad
>> > testing of
>> > corner-case functionality.
>>
>> Given that this will only have an effect on Ubuntu 12.04 users for now
>> (i.e. one well defined Linux distribution that we can test on) and
>> that we desperately need more data, I agree that lading this today is
>> the right choice.
>
> Ok. Will / Kees / Jorge?

I would agree; let's get it landed. Ubuntu 12.04's libraries aren't
going to change much, so I think the risk of not using the errno
method is minimal.

That said, I still prefer the kill method on platforms we directly
control (Chrome OS).

>> But we should work on have a better story in the upcoming months,
>> either by white listing only a specific Linux distribution for this
>> feature to be enabled by default,
>
> That's an interesting idea. I wonder what distributions we claim to support?
> Ubuntu / Fedora? Anything else? We'd probably be looking at specific
> versions of specific distributions.

In theory, any distro with a 3.5 kernel. I'm only aware of Ubuntu
carrying the backport of seccomp_bpf.

-Kees

-- 
Kees Cook
Chrome OS Security

jln (very slow on Chromium)

On Mon, Apr 23, 2012 at 4:16 PM, Chris Evans <cevans@google.com> wrote: > That's an ...

8 years, 8 months ago (2012-04-23 23:22:32 UTC) #17

cevans

On Mon, Apr 23, 2012 at 4:21 PM, Kees Cook <keescook@chromium.org> wrote: > On Mon, ...

8 years, 8 months ago (2012-04-23 23:26:28 UTC) #18

On Mon, Apr 23, 2012 at 4:21 PM, Kees Cook <keescook@chromium.org> wrote:

> On Mon, Apr 23, 2012 at 4:16 PM, Chris Evans <cevans@google.com> wrote:
> > On Mon, Apr 23, 2012 at 4:11 PM, Julien Tinnes <jln@chromium.org> wrote:
> >>
> >> On Mon, Apr 23, 2012 at 3:22 PM, Chris Evans <cevans@google.com> wrote:
> >>
> >> >> We should let
> >> >> well written code downgrade gracefully on denied syscalls.
> >>
> >> > In an ideal world yes. We live far from an ideal world, including for
> >> > example no control over glibc and other system libraries.
> >>
> >> Indeed. Which is why I think expecting them to use a specific subset
> >> of system calls that will never change is dangerous.
> >>
> >> I think we both agree things will break on libraries changes and
> >> updates, don't we?
> >
> >
> > Yep.
> >
> >>
> >> The discussion in on how to best react to those cases: errno or crash.
> >> Crash makes it easier to get some of the information we want and
> >> notify the user. Errno makes it more likely that things will continue
> >> to work and that we won't annoy users, and with some non trivial
> >> engineering effort could still let us to notify and log.
> >
> >
> > I am not a fan of non-trivial engineering effort :) Luckily, I think
> you've
> > suggested a couple of different trivial suggestions now.
> >
> >>
> >> >> Otherwise
> >> >> things will start crashing randomly after various libraries upgrades.
> >> >> There are even people out there who use Debian unstable. We'll have
> >> >> breakage from glibc upgrades and whatnot.
> >> >>
> >> >> Think about what we're doing here. We're breaking / changing a well
> >> >> known API (kernel interface) and instead of returning an error
> ("nope,
> >> >> sorry you can't do that"), we're just killing stuff in the hope that
> >> >> we'll play catch-up and fix everything in time.
> >> >>
> >> >> We should ship a policy that we can expect to work everywhere,
> >> >> including when codepaths that weren't exercised before are exercised.
> >> >> It's an illusion to believe that our testing will be comprehensive
> >> >> enough that we'll catch every possible sandbox violation fast enough
> >> >> to write a fix. Graceful downgrade is a necessity if we don't want to
> >> >> alienate our users.
> >> >
> >> >
> >> > You make a compelling argument. Perhaps a debug vs. opt build
> difference
> >> > would be worth trying for M21. I believe we want at least one stable
> >> > release
> >> > with the SIGSYS -> SIGSEGV behaviour, to have confidence in broad
> >> > testing of
> >> > corner-case functionality.
> >>
> >> Given that this will only have an effect on Ubuntu 12.04 users for now
> >> (i.e. one well defined Linux distribution that we can test on) and
> >> that we desperately need more data, I agree that lading this today is
> >> the right choice.
> >
> > Ok. Will / Kees / Jorge?
>
> I would agree; let's get it landed. Ubuntu 12.04's libraries aren't
> going to change much, so I think the risk of not using the errno
> method is minimal.
>

Ok. Not sure I have an actual LGTM from anyone; I'd like to collect at
least two :P


> That said, I still prefer the kill method on platforms we directly
> control (Chrome OS).
>

Good point. We can ifdef that cleanly in a future CL.


> >> But we should work on have a better story in the upcoming months,
> >> either by white listing only a specific Linux distribution for this
> >> feature to be enabled by default,
> >
> > That's an interesting idea. I wonder what distributions we claim to
> support?
> > Ubuntu / Fedora? Anything else? We'd probably be looking at specific
> > versions of specific distributions.
>
> In theory, any distro with a 3.5 kernel. I'm only aware of Ubuntu
> carrying the backport of seccomp_bpf.
>
> -Kees
>
> --
> Kees Cook
> Chrome OS Security
>

cevans

On Mon, Apr 23, 2012 at 4:35 PM, <keescook@chromium.org> wrote: > lgtm > And now ...

8 years, 8 months ago (2012-04-23 23:38:47 UTC) #21

Jorge Lucangeli Obes

Kees: I like the KILL way, and I agree we should do that in Chrome ...

8 years, 8 months ago (2012-04-23 23:39:49 UTC) #22

Kees Cook

On Mon, Apr 23, 2012 at 4:38 PM, Chris Evans <cevans@google.com> wrote: > On Mon, ...

8 years, 8 months ago (2012-04-23 23:39:57 UTC) #23

rvargas (doing something else)

Please fix the description before landing (and ideally link to a bug)

8 years, 8 months ago (2012-04-24 00:38:34 UTC) #24

cevans

On Mon, Apr 23, 2012 at 5:38 PM, <rvargas@chromium.org> wrote: > Please fix the description ...

8 years, 8 months ago (2012-04-24 00:41:52 UTC) #25

rvargas (doing something else)

On 2012/04/24 00:41:52, cevans wrote: > On Mon, Apr 23, 2012 at 5:38 PM, <mailto:rvargas@chromium.org> ...

8 years, 8 months ago (2012-04-24 00:54:12 UTC) #26

Markus (顧孟勤)

This might be acceptable for CrOS, and it is certainly SOP for Google3, but please ...

8 years, 8 months ago (2012-04-24 01:29:48 UTC) #27

This might be acceptable for CrOS, and it is certainly SOP for Google3, but
please do not enable this behavior in Chrome in general. This is just not a
good idea for client-software. We have already had way too many code-yellow
for excessive crashes, I don't want you to contribute to another
code-yellow.

Crashing on error is never a good idea in release builds, and it is only
rarely a good idea in development builds -- and we do have macros that help
with this, if you decide you absolutely have to crash on error; they do the
right thing and allow you to attach a debugger. That's a lot more likely to
result in somebody fixing the problem than KILLing the process without any
hope of ever debugging.

In general, you have to assume that anything that can go strange and funny
will do so on some users machine. People will have non-standard libraries,
non-standard system-wide LD_PRELOAD environments, non-standard graphics
drivers, non-standard kernels, non-standard distributions, ... Recover from
the error and continue as best you can. Reporting a valid "errno" value is
often a great way to do so.

This also means you don't want to make the list of allowed system calls too
narrow. The exact system call will differ from machine to machine.

Markus

P.S.: In general, I am somewhat saddened by the fact that in an effort to
rush out a new sandbox, we are ignoring all the experience that we gained
in the last two years with carefully designing and rolling out a sandbox
that is a) secure, and b) minimizes how much it gets into people's way.

On Mon, Apr 23, 2012 at 16:39, <jorgelo@chromium.org> wrote:

> Kees: I like the KILL way, and I agree we should do that in Chrome OS
> since we
> control the platform.
>
> Julien: I agree with the points you make, but I'm slightly wary of
> returning
> errno everywhere. This is a discussion we should continue, especially when
> we
> get the data that this patch will provide.
>
> lgtm
>
>
https://chromiumcodereview.**appspot.com/10165018/<https://chromiumcodereview...
>

jln (very slow on Chromium)

On Mon, Apr 23, 2012 at 4:38 PM, Chris Evans <cevans@google.com> wrote: > On Mon, ...

8 years, 8 months ago (2012-04-24 04:36:53 UTC) #28

cevans

On Mon, Apr 23, 2012 at 9:36 PM, Julien Tinnes <jln@chromium.org> wrote: > On Mon, ...

8 years, 8 months ago (2012-04-24 06:16:07 UTC) #29

cevans

On Mon, Apr 23, 2012 at 11:16 PM, Chris Evans <cevans@google.com> wrote: > On Mon, ...

8 years, 8 months ago (2012-04-26 17:40:19 UTC) #30

jln (very slow on Chromium)

8 years, 8 months ago (2012-04-26 17:47:38 UTC) #31

On Thu, Apr 26, 2012 at 10:40 AM, Chris Evans <cevans@google.com> wrote:
> On Mon, Apr 23, 2012 at 11:16 PM, Chris Evans <cevans@google.com> wrote:
>>
>> On Mon, Apr 23, 2012 at 9:36 PM, Julien Tinnes <jln@chromium.org> wrote:
>>>
>>> On Mon, Apr 23, 2012 at 4:38 PM, Chris Evans <cevans@google.com> wrote:
>>> > On Mon, Apr 23, 2012 at 4:35 PM, <keescook@chromium.org> wrote:
>>> >>
>>> >> lgtm
>>> >
>>> >
>>> > And now that I have these lgtm's I'm somehow nervous to land it ;-)
>>>
>>> Also, somewhat of a detail but I've just noticed that the policy
>>> doesn't allow sigreturn. sigreturn is part of the main kernel syscall
>>> interface and should be allowed.
>>
>>
>> I didn't land the change in the end. I'll add this before I do. Seems like
>> this syscall should be in a "baseline" "PSS"? What else would be in the
>> baseline PSS? Would you be interested in kicking off the PSS in earnest with
>> a patch for baseline PSS?
>
>
> I've decided to retire this CL for now. For M20, it seems we should focus
> on:
> - Making the GPU and Flash sandboxes stable.
> - Resolving the runtime behaviour of a failed syscall.
> - Refactoring to move the policies towards Julien's PSS idea.

Ok, I'll start working on that. I also plan to update about:sandbox
with some more information about the current state.

jln

Issue 10165018: Apply a policy to the renderer. (Closed)

Description

Patch Set 1 #

Messages