Created: 7 years, 5 months ago by scherkus (not reviewing)
Modified: 7 years, 3 months ago
CC: chromium-reviews, erikwright+watch_chromium.org
Base URL: svn://svn.chromium.org/chrome/trunk/src
Visibility: Public.
Description
Update ReadProcMaps() to reflect lack of atomicity when reading /proc/self/maps.
Reading from procfs returns at most a page-sized amount of data. If the process has a larger-than-page-sized /proc/self/maps, we cannot guarantee that the virtual memory table hasn't changed while reading the entire contents from procfs.
In addition, ReadProcMaps() now stops reading as soon as it finds a gate VMA entry, to work around a scenario where the kernel would return duplicate entries (it turns out ThreadSanitizer v2 was very good at triggering said scenario).
BUG=258451
Committed: https://src.chromium.org/viewvc/chrome?view=rev&revision=221570
Patch Set 1 #
Total comments: 1
Patch Set 2 : Use std::string::resize() #
Total comments: 5
Patch Set 3 : check for gate vma #
Total comments: 2
Patch Set 4 : #
Total comments: 4
Patch Set 5 : fixes #
Total comments: 2
Patch Set 6 : for SPEED #
Total comments: 7
Patch Set 7 : nits #
Messages
Total messages: 22 (0 generated)
glider: here's my take at it
mark: base/ OWNERS + peanut gallery

I believe the real culprit was doing allocations while reading. Both fixes feel a bit too magical for my liking, but perhaps that's the nature of the game when dealing with procfs :(

PS#1 uses read() as suggested in the bug; however, I noticed we have to loop over read() until it returns zero. For example, on Android it would only return ~4k bytes at a time as opposed to the entire contents of /proc/self/maps. (FYI I'm a noob when it comes to the low-level family of functions (read() et al.), so it's possible I'm Doing It Wrong(tm))

PS#2 uses std::string::resize() to accomplish a similar goal by preallocating enough space in the std::string in an attempt to avoid reallocations. The downside is we might return bad data if the std::string ended up reallocating. We could inspect the resulting capacity of the string ... but that feels a bit gross.

Both methods passed under tsan v2 for 5000+ iterations:

./out/Debug/base_unittests --gtest_repeat=-1 --gtest_break_on_failure --gtest_filter=ProcMapsTest.ReadProcMaps
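For reference, the read()-until-zero loop from PS#1 looks roughly like this. This is a sketch, not the actual patch; the function name, buffer size, and error handling are assumptions. It is also exactly the non-atomic pattern under discussion, since the VMA tree can change between loop iterations:

```cpp
#include <fcntl.h>
#include <unistd.h>
#include <cerrno>
#include <string>

// Sketch of the PS#1 approach: loop over read() until it returns 0.
// On Android each read() of /proc/self/maps returned only ~4 KiB.
bool ReadProcMapsChunked(std::string* proc_maps) {
  const int fd = open("/proc/self/maps", O_RDONLY);
  if (fd < 0)
    return false;

  proc_maps->clear();
  char buffer[4096];
  for (;;) {
    ssize_t bytes_read = read(fd, buffer, sizeof(buffer));
    if (bytes_read < 0) {
      if (errno == EINTR)
        continue;  // Interrupted by a signal; retry this read.
      close(fd);
      return false;
    }
    if (bytes_read == 0)
      break;  // EOF.
    proc_maps->append(buffer, bytes_read);
  }
  close(fd);
  return true;
}
```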
https://codereview.chromium.org/18661009/diff/2001/base/debug/proc_maps_linux.cc
File base/debug/proc_maps_linux.cc (right):

https://codereview.chromium.org/18661009/diff/2001/base/debug/proc_maps_linux...
base/debug/proc_maps_linux.cc:28: enum { kLargeProcMapsSize = 256 * 1024 };
Why an enum? Why not a const? And why is it out here? You can put it in ReadProcMaps, since that’s the only place it’s used.

https://codereview.chromium.org/18661009/diff/2001/base/debug/proc_maps_linux...
base/debug/proc_maps_linux.cc:37: proc_maps->reserve(kLargeProcMapsSize);
How is this supposed to help…?

https://codereview.chromium.org/18661009/diff/2001/base/debug/proc_maps_linux...
base/debug/proc_maps_linux.cc:40: return file_util::ReadFileToString(proc_maps_path, proc_maps);
…file_util::ReadFileToString still reads chunks of at most 64kB at a time. It doesn’t care about the size of the “contents” buffer. All the “reserve” does is make sure that the std::string::append calls don’t result in internal reallocations until at least 256kB has been read. It’s not enough to avoid reallocations, you really need to gobble up the entire contents of the file in a single read operation.
https://codereview.chromium.org/18661009/diff/2001/base/debug/proc_maps_linux.cc File base/debug/proc_maps_linux.cc (right): https://codereview.chromium.org/18661009/diff/2001/base/debug/proc_maps_linux... base/debug/proc_maps_linux.cc:40: return file_util::ReadFileToString(proc_maps_path, proc_maps); Maybe base::MemoryMappedFile could be used instead? This is still non-atomic, because the size of /proc/self/maps may change between the fstat() and mmap() calls, so we may need to pass some default size to MapFileToMemoryInternal.
https://codereview.chromium.org/18661009/diff/1/base/debug/proc_maps_linux.cc File base/debug/proc_maps_linux.cc (right): https://codereview.chromium.org/18661009/diff/1/base/debug/proc_maps_linux.cc... base/debug/proc_maps_linux.cc:50: read(fd, buffer.get() + offset, kMaxProcMapsSize - offset)); There must be only one read() call, otherwise we're busted if someone fragments the memory while we're reading the file chunk by chunk.
Here’s the story…

Both patch set 1 and patch set 2 aren’t thread-safe. But they seem to fix the problem, right? Why?

Memory allocations and deallocations may wind up changing the set of memory regions mapped in a process. They don’t always, but they do sometimes. I bet that when running under the sanitizers, it’s much more likely that the mapped regions will change during any arbitrary memory allocation or deallocation. As scherkus found, minimizing the reallocations done while reading /proc/self/maps mitigates the problem.

That’s a start, but it’s not [multi-]thread-safe, it’s really only single-thread-safe. /proc/self/maps now probably won’t change while a single thread is trying to read it, assuming no other threads. Changing the allocation stuff around like this doesn’t have anything to do with avoiding the maybe-yield that could happen during an allocation. Anyway, that’s not the only opportunity that exists for a thread to yield, and it’s possible for multiple threads to execute concurrently, so none of that hand-wavey stuff is too comforting.

Since we need to fix this anyway, let’s just fix it the right way and make it thread-safe. The only real solution to thread safety here is to read the entire file in one fell swoop with a single read call. How do you know how big a buffer you need? As I pointed out previously, you can’t use fstat (and anyway, the answer might change between your fstat and your read). The only real solution is to retry your read within a loop—if you didn’t grab the whole file in one pass, discard the fragment that you did read, reallocate your buffer, lseek the maps file to rewind it back to 0, and try to read the whole thing again. But how do you know that you’ve read the entire file and that you don’t need to go back for another read? The easy answer is that if you read anything smaller than the size of the buffer, you’ve read the whole thing up to EOF. But is that signal-safe?
POSIX allows a read that has been interrupted after it has produced some data to return a short read even though the buffer hasn’t been filled completely and the file’s not at EOF. If you care about this, you need to detect whether a short read was short because it read the entire map by doing an additional 1-byte read from the file. If it returns 0, you’re at EOF, and the map you just read is good. If it returns data, you read a partial map and were interrupted by a signal, so rewind the file, go back, and try again.

But in reality, can signals interrupt a /proc read? Remember, it’s not a real file and it’s not a real filesystem. On a real file, a signal might occur during a long read while the kernel is blocked waiting for input from a disk. On /proc/self/maps, where’s the opportunity for a signal to interrupt the read? I don’t think there is any, and honestly, the whole thing seems kind of ridiculous to me. But I’m not positive, and I’m not ruling out the possibility that even though this might not be possible with current kernels, it might become possible in the future.

https://codereview.chromium.org/18661009/diff/2001/base/debug/proc_maps_linux.cc
File base/debug/proc_maps_linux.cc (right):

https://codereview.chromium.org/18661009/diff/2001/base/debug/proc_maps_linux...
base/debug/proc_maps_linux.cc:40: return file_util::ReadFileToString(proc_maps_path, proc_maps);
Alexander Potapenko wrote:
> Maybe base::MemoryMappedFile could be used instead?
> This is still non-atomic, because the size of /proc/self/maps may change between
> the fstat() and mmap() calls, so we may need to pass some default size to
> MapFileToMemoryInternal.

/proc is not a regular filesystem. There are two problems with this approach.

1. You can’t use stat (or fstat) to figure out how big the file is, because there is no file. The kernel won’t assemble any data until you try to read the file, so it doesn’t even know the answer. st_size will always be 0 for this file, and for most or all other files in /proc.

2. You can’t mmap the file, because it doesn’t implement mmap (fs/proc/task_mmu.c proc_tid_maps_operations). You’ll get ENODEV. I don’t know what semantics you would expect of mmap, anyway, since this isn’t a real file.
On 2013/07/11 14:20:40, Mark Mentovai wrote:
> Here’s the story…
> [rest of message snipped; quoted in full above]

tl;dr: read() will return at most getpagesize() bytes because that's how seq_file works. This explains the ~4K bytes I'm seeing on Android + desktop Linux.

seq_read() works something like this:
1) kmalloc(PAGE_SIZE) a buffer
2) Call seq_file->op->start()
3) While the buffer has capacity...
   a) Call seq_file->op->show()
   b) Call seq_file->op->next() to advance the iterator
4) Call seq_file->op->stop()
5) Copy the buffer into the usermode buffer
6) Return the number of bytes copied into the usermode buffer

fs/seq_file.c: seq_read() http://lxr.linux.no/linux+v2.6.37/fs/seq_file.c#L132

seq_file->op in this case is the seq_operations struct filled out by fs/proc/task_mmu.c. Each time show() is called, fs/proc/task_mmu.c records the starting address of the last memory-mapped region it printed out as a hint for the next time start() is called. What this means is that each time we're forced to call read(), fs/proc/task_mmu.c will re-look up the last address in the rb_tree of mapped regions. If the rb_tree changed at all between calls to read(), then it's possible that we'll print out duplicate/extra information.

fs/proc/task_mmu.c: m_start() http://lxr.linux.no/linux+v2.6.37/fs/proc/task_mmu.c#L109
fs/proc/task_mmu.c: show_map() http://lxr.linux.no/linux+v2.6.37/fs/proc/task_mmu.c#L278

So ... knowing what we know about seq_file, is there anything we can realistically do in the interim aside from read()'ing until EOF and hoping it doesn't change? Should this be reported upstream?
We can try to stitch together the information from several read() calls. E.g., if there's a common address range in the buffers returned by two consecutive read() calls, we can take the contents of the first buffer up to that address range and then append the contents of the second buffer from that point. When the buffer contains a "[vsyscall]" record (on x86_64) or "[stack]"/"ffffe000" (on x86), it's fine to stop. This way we may get an inconsistent mapping, but it'll only contain the mappings that existed at the time one of the read() calls occurred.

BTW, there's a limit of 32767-5 mappings in the kernel, so we could potentially preallocate a buffer that's enough to hold /proc/self/maps (although it's quite a big buffer to allocate for such a task).
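The stitching idea can be sketched with a hypothetical helper (not code from the CL): if the second chunk starts with an address range that already appears in the first chunk, cut the first chunk at that entry and append the second chunk, so the overlapping entries appear only once. This assumes each chunk begins at a line boundary, which seq_file guarantees per the whole-entry discussion later in the thread:

```cpp
#include <string>

// Hypothetical sketch of stitching two /proc/self/maps chunks at a
// common address range. The address range is everything up to the
// first space on a line, e.g. "00400000-00452000".
std::string StitchChunks(const std::string& first,
                         const std::string& second) {
  const std::string range = second.substr(0, second.find(' '));
  size_t pos;
  if (first.compare(0, range.size(), range) == 0) {
    pos = 0;  // The overlap starts at the very first entry.
  } else {
    pos = first.find("\n" + range);
    if (pos == std::string::npos)
      return first + second;  // No overlap: plain concatenation.
    ++pos;  // Skip past the '\n' we matched on.
  }
  return first.substr(0, pos) + second;
}
```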
This is starting to get gross. The best I’m able to come up with that’s thread-safe given that ridiculous seq_file limitation is to freeze the process state: fork a child and maintain a pipe between the two processes, have the child send the parent SIGSTOP, then read its parent’s /proc/<ppid>/maps in full, send the parent SIGCONT, and write the contents to its side of the pipe. See? Gross. Otherwise, I guess glider’s suggestion will at least eliminate duplicates.
On 2013/07/12 14:57:10, Mark Mentovai wrote:
> This is starting to get gross.
> [rest of message snipped; quoted in full above]

tl;dr: there's a bug on systems that use a gate VMA (e.g., [vsyscall] on x86-64, [vectors] on ARM). I'm inclined not to do anything overly fancy, but rather to stitch together what we get, stop short when we find a gate VMA, and document in ReadProcMaps() that it ain't perfect. WDYT?

LONG VERSION

The original bug was that we found multiple [stack] entries. So ... how is that possible?

I played around with glider@'s suggestion of looking for [vsyscall] as a clue that we're done. I believe there's a bug in fs/proc/task_mmu.c where it incorrectly sets its "last address" hint to 0 as opposed to -1 after it reaches the special [vsyscall] case. What this means is that if N VMA entries are added before we call read() again, we will *not* hit EOF; instead fs/proc/task_mmu.c m_start() falls into this sequential scan loop and ends up re-printing the last N entries, including [vsyscall] again: http://lxr.linux.no/linux+v2.6.37/fs/proc/task_mmu.c#L151

In fact, it's possible to make read() spit out the gate VMA every time and never hit EOF as long as you keep creating one new virtual memory region between each call to read() (!)

AFAICT, this isn't a problem on systems that don't use a gate VMA (e.g., 32-bit x86), as the last-address hint will work as intended and any new regions created with an address smaller than the last address will not show up.
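A gate VMA check in the spirit of this workaround looks roughly like the following sketch (the committed version is similar in intent; see the later review comments about matching the leading space and trailing newline to avoid false positives on pathnames):

```cpp
#include <string>

// Sketch: does proc_maps contain a gate VMA entry at or after pos?
// The gate VMA's name depends on the architecture.
bool ContainsGateVMA(const std::string& proc_maps, size_t pos) {
#if defined(__x86_64__)
  // x86-64 has a [vsyscall] gate VMA.
  return proc_maps.find(" [vsyscall]\n", pos) != std::string::npos;
#elif defined(__arm__)
  // ARM has a [vectors] gate VMA.
  return proc_maps.find(" [vectors]\n", pos) != std::string::npos;
#else
  // No gate VMA on other architectures (e.g. 32-bit x86).
  return false;
#endif
}
```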
https://codereview.chromium.org/18661009/diff/18001/base/debug/proc_maps_linu... File base/debug/proc_maps_linux.cc (right): https://codereview.chromium.org/18661009/diff/18001/base/debug/proc_maps_linu... base/debug/proc_maps_linux.cc:59: ssize_t bytes_read = HANDLE_EINTR(read(fd, buffer.get(), kBufferSize)); I wonder what we should do in the case where the last line is split into two halves. In general the subsequent read() call isn't guaranteed to contain the other half.
Also, if finding a gate VMA turns out to be a problem, maybe we could check that the address ranges grow monotonically?
https://codereview.chromium.org/18661009/diff/18001/base/debug/proc_maps_linu... File base/debug/proc_maps_linux.cc (right): https://codereview.chromium.org/18661009/diff/18001/base/debug/proc_maps_linu... base/debug/proc_maps_linux.cc:59: ssize_t bytes_read = HANDLE_EINTR(read(fd, buffer.get(), kBufferSize)); On 2013/07/15 08:17:19, Alexander Potapenko wrote: > I wonder what should we do in the case the last line is split into two halves. > In general the subsequent read() call isn't guaranteed to contain the other > half. seq_file protects against that by only allowing whole entries to be copied to usermode at a time: http://lxr.linux.no/linux+v2.6.37/fs/seq_file.c#L224 (check out lines 234-235 -- basically if the result of calling show() fills up the entire seq_file internal buffer, it resets m->count and breaks out of the loop)
Finally got back to updating this... As discussed offline and in the bug, here's the cleaned-up CL that contains all the disclaimers and gory details, and includes the gate VMA workaround to avoid the one scenario where we get duplicate entries at the end.
https://codereview.chromium.org/18661009/diff/26001/base/debug/proc_maps_linu...
File base/debug/proc_maps_linux.cc (right):

https://codereview.chromium.org/18661009/diff/26001/base/debug/proc_maps_linu...
base/debug/proc_maps_linux.cc:32: return proc_maps->find("[vectors]", pos) != std::string::npos;
Wanna look for the trailing \n too? And one leading space, since that’ll always be there also?

https://codereview.chromium.org/18661009/diff/26001/base/debug/proc_maps_linu...
base/debug/proc_maps_linux.cc:68: size_t pos = proc_maps->size();
You haven’t documented that proc_maps should start out as empty. And I don’t think you should have to, either. So I’d just stick a proc_maps->clear() above the loop.
https://codereview.chromium.org/18661009/diff/26001/base/debug/proc_maps_linu... File base/debug/proc_maps_linux.cc (right): https://codereview.chromium.org/18661009/diff/26001/base/debug/proc_maps_linu... base/debug/proc_maps_linux.cc:32: return proc_maps->find("[vectors]", pos) != std::string::npos; On 2013/09/05 16:31:17, Mark Mentovai wrote: > Wanna look for the trailing \n too? And one leading space, since that’ll always > be there also? Done. https://codereview.chromium.org/18661009/diff/26001/base/debug/proc_maps_linu... base/debug/proc_maps_linux.cc:68: size_t pos = proc_maps->size(); On 2013/09/05 16:31:17, Mark Mentovai wrote: > You haven’t documented that proc_maps should start out as empty. And I don’t > think you should have to, either. So I’d just stick a proc_maps->clear() above > the loop. Done and test added.
LGTM. The comment is optional. If you make the suggested change, I’ll do a quick spot check again. Otherwise, feel free to check in. https://codereview.chromium.org/18661009/diff/35001/base/debug/proc_maps_linu... File base/debug/proc_maps_linux.cc (right): https://codereview.chromium.org/18661009/diff/35001/base/debug/proc_maps_linu... base/debug/proc_maps_linux.cc:58: ssize_t bytes_read = HANDLE_EINTR(read(fd, buffer.get(), kBufferSize)); I suppose it doesn’t really matter for this non-performance-critical code, but you could save the copy (in vector<>::append) and get rid of the distinct buffer variable by using proc_maps here. You’d just have to extend it by kBufferSize prior to the read and then resize it to the actual size after the read.
https://codereview.chromium.org/18661009/diff/35001/base/debug/proc_maps_linu... File base/debug/proc_maps_linux.cc (right): https://codereview.chromium.org/18661009/diff/35001/base/debug/proc_maps_linu... base/debug/proc_maps_linux.cc:58: ssize_t bytes_read = HANDLE_EINTR(read(fd, buffer.get(), kBufferSize)); On 2013/09/05 17:52:43, Mark Mentovai wrote: > I suppose it doesn’t really matter for this non-performance-critical code, but > you could save the copy (in vector<>::append) and get rid of the distinct buffer > variable by using proc_maps here. You’d just have to extend it by kBufferSize > prior to the read and then resize it to the actual size after the read. I like it. Done. https://codereview.chromium.org/18661009/diff/48001/base/debug/proc_maps_linu... File base/debug/proc_maps_linux.cc (right): https://codereview.chromium.org/18661009/diff/48001/base/debug/proc_maps_linu... base/debug/proc_maps_linux.cc:66: proc_maps->resize(pos); I don't know if this is worth it.
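The read-into-the-string suggestion amounts to something like the following sketch (names invented): grow the output string by kBufferSize before each read(), read straight into the freshly added tail, then shrink back to the bytes actually received, eliminating the separate buffer and the append() copy.

```cpp
#include <fcntl.h>
#include <unistd.h>
#include <cerrno>
#include <string>

// Sketch of reading /proc/self/maps directly into the output string.
// std::string storage is contiguous, so reading into &(*proc_maps)[pos]
// is well defined.
bool ReadProcMapsInPlace(std::string* proc_maps) {
  const size_t kBufferSize = 1024 * 1024;
  const int fd = open("/proc/self/maps", O_RDONLY);
  if (fd < 0)
    return false;

  proc_maps->clear();
  for (;;) {
    const size_t pos = proc_maps->size();
    proc_maps->resize(pos + kBufferSize);  // Extend before the read.
    ssize_t bytes_read = read(fd, &(*proc_maps)[pos], kBufferSize);
    if (bytes_read < 0) {
      proc_maps->resize(pos);  // Drop the unused extension.
      if (errno == EINTR)
        continue;  // Interrupted by a signal; retry this read.
      close(fd);
      proc_maps->clear();
      return false;
    }
    proc_maps->resize(pos + bytes_read);  // Shrink to the actual size.
    if (bytes_read == 0)
      break;  // EOF.
  }
  close(fd);
  return true;
}
```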
LGTM

https://codereview.chromium.org/18661009/diff/48001/base/debug/proc_maps_linu...
File base/debug/proc_maps_linux.cc (right):

https://codereview.chromium.org/18661009/diff/48001/base/debug/proc_maps_linu...
base/debug/proc_maps_linux.cc:61: void* buffer = &proc_maps->front() + pos;
Simplify to &proc_maps[pos]?

https://codereview.chromium.org/18661009/diff/48001/base/debug/proc_maps_linu...
base/debug/proc_maps_linux.cc:66: proc_maps->resize(pos);
scherkus wrote:
> I don't know if this is worth it.

I think it is. Why return garbage? Either this resize(pos) or a clear() work for me.

https://codereview.chromium.org/18661009/diff/48001/base/debug/proc_maps_linu...
File base/debug/proc_maps_linux_unittest.cc (right):

https://codereview.chromium.org/18661009/diff/48001/base/debug/proc_maps_linu...
base/debug/proc_maps_linux_unittest.cc:240:
It’d be good if there was a test that ensured that ReadProcMaps was able to properly read maps larger than a page, so that you know your loop gets exercised in testing and you can validate that things from the first [and middle] and last reads show up in the output. I can’t come up with a great way to write that test case, though. I guess you could mmap a few hundred files and make sure their paths all show up in ReadProcMaps’ output.

I’m not actually asking you to write this test if you don’t want to, it’s probably not that important and I’d like to see this code get checked in after what you’ve been through for it.
https://codereview.chromium.org/18661009/diff/48001/base/debug/proc_maps_linu... File base/debug/proc_maps_linux.cc (right): https://codereview.chromium.org/18661009/diff/48001/base/debug/proc_maps_linu... base/debug/proc_maps_linux.cc:61: void* buffer = &proc_maps->front() + pos; On 2013/09/05 19:25:50, Mark Mentovai wrote: > Simplify to &proc_maps[pos]? Done, but tweaked a bit as proc_maps is a pointer https://codereview.chromium.org/18661009/diff/48001/base/debug/proc_maps_linu... base/debug/proc_maps_linux.cc:66: proc_maps->resize(pos); On 2013/09/05 19:25:50, Mark Mentovai wrote: > scherkus wrote: > > I don't know if this is worth it. > > I think it is. Why return garbage? Either this resize(pos) or a clear() work for > me. I like clear(). https://codereview.chromium.org/18661009/diff/48001/base/debug/proc_maps_linu... File base/debug/proc_maps_linux_unittest.cc (right): https://codereview.chromium.org/18661009/diff/48001/base/debug/proc_maps_linu... base/debug/proc_maps_linux_unittest.cc:240: On 2013/09/05 19:25:50, Mark Mentovai wrote: > It’d be good if there was a test that ensured that ReadProcMaps was able to > properly read maps larger than a page so that you know your loop gets exercised > in testing and you can validate that things from the first [and middle] and last > reads show up in the output. I can’t come up with a great way to write that test > case, though. I guess you could mmap a few hundred files and make sure their > paths all show up in ReadProcMaps’ output. > > I’m not actually asking you to write this test if you don’t want to, it’s > probably not that important and I’d like to see this code get checked in after > what you’ve been through for it. Yeah ... I'm inclined to pass as the ReadProcMaps test above should be sufficient for now. Debug base_unittests has a ~35KB /proc/self/maps so the loop is being covered. 
Also while I was iterating on this CL, the ReadProcMaps test was catching instances where I screwed up the string sizes 'n offsets.
LGTM! All done!
CQ is trying da patch. Follow status at https://chromium-status.appspot.com/cq/scherkus@chromium.org/18661009/55001
Message was sent while issue was closed.
Change committed as 221570