[v2] nptl: Invert the mmap/mprotect logic on allocated stacks (BZ#18988)

This is an update from my previous patch [1].  Then change from previous
version are:

  - Create inline functions for guard page address calculation and
    segments protection setup;
  - Fix an issue for downwards stack allocation.

--

Current allocate_stack logic for create stacks is to first mmap all
the required memory with the desirable memory and then mprotect the
guard area with PROT_NONE if required.  Although it works as expected,
it pessimizes the allocation because it requires the kernel to actually
increase commit charge (it counts against the available physical/swap
memory available for the system).

The only issue is to actually check this change since side-effects are
really Linux specific and to actually account them it would require a
kernel specific tests to parse the system wide information.  On the kernel
I checked /proc/self/statm does not show any meaningful difference for
vmm and/or rss before and after thread creation.  I could only see
really meaningful information checking on system wide /proc/meminfo
between thread creation: MemFree, MemAvailable, and Committed_AS shows
large difference without the patch.  I think trying to use these
kind of information on a testcase is fragile.

The BZ#18988 reports shows that the commit pages are easily seen with
mlockall (MCL_FUTURE) (with lock all pages that become mapped in the
process) however a more straighfoward testcase shows that pthread_create
could be faster using this patch:

--
static const int inner_count = 256;
static const int outer_count = 128;

static
void *thread1(void *arg)
{
  return NULL;
}

static
void *sleeper(void *arg)
{
  pthread_t ts[inner_count];
  for (int i = 0; i < inner_count; i++)
    pthread_create (&ts[i], &a, thread1, NULL);
  for (int i = 0; i < inner_count; i++)
    pthread_join (ts[i], NULL);

  return NULL;
}

int main(void)
{
  pthread_attr_init(&a);
  pthread_attr_setguardsize(&a, 1<<20);
  pthread_attr_setstacksize(&a, 1134592);

  pthread_t ts[outer_count];
  for (int i = 0; i < outer_count; i++)
    pthread_create(&ts[i], &a, sleeper, NULL);
  for (int i = 0; i < outer_count; i++)
    pthread_join(ts[i], NULL);
    assert(r == 0);
  }
  return 0;
}

--

On x86_64 (4.4.0-45-generic, gcc 5.4.0) running the small benchtests
I see:

$ time ./test

real	0m3.647s
user	0m0.080s
sys	0m11.836s

While with the patch I see:

$ time ./test

real	0m0.696s
user	0m0.040s
sys	0m1.152s

So I added a pthread_create benchtest (thread_create) which check
the thread creation latency.  As for the simple benchtests, I saw
improvements in thread creation on all architectures I tested the
change.

Checked on x86_64-linux-gnu, i686-linux-gnu, aarch64-linux-gnu,
arm-linux-gnueabihf, and powerpc64le-linux-gnu.

	[BZ #18988]
	* benchtests/thread_create-inputs: New file.
	* benchtests/thread_create-source.c: Likewise.
	* support/xpthread_attr_setguardsize.c: Likewise.
	* support/Makefile (libsupport-routines): Add
	xpthread_attr_setguardsize object.
	* support/xthread.h: Add xpthread_attr_setguardsize prototype.
	* benchtests/Makefile (bench-pthread): Add thread_create.
	* nptl/allocatestack.c (allocate_stack): Call mmap with PROT_NONE and
	then mprotect the required area.

[1] https://sourceware.org/ml/libc-alpha/2017-02/msg00033.html

---
 ChangeLog                            | 13 +++++++
 benchtests/Makefile                  |  2 +-
 benchtests/thread_create-inputs      | 14 ++++++++
 benchtests/thread_create-source.c    | 58 +++++++++++++++++++++++++++++++
 nptl/allocatestack.c                 | 66 +++++++++++++++++++++++++++++++-----
 support/Makefile                     |  1 +
 support/xpthread_attr_setguardsize.c | 26 ++++++++++++++
 support/xthread.h                    |  2 ++
 8 files changed, 173 insertions(+), 9 deletions(-)
 create mode 100644 benchtests/thread_create-inputs
 create mode 100644 benchtests/thread_create-source.c
 create mode 100644 support/xpthread_attr_setguardsize.c

-- 
2.7.4

Message ID	1486414193-11241-1-git-send-email-adhemerval.zanella@linaro.org
State	Accepted
Commit	0edbf1230131dfeb03d843d2859e2104456fad80
Headers	show Delivered-To: patch@linaro.org Received: by 10.140.20.99 with SMTP id 90csp1882724qgi; Mon, 6 Feb 2017 12:50:28 -0800 (PST) X-Received: by 10.99.67.6 with SMTP id q6mr15536803pga.156.1486414228427; Mon, 06 Feb 2017 12:50:28 -0800 (PST) Return-Path: <libc-alpha-return-77192-patch=linaro.org@sourceware.org> Received: from sourceware.org (server1.sourceware.org. [209.132.180.131]) by mx.google.com with ESMTPS id m3si1777773pgm.91.2017.02.06.12.50.28 for <patch@linaro.org> (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Mon, 06 Feb 2017 12:50:28 -0800 (PST) Received-SPF: pass (google.com: domain of libc-alpha-return-77192-patch=linaro.org@sourceware.org designates 209.132.180.131 as permitted sender) client-ip=209.132.180.131; Authentication-Results: mx.google.com; dkim=pass header.i=@sourceware.org; spf=pass (google.com: domain of libc-alpha-return-77192-patch=linaro.org@sourceware.org designates 209.132.180.131 as permitted sender) smtp.mailfrom=libc-alpha-return-77192-patch=linaro.org@sourceware.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=linaro.org DomainKey-Signature: a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id :list-unsubscribe:list-subscribe:list-archive:list-post :list-help:sender:from:to:subject:date:message-id; q=dns; s= default; b=FMJM4GzEVsa7mr5ThLoqjB8Dt7eXuswFBjAKQ50v/FNjyv37LkLB6 lQ9GQKQF/w1k6Z0ALwm7oIavQgWD/yJOU42Y4i50XzbBgXnOt5mRe+ud2f06NLdi vSK4moD26755aCiwzdUFVn62sG1xQGchmJsgCAzo29QYLI9srh17UA= DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=sourceware.org; h=list-id :list-unsubscribe:list-subscribe:list-archive:list-post :list-help:sender:from:to:subject:date:message-id; s=default; bh=1Ftotmj0d1tzA7h4bxdVfeaIib0=; b=F7OFXUc2qsKm3ZOVsdCNq94qQAoY fJ5QUgdfHR/QRCCWz2LMSUCl8zCSlE93ZvQ4lgOBs133E7G1AT8Bg7NglGLyQW0k Fw5fQf4DOmhiH9LGsfg3W0uL2qmOjuoVEToaQE/9PP3OoV5FlhIk58blDS2QjXCZ I/c+pwUF0nSVDtE= Received: (qmail 36055 invoked by alias); 6 Feb 2017 20:50:17 -0000 Mailing-List: contact libc-alpha-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: <libc-alpha.sourceware.org> List-Unsubscribe: <mailto:libc-alpha-unsubscribe-patch=linaro.org@sourceware.org> List-Subscribe: <mailto:libc-alpha-subscribe@sourceware.org> List-Archive: <http://sourceware.org/ml/libc-alpha/> List-Post: <mailto:libc-alpha@sourceware.org> List-Help: <mailto:libc-alpha-help@sourceware.org>, <http://sourceware.org/ml/#faqs> Sender: libc-alpha-owner@sourceware.org Delivered-To: mailing list libc-alpha@sourceware.org Received: (qmail 36042 invoked by uid 89); 6 Feb 2017 20:50:16 -0000 Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=1.0 required=5.0 tests=AWL, BAYES_50, RCVD_IN_DNSWL_NONE, RCVD_IN_SORBS_SPAM, SPF_PASS autolearn=no version=3.3.2 spammy=1, 14, 67, 6, fminf, adhemerval.zanella@linaro.org X-HELO: mail-qt0-f169.google.com X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:subject:date:message-id; bh=kcqkS8Oj4ha3W9IT32WnmRTXFoU1qPoswXDgCsisA/k=; b=d91FoZWfUO69WMcm4EXL1SBGTq/ZxDgQLoVOmvWLiCh8HERR9Ip0z2WaL95tZZjTfw TXrCpSiPUYVN6hZ3Rn/+LfEGRjCv+P9uz4eB8sdeSyby3jB+9UTu0AEXDUj2wbOQ/0hy 1dpUWoEHEiTZ8wJVCIbyF9A/A7RywVeWSIgzZ11NmLngUvdrYe37ULzu3V5FtpAAI/fX HlVHf5JA2QQHdHw+UDCq4iVjqX/m8p9an4AxCgAgtGtdNMLIY4b5hE39T45jcqnH9E1j BiyBB4PftsJqlIJ+4+ttfKSBRvynFdaK0mo5GJXoRk9Ef3hXHmSrJzT0IgG/8iBUMuKb +3yQ== X-Gm-Message-State: AMke39n/nfcXUcSD+YvVH2B6PwgJr5GghSrMX/X8XUh6+jO63XsY4stmRcfTs3SvltFlIIFM X-Received: by 10.200.41.175 with SMTP id 44mr12080975qts.53.1486414204432; Mon, 06 Feb 2017 12:50:04 -0800 (PST) From: Adhemerval Zanella <adhemerval.zanella@linaro.org> To: libc-alpha@sourceware.org Subject: [PATCH v2] nptl: Invert the mmap/mprotect logic on allocated stacks (BZ#18988) Date: Mon, 6 Feb 2017 18:49:53 -0200 Message-Id: <1486414193-11241-1-git-send-email-adhemerval.zanella@linaro.org>

[v2] nptl: Invert the mmap/mprotect logic on allocated stacks (BZ#18988)

Commit Message

Comments

Patch