From patchwork Wed May 14 16:58:59 2025 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Rob Clark X-Patchwork-Id: 890299 Received: from mail-pj1-f49.google.com (mail-pj1-f49.google.com [209.85.216.49]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 1EA0323CE; Wed, 14 May 2025 17:03:10 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.216.49 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1747242192; cv=none; b=QUOM4E2HZ3iVoR1I19UkLaMz+3WW5d6eLRhFL5LZh6VtRem4Gd8HR7MmxW4j9xIHzPza1fcpJFCZVb12EbCgj5B/8wRfllD+jQgdXH3tjsTUAYOneva5e1xGFsHkiLtZ5KxH1IeQhJNOrBEmk3i8+rhKhxKkWrwG9WOEgnIx3jc= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1747242192; c=relaxed/simple; bh=BJ/MngDQrweqXLyzyk/mOzWV+IC2hvudVpGaRIzWNus=; h=From:To:Cc:Subject:Date:Message-ID:MIME-Version; b=sf80NrYF61OTKuRAhLA4+8atkoLhCcQrt3nbW7mnPFXnflCa8inywtkxzlziCcOtM/+hDBH3OtChXdlp2PISvmG7xp3Ta9Jsxmu5l8E2MFdt24ddOWZ7lf9a+z6gh5bHMGUVpQn0lBOomt8EOXNm6RGD1AOMVxEc7IQ4SmPZbH4= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=gmail.com; spf=pass smtp.mailfrom=gmail.com; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b=dFLnwctf; arc=none smtp.client-ip=209.85.216.49 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=gmail.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=gmail.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b="dFLnwctf" Received: by mail-pj1-f49.google.com with SMTP id 98e67ed59e1d1-30c3a038acfso100825a91.3; Wed, 14 May 2025 10:03:10 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1747242190; x=1747846990; darn=vger.kernel.org; h=content-transfer-encoding:mime-version:message-id:date:subject:cc :to:from:from:to:cc:subject:date:message-id:reply-to; bh=tTRZYELUxikHuZrnr6hjy5+oWoL6AQWZaZWVaCTTMUs=; b=dFLnwctfjC3TRFKWn5qfZgCARkq7Xo+3n/w/Djj8abK0emPlK95ehHzr+lxo1xd7vg S63MfwwpS/XN3iFzypzT22ZhB8KatVREb3E7DLxzklEb+qXXDpY6XCZXJDdV5pD5dPtB 2cF3eTangbV9uNEpbMLGV08f2lyi03dNdlYFTz62HUj9e5mn/awu0b+yF/03qLqBk0/l Z6H4I+03ccHCBoTqfog/RhidgGyEEGO8xNcgThCYM1oTXy+6kLL0BAkSwfGcP7/AF+/q ev6hGLfs+lZAkVvjcZyipixdwibsPfzthKwu/zuCEzwoG1ypkf8wbaXImG07AaHB+EpS xtAA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1747242190; x=1747846990; h=content-transfer-encoding:mime-version:message-id:date:subject:cc :to:from:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=tTRZYELUxikHuZrnr6hjy5+oWoL6AQWZaZWVaCTTMUs=; b=jjx3c3ejkYrQAsVwkVff3bgffjh6dupNJBg5wiovAWEASEQq1l/63MVosrzzVa4qg0 IaxOq6RBsnW9BZvQ5g0dBfL4JVdoZqCaYLnhYDjwMUBPxhpKB/O61gvSXetTh9JgAkB3 7JA5iL9AJvoamUb3v6fZ8/P+j8yiQ3AdG8B0jsBKgtS+DHl+KKRKzTEDueVCzBUF5Q7r NW0neDSEN3vZ43wnypIyBWOEcHjz6qlZP09dQW3ickTFiAFIsJ0tRF92ufPvT6O1bTfy Mzhaak6cPsuB6dbJ5+AXrFC6zIvu3unp2Zb/hQ9im7wLy8q/DvinGl2xnUAaWp4KcYEo 3pFg== X-Forwarded-Encrypted: i=1; AJvYcCUZLdxWV1K/P1t1k8XjAdNAljlfCSX/TFf7RVn7qr2A861N5P7nFLuWjkiWDBwAwPiUbakGr2aILenC8lty@vger.kernel.org, AJvYcCWvpO8hsD0jHGzgEQCKwQ4TEhzurOPTncH3EQ4l967WLJ7txsqTBc55K8WqYJ+/QVZG3HvUPslJRCWV8iI=@vger.kernel.org, AJvYcCXp3/FBGycvpMTEgP7sIK2cDFBygbYZKRHUxvWjO5xikL7/bIUnKvZ1XD1p81Szd18K0nEbAAnpfTLaqnpM@vger.kernel.org X-Gm-Message-State: AOJu0Yz5nhQ7Ju6nkv+0KETA0PpM5Imxb11AELMG5t37wKAq4+MUWT4y SVGExmnJnq4lMjQwJnr6cAYHXjI4G12m7SijHJ4FnYwTSKPUOmfy X-Gm-Gg: ASbGncvUyD9kuT3GqFdP3+uYzwAXSvJejqGSaOPS0MBBrwlIm/doHSamIKuYe25gn/K swt+vToBgM+UemPz4mF2UM9KhxQf/oZtyuYoKCBC2DxmKOGh5FweDG04/ArVtwci5Tk3DyLeJpO yeozxJPEp4xlByWX2AigN0MrSkMftQUfUsKrfvFvt0pCsnuuPm3oSUPPUSNm+cpsh4XxrQBXaN6 BvCwK9UF+tOU7Ik3VY/4XCcMEvnKFeA9jskxvziBQC6DoZH8bNxUvyxyjw7N5+f54CJWLnFeHYW bi41nCOG8HXkGA5BUYWWk30R1hJkPdIIkyEJ04/ocshY9mCl/W4mH4exGhv3iHGtbnnui6GPlXK llve0nT0RYenXdaBCIXxP6V1Xww== X-Google-Smtp-Source: AGHT+IG+/kVgSThsJVSjiF3uCAxUrBjb1zbLg+VbmolSv8I7lQK3Z+fmpPdwjzXJNN06J8fdXZzhsg== X-Received: by 2002:a17:90b:51c5:b0:2fe:a79e:f56f with SMTP id 98e67ed59e1d1-30e2e5bb68cmr7472695a91.13.1747242190032; Wed, 14 May 2025 10:03:10 -0700 (PDT) Received: from localhost ([2a00:79e0:3e00:2601:3afc:446b:f0df:eadc]) by smtp.gmail.com with ESMTPSA id 41be03b00d2f7-b2350ddbafdsm7966273a12.53.2025.05.14.10.03.09 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 14 May 2025 10:03:09 -0700 (PDT) From: Rob Clark To: dri-devel@lists.freedesktop.org Cc: freedreno@lists.freedesktop.org, linux-arm-msm@vger.kernel.org, Connor Abbott , Rob Clark , Abhinav Kumar , =?utf-8?q?Andr=C3=A9_Almeida?= , Arnd Bergmann , =?utf-8?b?QmFybmFiw6FzIEN6w6ltw6Fu?= , =?utf-8?q?Christian_K=C3=B6nig?= , Christopher Snowhill , Dmitry Baryshkov , Dmitry Baryshkov , Eugene Lepshy , iommu@lists.linux.dev (open list:IOMMU SUBSYSTEM), Jason Gunthorpe , Jessica Zhang , Joao Martins , Jonathan Marek , Kevin Tian , Konrad Dybcio , Krzysztof Kozlowski , linaro-mm-sig@lists.linaro.org (moderated list:DMA BUFFER SHARING FRAMEWORK:Keyword:\bdma_(?:buf|fence|resv)\b), linux-arm-kernel@lists.infradead.org (moderated list:ARM SMMU DRIVERS), linux-kernel@vger.kernel.org (open list), linux-media@vger.kernel.org (open list:DMA BUFFER SHARING FRAMEWORK:Keyword:\bdma_(?:buf|fence|resv)\b), Marijn Suijten , Nicolin Chen , Robin Murphy , Sean Paul , Will Deacon Subject: [PATCH v4 00/40] drm/msm: sparse / "VM_BIND" support Date: Wed, 14 May 2025 09:58:59 -0700 Message-ID: <20250514170118.40555-1-robdclark@gmail.com> X-Mailer: git-send-email 2.49.0 Precedence: bulk X-Mailing-List: linux-arm-msm@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 From: Rob Clark Conversion to DRM GPU VA Manager[1], and adding support for Vulkan Sparse Memory[2] in the form of: 1. A new VM_BIND submitqueue type for executing VM MSM_SUBMIT_BO_OP_MAP/ MAP_NULL/UNMAP commands 2. A new VM_BIND ioctl to allow submitting batches of one or more MAP/MAP_NULL/UNMAP commands to a VM_BIND submitqueue I did not implement support for synchronous VM_BIND commands. Since userspace could just immediately wait for the `SUBMIT` to complete, I don't think we need this extra complexity in the kernel. Synchronous/immediate VM_BIND operations could be implemented with a 2nd VM_BIND submitqueue. The corresponding mesa MR: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32533 Changes in v4: - Various locking/etc fixes - Optimize the pgtable preallocation. If userspace sorts the VM_BIND ops then the kernel detects ops that fall into the same 2MB last level PTD to avoid duplicate page preallocation. - Add way to throttle pushing jobs to the scheduler, to cap the amount of potentially temporary prealloc'd pgtable pages. - Add vm_log to devcoredump for debugging. If the vm_log_shift module param is set, keep a log of the last 1< msm_context drm/msm: Improve msm_context comments drm/msm: Rename msm_gem_address_space -> msm_gem_vm drm/msm: Remove vram carveout support drm/msm: Collapse vma allocation and initialization drm/msm: Collapse vma close and delete drm/msm: Don't close VMAs on purge drm/msm: drm_gpuvm conversion drm/msm: Convert vm locking drm/msm: Use drm_gpuvm types more drm/msm: Split out helper to get iommu prot flags drm/msm: Add mmu support for non-zero offset drm/msm: Add PRR support drm/msm: Rename msm_gem_vma_purge() -> _unmap() drm/msm: Drop queued submits on lastclose() drm/msm: Lazily create context VM drm/msm: Add opt-in for VM_BIND drm/msm: Mark VM as unusable on GPU hangs drm/msm: Add _NO_SHARE flag drm/msm: Crashdump prep for sparse mappings drm/msm: rd dumping prep for sparse mappings drm/msm: Crashdec support for sparse drm/msm: rd dumping support for sparse drm/msm: Extract out syncobj helpers drm/msm: Use DMA_RESV_USAGE_BOOKKEEP/KERNEL drm/msm: Add VM_BIND submitqueue drm/msm: Support IO_PGTABLE_QUIRK_NO_WARN_ON drm/msm: Support pgtable preallocation drm/msm: Split out map/unmap ops drm/msm: Add VM_BIND ioctl drm/msm: Add VM logging for VM_BIND updates drm/msm: Add VMA unmap reason drm/msm: Add mmu prealloc tracepoint drm/msm: use trylock for debugfs drm/msm: Bump UAPI version drivers/gpu/drm/drm_gem.c | 14 +- drivers/gpu/drm/drm_gpuvm.c | 15 +- drivers/gpu/drm/msm/Kconfig | 1 + drivers/gpu/drm/msm/Makefile | 1 + drivers/gpu/drm/msm/adreno/a2xx_gpu.c | 25 +- drivers/gpu/drm/msm/adreno/a2xx_gpummu.c | 5 +- drivers/gpu/drm/msm/adreno/a3xx_gpu.c | 17 +- drivers/gpu/drm/msm/adreno/a4xx_gpu.c | 17 +- drivers/gpu/drm/msm/adreno/a5xx_debugfs.c | 4 +- drivers/gpu/drm/msm/adreno/a5xx_gpu.c | 22 +- drivers/gpu/drm/msm/adreno/a5xx_power.c | 2 +- drivers/gpu/drm/msm/adreno/a5xx_preempt.c | 10 +- drivers/gpu/drm/msm/adreno/a6xx_gmu.c | 32 +- drivers/gpu/drm/msm/adreno/a6xx_gmu.h | 2 +- drivers/gpu/drm/msm/adreno/a6xx_gpu.c | 49 +- drivers/gpu/drm/msm/adreno/a6xx_gpu_state.c | 6 +- drivers/gpu/drm/msm/adreno/a6xx_preempt.c | 10 +- drivers/gpu/drm/msm/adreno/adreno_device.c | 4 - drivers/gpu/drm/msm/adreno/adreno_gpu.c | 99 +- drivers/gpu/drm/msm/adreno/adreno_gpu.h | 23 +- .../drm/msm/disp/dpu1/dpu_encoder_phys_wb.c | 14 +- drivers/gpu/drm/msm/disp/dpu1/dpu_formats.c | 18 +- drivers/gpu/drm/msm/disp/dpu1/dpu_formats.h | 2 +- drivers/gpu/drm/msm/disp/dpu1/dpu_kms.c | 18 +- drivers/gpu/drm/msm/disp/dpu1/dpu_plane.c | 14 +- drivers/gpu/drm/msm/disp/dpu1/dpu_plane.h | 4 +- drivers/gpu/drm/msm/disp/mdp4/mdp4_crtc.c | 6 +- drivers/gpu/drm/msm/disp/mdp4/mdp4_kms.c | 28 +- drivers/gpu/drm/msm/disp/mdp4/mdp4_plane.c | 12 +- drivers/gpu/drm/msm/disp/mdp5/mdp5_crtc.c | 4 +- drivers/gpu/drm/msm/disp/mdp5/mdp5_kms.c | 19 +- drivers/gpu/drm/msm/disp/mdp5/mdp5_plane.c | 12 +- drivers/gpu/drm/msm/dsi/dsi_host.c | 14 +- drivers/gpu/drm/msm/msm_drv.c | 184 +-- drivers/gpu/drm/msm/msm_drv.h | 35 +- drivers/gpu/drm/msm/msm_fb.c | 18 +- drivers/gpu/drm/msm/msm_fbdev.c | 2 +- drivers/gpu/drm/msm/msm_gem.c | 494 +++--- drivers/gpu/drm/msm/msm_gem.h | 247 ++- drivers/gpu/drm/msm/msm_gem_prime.c | 15 + drivers/gpu/drm/msm/msm_gem_shrinker.c | 104 +- drivers/gpu/drm/msm/msm_gem_submit.c | 295 ++-- drivers/gpu/drm/msm/msm_gem_vma.c | 1471 ++++++++++++++++- drivers/gpu/drm/msm/msm_gpu.c | 214 ++- drivers/gpu/drm/msm/msm_gpu.h | 144 +- drivers/gpu/drm/msm/msm_gpu_trace.h | 14 + drivers/gpu/drm/msm/msm_iommu.c | 302 +++- drivers/gpu/drm/msm/msm_kms.c | 18 +- drivers/gpu/drm/msm/msm_kms.h | 2 +- drivers/gpu/drm/msm/msm_mmu.h | 38 +- drivers/gpu/drm/msm/msm_rd.c | 62 +- drivers/gpu/drm/msm/msm_ringbuffer.c | 10 +- drivers/gpu/drm/msm/msm_submitqueue.c | 96 +- drivers/gpu/drm/msm/msm_syncobj.c | 172 ++ drivers/gpu/drm/msm/msm_syncobj.h | 37 + drivers/gpu/drm/scheduler/sched_entity.c | 16 +- drivers/gpu/drm/scheduler/sched_main.c | 3 + drivers/iommu/io-pgtable-arm.c | 27 +- include/drm/drm_gem.h | 10 +- include/drm/drm_gpuvm.h | 12 +- include/drm/gpu_scheduler.h | 13 +- include/linux/io-pgtable.h | 8 + include/uapi/drm/msm_drm.h | 149 +- 63 files changed, 3484 insertions(+), 1251 deletions(-) create mode 100644 drivers/gpu/drm/msm/msm_syncobj.c create mode 100644 drivers/gpu/drm/msm/msm_syncobj.h