[5.10,106/129] zonefs: Fix management of open zones

Message ID	20220504153029.412540267@linuxfoundation.org
State	New
Headers	show Return-Path: <stable-owner@kernel.org> From: Greg Kroah-Hartman <gregkh@linuxfoundation.org> To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>, stable@vger.kernel.org, Damien Le Moal <damien.lemoal@opensource.wdc.com>, Johannes Thumshirn <johannes.thumshirn@wdc.com>, Hans Holmberg <hans.holmberg@wdc.com> Subject: [PATCH 5.10 106/129] zonefs: Fix management of open zones Date: Wed, 4 May 2022 18:44:58 +0200 Message-Id: <20220504153029.412540267@linuxfoundation.org> In-Reply-To: <20220504153021.299025455@linuxfoundation.org> References: <20220504153021.299025455@linuxfoundation.org> User-Agent: quilt/0.66 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Precedence: bulk
Series	None \| expand [5.10,002/129] lightnvm: disable the subsystem [5.10,003/129] usb: mtu3: fix USB 3.0 dual-role-switch from device to host [5.10,005/129] USB: quirks: add STRING quirk for VCOM device [5.10,006/129] USB: serial: whiteheat: fix heap overflow in WHITEHEAT_GET_DTR_RTS [5.10,009/129] USB: serial: option: add Telit 0x1057, 0x1058, 0x1075 compositions [5.10,013/129] iio: dac: ad5592r: Fix the missing return value. [5.10,016/129] iio: imu: inv_icm42600: Fix I2C init possible nack [5.10,018/129] usb: typec: ucsi: Fix reuse of completion structure [5.10,024/129] usb: dwc3: core: Only handle soft-reset in DCTL [5.10,026/129] usb: cdns3: Fix issue for clear halt endpoint [5.10,028/129] serial: imx: fix overrun interrupts in DMA mode [5.10,029/129] serial: 8250: Also set sticky MCR bits in console restoration [5.10,030/129] serial: 8250: Correct the clock for EndRun PTP/1588 PCIe device [5.10,031/129] arch_topology: Do not set llc_sibling if llc_id is invalid [5.10,036/129] x86/pci/xen: Disable PCI/MSI[-X] masking for XEN_HVM guests [5.10,038/129] video: fbdev: udlfb: properly check endpoint type [5.10,039/129] arm64: dts: meson: remove CPU opps below 1GHz for G12B boards [5.10,042/129] mtd: rawnand: fix ecc parameters for mt7622 [5.10,044/129] ARM: dts: imx6qdl-apalis: Fix sgtl5000 detection issue [5.10,045/129] phy: samsung: Fix missing of_node_put() in exynos_sata_phy_probe [5.10,046/129] phy: samsung: exynos5250-sata: fix missing device put in probe error paths [5.10,047/129] ARM: OMAP2+: Fix refcount leak in omap_gic_of_init [5.10,048/129] bus: ti-sysc: Make omap3 gpt12 quirk handling SoC specific [5.10,051/129] ARM: dts: at91: sama5d4_xplained: fix pinctrl phandle name [5.10,053/129] phy: ti: Add missing pm_runtime_disable() in serdes_am654_probe [5.10,054/129] ARM: dts: Fix mmc order for omap3-gta04 [5.10,055/129] ARM: dts: am3517-evm: Fix misc pinmuxing [5.10,056/129] ARM: dts: logicpd-som-lv: Fix wrong pinmuxing on OMAP35 [5.10,057/129] ipvs: correctly print the memory size of ip_vs_conn_tab [5.10,060/129] mtd: fix part field data corruption in mtd_info [5.10,061/129] pinctrl: stm32: Do not call stm32_gpio_get() for edge triggered IRQs in EOI [5.10,062/129] memory: renesas-rpc-if: Fix HF/OSPI data transfer in Manual Mode [5.10,065/129] bpf, lwt: Fix crash when using bpf_skb_set_tunnel_key() from bpf_xmit lwt hook [5.10,066/129] pinctrl: rockchip: fix RK3308 pinmux bits [5.10,069/129] tcp: ensure to use the most recently sent skb when filling the rate sample [5.10,070/129] wireguard: device: check for metadata_dst with skb_valid_dst() [5.10,071/129] sctp: check asoc strreset_chunk in sctp_generate_reconf_event [5.10,072/129] ARM: dts: imx6ull-colibri: fix vqmmc regulator [5.10,076/129] net: hns3: modify the return code of hclge_get_ring_chain_from_mbx [5.10,077/129] net: hns3: add validity check for message data length [5.10,078/129] net: hns3: add return value for mailbox handling in PF [5.10,081/129] ip6_gre: Make o_seqno start from 0 in native mode [5.10,083/129] tcp: fix potential xmit stalls caused by TCP_NOTSENT_LOWAT [5.10,087/129] net: bcmgenet: hide status block before TX timestamping [5.10,089/129] net: dsa: lantiq_gswip: Dont set GSWIP_MII_CFG_RMII_CLK [5.10,092/129] tls: Skip tls_append_frag on zero copy size [5.10,093/129] bnx2x: fix napi API usage sequence [5.10,094/129] net: fec: add missing of_node_put() in fec_enet_init_stop_mode() [5.10,096/129] ibmvnic: fix miscellaneous checks [5.10,097/129] Revert "ibmvnic: Add ethtool private flag for driver-defined queue limits" [5.10,098/129] tcp: fix F-RTO may not work correctly when receiving DSACK [5.10,099/129] ASoC: Intel: soc-acpi: correct device endpoints for max98373 [5.10,106/129] zonefs: Fix management of open zones [5.10,108/129] kasan: prevent cpu_quarantine corruption when CPU offline and cache shrink occur a... [5.10,111/129] thermal: int340x: Fix attr.show callback prototype [5.10,113/129] perf symbol: Pass is_kallsyms to symbols__fixup_end() [5.10,115/129] tty: n_gsm: fix restart handling via CLD command [5.10,116/129] tty: n_gsm: fix decoupled mux resource [5.10,118/129] tty: n_gsm: fix wrong signal octet encoding in convergence layer type 2 [5.10,121/129] tty: n_gsm: fix insufficient txframe size [5.10,123/129] tty: n_gsm: fix missing explicit ldisc flush [5.10,126/129] tty: n_gsm: fix reset fifo race condition [5.10,127/129] tty: n_gsm: fix incorrect UA handling [5.10,128/129] tty: n_gsm: fix software flow control handling [5.10,129/129] perf symbol: Remove arch__symbols__fixup_end()

Message ID

20220504153029.412540267@linuxfoundation.org

State

New

Headers

From: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
To: linux-kernel@vger.kernel.org
Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>,
        stable@vger.kernel.org,
        Damien Le Moal <damien.lemoal@opensource.wdc.com>,
        Johannes Thumshirn <johannes.thumshirn@wdc.com>,
        Hans Holmberg <hans.holmberg@wdc.com>
Subject: [PATCH 5.10 106/129] zonefs: Fix management of open zones
Date: Wed,  4 May 2022 18:44:58 +0200
Message-Id: <20220504153029.412540267@linuxfoundation.org>
In-Reply-To: <20220504153021.299025455@linuxfoundation.org>
References: <20220504153021.299025455@linuxfoundation.org>
User-Agent: quilt/0.66
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Precedence: bulk

Series

None | expand

Commit Message

Greg Kroah-Hartman May 4, 2022, 4:44 p.m. UTC

From: Damien Le Moal <damien.lemoal@opensource.wdc.com>

commit 1da18a296f5ba4f99429e62a7cf4fdbefa598902 upstream.

The mount option "explicit_open" manages the device open zone
resources to ensure that if an application opens a sequential file for
writing, the file zone can always be written by explicitly opening
the zone and accounting for that state with the s_open_zones counter.

However, if some zones are already open when mounting, the device open
zone resource usage status will be larger than the initial s_open_zones
value of 0. Ensure that this inconsistency does not happen by closing
any sequential zone that is open when mounting.

Furthermore, with ZNS drives, closing an explicitly open zone that has
not been written will change the zone state to "closed", that is, the
zone will remain in an active state. Since this can then cause failures
of explicit open operations on other zones if the drive active zone
resources are exceeded, we need to make sure that the zone is not
active anymore by resetting it instead of closing it. To address this,
zonefs_zone_mgmt() is modified to change a REQ_OP_ZONE_CLOSE request
into a REQ_OP_ZONE_RESET for sequential zones that have not been
written.

Fixes: b5c00e975779 ("zonefs: open/close zone on file open/close")
Cc: <stable@vger.kernel.org>
Signed-off-by: Damien Le Moal <damien.lemoal@opensource.wdc.com>
Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com>
Reviewed-by: Hans Holmberg <hans.holmberg@wdc.com>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
---
 fs/zonefs/super.c |   45 ++++++++++++++++++++++++++++++++++++++++-----
 1 file changed, 40 insertions(+), 5 deletions(-)

--- a/fs/zonefs/super.c
+++ b/fs/zonefs/super.c
@@ -32,6 +32,17 @@  static inline int zonefs_zone_mgmt(struc
 
 	lockdep_assert_held(&zi->i_truncate_mutex);
 
+	/*
+	 * With ZNS drives, closing an explicitly open zone that has not been
+	 * written will change the zone state to "closed", that is, the zone
+	 * will remain active. Since this can then cause failure of explicit
+	 * open operation on other zones if the drive active zone resources
+	 * are exceeded, make sure that the zone does not remain active by
+	 * resetting it.
+	 */
+	if (op == REQ_OP_ZONE_CLOSE && !zi->i_wpoffset)
+		op = REQ_OP_ZONE_RESET;
+
 	ret = blkdev_zone_mgmt(inode->i_sb->s_bdev, op, zi->i_zsector,
 			       zi->i_zone_size >> SECTOR_SHIFT, GFP_NOFS);
 	if (ret) {
@@ -1306,12 +1317,13 @@  static void zonefs_init_dir_inode(struct
 	inc_nlink(parent);
 }
 
-static void zonefs_init_file_inode(struct inode *inode, struct blk_zone *zone,
-				   enum zonefs_ztype type)
+static int zonefs_init_file_inode(struct inode *inode, struct blk_zone *zone,
+				  enum zonefs_ztype type)
 {
 	struct super_block *sb = inode->i_sb;
 	struct zonefs_sb_info *sbi = ZONEFS_SB(sb);
 	struct zonefs_inode_info *zi = ZONEFS_I(inode);
+	int ret = 0;
 
 	inode->i_ino = zone->start >> sbi->s_zone_sectors_shift;
 	inode->i_mode = S_IFREG | sbi->s_perm;
@@ -1336,6 +1348,22 @@  static void zonefs_init_file_inode(struc
 	sb->s_maxbytes = max(zi->i_max_size, sb->s_maxbytes);
 	sbi->s_blocks += zi->i_max_size >> sb->s_blocksize_bits;
 	sbi->s_used_blocks += zi->i_wpoffset >> sb->s_blocksize_bits;
+
+	/*
+	 * For sequential zones, make sure that any open zone is closed first
+	 * to ensure that the initial number of open zones is 0, in sync with
+	 * the open zone accounting done when the mount option
+	 * ZONEFS_MNTOPT_EXPLICIT_OPEN is used.
+	 */
+	if (type == ZONEFS_ZTYPE_SEQ &&
+	    (zone->cond == BLK_ZONE_COND_IMP_OPEN ||
+	     zone->cond == BLK_ZONE_COND_EXP_OPEN)) {
+		mutex_lock(&zi->i_truncate_mutex);
+		ret = zonefs_zone_mgmt(inode, REQ_OP_ZONE_CLOSE);
+		mutex_unlock(&zi->i_truncate_mutex);
+	}
+
+	return ret;
 }
 
 static struct dentry *zonefs_create_inode(struct dentry *parent,
@@ -1345,6 +1373,7 @@  static struct dentry *zonefs_create_inod
 	struct inode *dir = d_inode(parent);
 	struct dentry *dentry;
 	struct inode *inode;
+	int ret;
 
 	dentry = d_alloc_name(parent, name);
 	if (!dentry)
@@ -1355,10 +1384,16 @@  static struct dentry *zonefs_create_inod
 		goto dput;
 
 	inode->i_ctime = inode->i_mtime = inode->i_atime = dir->i_ctime;
-	if (zone)
-		zonefs_init_file_inode(inode, zone, type);
-	else
+	if (zone) {
+		ret = zonefs_init_file_inode(inode, zone, type);
+		if (ret) {
+			iput(inode);
+			goto dput;
+		}
+	} else {
 		zonefs_init_dir_inode(dir, inode, type);
+	}
+
 	d_add(dentry, inode);
 	dir->i_size++;

[5.10,106/129] zonefs: Fix management of open zones

Commit Message

Patch