-
Hello, I know, that the usage of ZFS on 32bit devices is not encouraged. However, I have been using zfs 0.6x and 0.7x reliably on my device for a long time. I cannot judge the performance, but it was perfectly stable, the only downside were troubles with the debian dkms packages. This was fixed in later versions and now the kernel modules compile perfectly without interaction. But my problem is, that zfs is not stable anymore. I tried reducing threads, tweaking the module parameters to a minimum... However just a simple scrub can result in a hanging system. Same applies to heavy writing and reading on the zfs volumes. My question: Did anyone find the perfect kernel parameters to maximise stability, even on weak hardware? |
Beta Was this translation helpful? Give feedback.
Replies: 6 comments 3 replies
-
Could you possibly specify in more detail what hardware you're running on, what specific OS version, what pool layout and any non-default properties, and any messages that came out in dmesg from it? Does /proc/spl/kstat/zfs/dbgmsg say anything interesting when this happens? What does /proc/spl/kstat/zfs/[poolname]/txgs say when it's doing this? |
Beta Was this translation helpful? Give feedback.
-
@rincebrain Thank you for your question I am using an Asus Tinker Board (slightly better than Raspberry Pi 3), 2GB RAM and Armbian (just Debian buster with some adaptions for the device) as an OS. Kernel: With zfs 0.6/0.7 I was using an Armbian version based on Debian stretch The pool layout is RAIDZ1, consisting of three external drives connected via USB2, the board does not support USB3. The drives are ok, no SMART-problems, no USB cable problems, everything powered properly, I have not changed the hardware setup since the 0.6/0.7 times but cross-checked to rule-out hardware problems. The ARM-board is stable under load or during long-term operations. Root is on ZFS and regular services (eg. light databse usage) and their disk access do not create any problems. Only heavy disk access (eg writing tens of GB, scrub) creates trouble. No special properties, compression enabled, dedup off, some harmless perfomance optimisations. Kernel boot line has: and modprobe currently has (I have tested various settings properly before, but lost my values due to to unrelated reasons, these ones are not that systematic, except arc)
I do not have any insight what happens when the board hangs. Root is on the zfs pool and the machine is headless. The system logs to RAM and ocasionally writes to disk. I do not want to unnecessarily stress-test this system, I just would like it to be slow and stable. |
Beta Was this translation helpful? Give feedback.
-
Curious. I've used ZFS on rather weak systems for a long time, and not had this particular problem. I should point out that while the Pi 3 may be weaker in an absolute sense, it does have the benefit of being a 64-bit chip, which could make a difference here. I'll try firing up my Pi 3 with a 32-bit OS and see what I can make it do, though I don't have a set of USB hard drives I could turn into a raidz1 particularly easily... |
Beta Was this translation helpful? Give feedback.
-
I do not think it is related to 32 vs 64 bit-chip. My suspicion is the slow USB 2 connection of the disks in combination with the improvements in ZFS. I think I started seeing these problems with zfs 0.8x, but those were not isolated upgrades, so there could be many reasons. |
Beta Was this translation helpful? Give feedback.
-
Certainly one place I'd be suspicious. I believe on the Pi 3 it's still the case that the whole USB2 port set is behind a single actual port to the SoC and then an onboard hub, so if I can dig up enough disks, I can probably choke similarly... |
Beta Was this translation helpful? Give feedback.
-
It is not really an answer, but after an upgrade to 2.1.4 and and a 5.15 kernel, the system is stable and consistent again. Unfortunately I upgraded to many things at once to narrow it down. |
Beta Was this translation helpful? Give feedback.
It is not really an answer, but after an upgrade to 2.1.4 and and a 5.15 kernel, the system is stable and consistent again. Unfortunately I upgraded to many things at once to narrow it down.