iterate_supers_type() spinning #790
wtf. (posting to track this)
Just an update... arc_adapt has still not learned the error of its ways. It has taken up permanent residence on one of my CPU cores (see below). Any suggestions about how to find out what it is doing will be gratefully received.
A further update... after another random load spike, the server became completely unresponsive yesterday morning. The stack trace from the kernel was revealing, in that it had nothing to do with ZFS. It was complaining about the Linux md driver, which supports the software RAID1 array I use to contain my OS install. So, quite possibly, the load spike issue is not relevant, and the question is simply why is arc_adapt spinning? Also, since I rebooted the server, arc_adapt has already returned to the top of the CPU utilisation list, i.e. the problem resurfaced in under 24 hours.
Kernel version is 3.2.0-25-generic (from Ubuntu Precise)
My best guess for what's going on here is that your working metadata set size exceeds the arc_meta_limit:

    arc_prune        4    37149
    arc_meta_used    4    2064458416
    arc_meta_limit   4    536870912
    arc_meta_max     4    2066623440

First I'd try reverting to the default zfs_arc_meta_limit.
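For scale: an arc_meta_used of 2064458416 bytes is roughly 1.9 GiB against a 536870912-byte (512 MiB) arc_meta_limit, nearly four times the cap, which would keep the reclaim thread constantly trying to prune metadata (consistent with the arc_prune count above).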
Thank you for taking the time to look into this bizarre issue. I have removed my user-specified zfs_arc_meta_limit. However, I also moved to CentOS 6, because I simply could not tolerate the instability of the SATA drivers provided in Ubuntu. This has been an issue in the past, and it was disappointing to discover that it had resurfaced. Also, the constant goalpost-moving of ever-newer kernel versions was not helping matters.

In summary...
Good news: everything seems stable and arc_adapt is not spinning.
Bad news: I have changed two variables, which is bad experimentation.
Oh the irony! After I posted my last update, I checked dmesg, and found this. Note that it does not appear to have caused any instability.
That's good news. As for the stack you posted, that's a known issue which is being worked on, and it's harmless. In your exact case it resulted in a network packet getting dropped, but TCP will resend, so there was no harm done.
Ok, so I'm pretty sure I just ran into this on my Ubuntu fileserver running kernel 3.2.0-26-generic #41-Ubuntu SMP. I'm running the ZFS daily PPA, which I update regularly. I had the same problem with arc_adapt spinning at 100% utilization, and it looks like it was probably due to arc_meta_limit (which I've left at the defaults). After a reboot (with all the usual things running) the meta usage is sitting close to the limit:
I'm guessing I should just bump up arc_meta_limit, although I don't actually know what it does.
The arc_meta_limit attempts to put a bound on the amount of cached metadata in the ARC. Basically, this is anything that's not bulk file data but still needs to be cached, such as inodes. Anyway, the limit is there to try to keep a reasonable balance between the file data and this metadata. If you're hitting this limit regularly you likely have a large working set of files, and it may make sense to increase it.
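As a rough illustration of the balancing act described above, here is a minimal C sketch of the eviction decision; the names arc_meta_used and arc_meta_limit follow this thread, but the function itself is hypothetical and greatly simplified compared to the real arc.c:

```c
#include <stdint.h>
#include <stdio.h>

/* Hypothetical, greatly simplified sketch -- not the actual ZFS arc.c.
 * Values below mirror the kstats quoted earlier in this thread. */
static uint64_t arc_meta_used  = 2064458416ULL; /* cached metadata, bytes */
static uint64_t arc_meta_limit = 536870912ULL;  /* soft cap, bytes */

/* Stub standing in for the real prune machinery (zpl_prune_sb() etc.). */
static void prune_metadata(uint64_t bytes_over)
{
    printf("need to evict %llu bytes of metadata\n",
           (unsigned long long)bytes_over);
}

int main(void)
{
    /* The reclaim thread keeps file data and metadata in balance by
     * pruning whenever cached metadata exceeds its limit. */
    if (arc_meta_used > arc_meta_limit)
        prune_metadata(arc_meta_used - arc_meta_limit);
    return 0;
}
```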
Ok, I just had this bug lock up my server again. It happened right as I ran Unison against a fairly large directory full of large files. The behavior was the same - arc_adapt started spinning at 100+% in top and filled up the memory until it hit swap, at which point the server became unresponsive. Is there something I can do to help debug this? It doesn't seem reasonable that metadata operations should lock up the entire server - I'd simply expect slower file listings and data searches.
I'm currently seeing this on a system running Ubuntu 12.04 64-bit, kernel 3.2.0-27-generic, zfs 0.6.0.65. It occurred after an rsync and snapshot. The system has 2 CPUs, 2GB RAM, and a 16GB USB flash L2ARC. I'll leave it spinning for a couple of days if anyone wants me to run anything. Looking at a "perf" callgraph, it seems to be spinning inside iterate_supers_type(), not even calling zpl_prune_sb() from what I can tell (pasted below). It's as if the fs_supers list has become looped. In the 3.3 kernel the commit dabe0dc194d5d56d379a8994fff47392744b6491 looks like it fixes that kind of problem - maybe that's the issue? (I think there are some related previous commits too.)
Interesting, the additional profiling makes this look like a duplicate of #861, which was recently filed. Somehow we're getting caught on the sb_lock while iterating over the super blocks. The kernel commit you referenced certainly looks relevant; we'll need to review it carefully. In the meantime, if you need a workaround you can #undef HAVE_SHRINK in zfs_config.h and rebuild zfs until we determine exactly what's going on here.
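For concreteness, a sketch of that workaround; zfs_config.h is generated by ./configure, so this edit would need redoing after each reconfigure, and the surrounding lines here are illustrative:

```c
/* In the generated zfs_config.h: change the define to an undef so the
 * shrinker code path that walks super blocks via iterate_supers_type()
 * is compiled out, then rebuild zfs. */
/* #define HAVE_SHRINK 1 */
#undef HAVE_SHRINK
```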
Looking at my system, it seems that the bug occurred while unmounting a .zfs/snapshot dir (it's still mounted but nothing is using it), matching #861. I think there's still a bug in iterate_supers_type() in head kernels. In generic_shutdown_super() it calls hlist_del_init() on s_instances, but I don't think there's anything preventing it from being the current iterator in iterate_supers_type(). Possibly deleting s_instances needs to be moved to __put_super(), like s_list (and other changes made to the places that check whether s_instances is empty).
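To see why deleting an entry out from under the iterator is dangerous, here is a small self-contained userspace demonstration of the failure mode described in the commit messages further down: list_del_init() re-points a removed node at itself, so a traversal that lands on that node never advances. This mimics the kernel's list_head semantics with a minimal reimplementation; it is an illustration, not kernel code.

```c
#include <stdio.h>

/* Minimal stand-in for the kernel's circular doubly linked list_head. */
struct list_head { struct list_head *next, *prev; };

static void list_del_init(struct list_head *entry)
{
    /* Unlink the entry... */
    entry->next->prev = entry->prev;
    entry->prev->next = entry->next;
    /* ...then reinitialize it to point at itself (the dangerous part). */
    entry->next = entry;
    entry->prev = entry;
}

int main(void)
{
    struct list_head head, a, b;

    /* Build the circular list: head <-> a <-> b <-> head. */
    head.next = &a; a.prev = &head;
    a.next = &b;    b.prev = &a;
    b.next = &head; head.prev = &b;

    struct list_head *pos = head.next;  /* iterator currently at 'a' */

    list_del_init(&a);  /* concurrent removal of the node being visited */

    /* A walk that advances via pos->next now spins on 'a' forever,
     * because a.next == &a. Cap the loop so the demo terminates. */
    int steps = 0;
    while (pos != &head && steps < 5) {
        printf("visiting %p\n", (void *)pos);
        pos = pos->next;
        steps++;
    }
    if (pos != &head)
        printf("iterator stuck: never reached the list head\n");
    return 0;
}
```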
I'm still experiencing this issue. It starts overnight, when I leave the computer turned on but no user is logged in.
This will be fixed with proper page cache integration. Until then, I'd suggest making your .zfs snapshot directory invisible so it's less likely to be traversed by processes walking the file system. This will cut down on the number of mounts and umounts and make the issue less likely.
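(For reference, the snapshot directory is hidden via the standard ZFS dataset property snapdir, e.g. zfs set snapdir=hidden tank/fs, where tank/fs is a placeholder dataset name.)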
Is this issue fixed?
@Rudd-O No, but you can likely work around it by building zfs with HAVE_SHRINK undefined.
The iterate_supers_type() function which was introduced in the 3.0 kernel was supposed to provide a safe way to call an arbitrary function on all super blocks of a specific type. Unfortunately, because a list_head was used, a bug was introduced which made it possible for iterate_supers_type() to get stuck spinning on a super block which was just deactivated. This can occur because when the list head is removed from the fs_supers list it is reinitialized to point to itself. If the iterate_supers_type() function happened to be processing the removed list_head it will get stuck spinning on that list_head.

The bug was fixed in the 3.3 kernel by converting the list_head to an hlist_node. However, to resolve the issue for existing 3.0 - 3.2 kernels we detect when a list_head is used. Then, to prevent the spinning from occurring, the .next pointer is set to the fs_supers list_head, which ensures the iterate_supers_type() function will always terminate.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes openzfs#1045
Closes openzfs#861
Closes openzfs#790
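A sketch of the compat workaround the commit describes, in illustrative form only; the helper name and its placement are assumptions, not the actual patch. A removed list_head is detectable because it points at itself, and steering .next back to the fs_supers head guarantees the walk terminates:

```c
struct list_head { struct list_head *next, *prev; };

/* Illustrative helper, not the actual ZFS patch: if the current entry
 * was list_del_init()'ed, its .next points at itself; redirect it to
 * the fs_supers list head so the iteration always terminates. */
static inline struct list_head *
safe_next(struct list_head *pos, struct list_head *fs_supers_head)
{
    if (pos->next == pos)       /* removed entry spins on itself */
        return fs_supers_head;  /* force termination at the head */
    return pos->next;
}
```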
I'm running ZFS 0.6.0.65 (kernel module) on Ubuntu 12.04 64-bit.
With this version of ZFS, and previous versions, I have the same issue, which I shall try to describe...
A few hours after booting the server (usually the next day), the load average reported will be insanely high (3600 is my personal best) for a period of time.
This situation resolves itself without any intervention from me. However, it leaves behind an arc_adapt process which consumes 100% of one core until I reboot the server.
Having read that arc_adapt doesn't have much responsibility these days, I am at a loss to explain this behaviour.
Annoyingly, I am unable to attach gdb or strace to the arc_adapt kernel space process, in order to further diagnose the issue.
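Kernel threads like arc_adapt cannot be traced with gdb or strace from userspace; on kernels built with CONFIG_STACKTRACE, however, cat /proc/<pid>/stack dumps a thread's current kernel stack, and a profiler such as perf can show where it is spinning (as is done later in this thread).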
Here is some further information:
RAM: 8GB
ZFS configuration: RAIDZ2 comprised of 8x2TB and 8x1TB drives