Bug#664088: mdadm fails to initialize components for bitmap
Markus Hochholdinger
Markus at Hochholdinger.net
Fri Mar 30 21:28:58 UTC 2012
Package: mdadm
Version: 3.2.3-2
Followup-For: Bug #664088
Dear Maintainer,
It seems I have stumbled over this bug as well. I'm running wheezy and can reproduce a crash with:
mdadm --grow /dev/md0 --bitmap=none
mdadm --grow /dev/md0 --bitmap=internal
For a few seconds after this the system stays alive and I can see the newly created bitmap in /proc/mdstat; then the system crashes:
[ 342.437949] md0: bitmap file is out of date (0 < 322) -- forcing full recovery
[ 342.437967] created bitmap (1 pages) for device md0
[ 347.949946] BUG: unable to handle kernel NULL pointer dereference at 00000008
[ 347.949969] IP: [<eda5edbd>] bitmap_endwrite+0x138/0x199 [md_mod]
[ 347.949991] *pdpt = 0000000002660007 *pde = 0000000000000000
[ 347.950010] Oops: 0000 [#1] SMP
[ 347.950024] Modules linked in: fuse evdev snd_pcm snd_page_alloc snd_timer snd soundcore pcspkr ext3 mbcache jbd raid1 md_mod xen_netfront xen_blkfront
[ 347.950090]
[ 347.950098] Pid: 0, comm: swapper/0 Not tainted 3.2.0-2-686-pae #1
[ 347.950116] EIP: 0061:[<eda5edbd>] EFLAGS: 00010046 CPU: 0
[ 347.950130] EIP is at bitmap_endwrite+0x138/0x199 [md_mod]
[ 347.950141] EAX: 00000000 EBX: c2a4f080 ECX: c2a4f080 EDX: 00000000
[ 347.950152] ESI: 000000d0 EDI: 00000000 EBP: c2a2d1a0 ESP: c13dfdd8
[ 347.950163] DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0069
[ 347.950173] Process swapper/0 (pid: 0, ti=c13de000 task=c13e4fe0 task.ti=c13de000)
[ 347.950185] Stack:
[ 347.950191] 01a039f8 00000000 c2a4f0a4 c2a400a4 0001c608 00000000 c25b7880 c2a4f080
[ 347.950229] 00000000 ea6be2ec eda396ca 00000008 00000001 00000000 c25b7880 ea51b540
[ 347.950266] c28a5c44 eda3c22b c25b7880 eda3c33c 00000000 c28a5c44 0000000d 00000000
[ 347.950303] Call Trace:
[ 347.950316] [<eda396ca>] ? close_write+0x6a/0x78 [raid1]
[ 347.950331] [<eda3c22b>] ? r1_bio_write_done+0x1b/0x32 [raid1]
[ 347.950346] [<eda3c33c>] ? raid1_end_write_request+0xfa/0x10d [raid1]
[ 347.950364] [<c10ebf0c>] ? bio_endio+0x22/0x23
[ 347.950380] [<c114cdb3>] ? blk_update_request+0x142/0x2a5
[ 347.950394] [<eda2946d>] ? do_blkif_request+0x33a/0x35c [xen_blkfront]
[ 347.950409] [<c114cf27>] ? blk_update_bidi_request+0x11/0x4e
[ 347.950424] [<c114df73>] ? __blk_end_bidi_request+0xf/0x26
[ 347.950438] [<c114dfa3>] ? __blk_end_request_all+0x19/0x22
[ 347.950452] [<eda29705>] ? blkif_interrupt+0x208/0x26a [xen_blkfront]
[ 347.950470] [<c10774d0>] ? handle_irq_event_percpu+0x47/0x158
[ 347.950484] [<c1077602>] ? handle_irq_event+0x21/0x37
[ 347.950498] [<c10792e2>] ? handle_edge_irq+0x7f/0x98
[ 347.950514] [<c11c2615>] ? __xen_evtchn_do_upcall+0x126/0x1ad
[ 347.950529] [<c11c3874>] ? xen_evtchn_do_upcall+0x18/0x26
[ 347.950544] [<c12c5857>] ? xen_do_upcall+0x7/0xc
[ 347.950558] [<c10023a7>] ? hypercall_page+0x3a7/0x1000
[ 347.950573] [<c100609a>] ? xen_safe_halt+0xf/0x19
[ 347.950587] [<c1010d6c>] ? default_idle+0x52/0x87
[ 347.950601] [<c100b22f>] ? cpu_idle+0x95/0xaf
[ 347.950615] [<c141e708>] ? start_kernel+0x32a/0x32f
[ 347.950628] [<c1420014>] ? xen_start_kernel+0x58b/0x592
[ 347.950638] Code: 02 66 89 45 00 77 38 8b 4b 10 89 d8 8b 7c 24 04 8b 34 24 83 e9 09 0f ad fe d3 ef f6 c1 20 0f 45 f7 89 f2 e8 a5 f5 ff ff 8b 53 34 <8b> 40 08 8d 04 85 01 00 00 00 0f ab 02 c7 43 44 00 00 00 00 8b
[ 347.950851] EIP: [<eda5edbd>] bitmap_endwrite+0x138/0x199 [md_mod] SS:ESP 0069:c13dfdd8
[ 347.950876] CR2: 0000000000000008
[ 347.950887] ---[ end trace 4852e9eea21e040c ]---
[ 347.950898] Kernel panic - not syncing: Fatal exception in interrupt
[ 347.950910] Pid: 0, comm: swapper/0 Tainted: G D 3.2.0-2-686-pae #1
[ 347.950922] Call Trace:
[ 347.950934] [<c12bc219>] ? panic+0x4d/0x144
[ 347.950949] [<c12c1e2a>] ? oops_end+0x8e/0x99
[ 347.950961] [<c12bbd16>] ? no_context+0x14b/0x155
[ 347.950975] [<c12bbe34>] ? bad_area_nosemaphore+0xa/0xc
[ 347.950989] [<c12c37c4>] ? do_page_fault+0x15d/0x2fc
[ 347.951002] [<c10060da>] ? xen_force_evtchn_callback+0xc/0x10
[ 347.951016] [<c1006740>] ? check_events+0x8/0xc
[ 347.951029] [<c1006737>] ? xen_restore_fl_direct_reloc+0x4/0x4
[ 347.951046] [<c105d572>] ? arch_local_irq_restore+0x6/0x7
[ 347.951060] [<c1032370>] ? try_to_wake_up+0x14b/0x155
[ 347.951074] [<c12c3667>] ? notify_page_fault+0x37/0x37
[ 347.951087] [<c12c17ff>] ? error_code+0x67/0x6c
[ 347.951103] [<eda5edbd>] ? bitmap_endwrite+0x138/0x199 [md_mod]
[ 347.951118] [<eda396ca>] ? close_write+0x6a/0x78 [raid1]
[ 347.951132] [<eda3c22b>] ? r1_bio_write_done+0x1b/0x32 [raid1]
[ 347.951146] [<eda3c33c>] ? raid1_end_write_request+0xfa/0x10d [raid1]
[ 347.951161] [<c10ebf0c>] ? bio_endio+0x22/0x23
[ 347.951173] [<c114cdb3>] ? blk_update_request+0x142/0x2a5
[ 347.951187] [<eda2946d>] ? do_blkif_request+0x33a/0x35c [xen_blkfront]
[ 347.951202] [<c114cf27>] ? blk_update_bidi_request+0x11/0x4e
[ 347.951216] [<c114df73>] ? __blk_end_bidi_request+0xf/0x26
[ 347.951229] [<c114dfa3>] ? __blk_end_request_all+0x19/0x22
[ 347.951243] [<eda29705>] ? blkif_interrupt+0x208/0x26a [xen_blkfront]
[ 347.951258] [<c10774d0>] ? handle_irq_event_percpu+0x47/0x158
[ 347.953937] [<c1077602>] ? handle_irq_event+0x21/0x37
[ 347.953937] [<c10792e2>] ? handle_edge_irq+0x7f/0x98
[ 347.953937] [<c11c2615>] ? __xen_evtchn_do_upcall+0x126/0x1ad
[ 347.953937] [<c11c3874>] ? xen_evtchn_do_upcall+0x18/0x26
[ 347.953937] [<c12c5857>] ? xen_do_upcall+0x7/0xc
[ 347.953937] [<c10023a7>] ? hypercall_page+0x3a7/0x1000
[ 347.953937] [<c100609a>] ? xen_safe_halt+0xf/0x19
[ 347.953937] [<c1010d6c>] ? default_idle+0x52/0x87
[ 347.953937] [<c100b22f>] ? cpu_idle+0x95/0xaf
[ 347.953937] [<c141e708>] ? start_kernel+0x32a/0x32f
[ 347.953937] [<c1420014>] ? xen_start_kernel+0x58b/0x592
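As an aid for reading the trace above, here is a minimal Python sketch that pulls the key facts out of an oops excerpt: the faulting address, the function and offset, the module, and the general-purpose registers. The names `OOPS` and `summarize_oops` are my own and purely illustrative; the three log lines are copied from the trace. It is only a reading aid, not a diagnosis of the root cause.

```python
import re

# Three lines copied verbatim from the trace above.
OOPS = """\
[ 347.949946] BUG: unable to handle kernel NULL pointer dereference at 00000008
[ 347.949969] IP: [<eda5edbd>] bitmap_endwrite+0x138/0x199 [md_mod]
[ 347.950141] EAX: 00000000 EBX: c2a4f080 ECX: c2a4f080 EDX: 00000000
"""

def summarize_oops(text):
    """Extract fault address, faulting symbol/offset/module and registers."""
    fault = re.search(r"NULL pointer dereference at ([0-9a-f]+)", text)
    ip = re.search(
        r"IP: \[<[0-9a-f]+>\] (\w+)\+0x([0-9a-f]+)/0x([0-9a-f]+) \[(\w+)\]",
        text,
    )
    # Register dump entries look like "EAX: 00000000" (8 hex digits).
    regs = {
        name: int(value, 16)
        for name, value in re.findall(r"\b(E[A-Z]{2}): ([0-9a-f]{8})\b", text)
    }
    return {
        "fault_address": int(fault.group(1), 16),
        "symbol": ip.group(1),
        "offset": int(ip.group(2), 16),
        "module": ip.group(4),
        "registers": regs,
    }

info = summarize_oops(OOPS)
print(info["symbol"], hex(info["offset"]), info["module"])
print("fault at", hex(info["fault_address"]), "with EAX =", hex(info["registers"]["EAX"]))
```

The fault address 8 together with EAX = 0 matches the marked instruction bytes `<8b> 40 08` in the Code line, which on i386 decode to a load from offset 8 of the address in EAX. So the crash looks like a read through a NULL pointer inside bitmap_endwrite.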
Note:
This is a Xen domU running on a squeeze dom0. The domU itself runs linux-image-3.2.0-2-686-pae.
-- Package-specific info:
--- mdadm.conf
DEVICE partitions
CREATE owner=root group=disk mode=0660 auto=yes
HOMEHOST <system>
MAILADDR root
ARRAY /dev/md0 UUID=29c5fe51:a00cbbfd:641cea22:6a94829c
ARRAY /dev/md1 UUID=ff7a9373:7f9bf229:641cea22:6a94829c
--- /etc/default/mdadm
INITRDSTART='/dev/md0'
AUTOSTART=true
AUTOCHECK=false
START_DAEMON=false
DAEMON_OPTIONS="--syslog"
VERBOSE=false
--- /proc/mdstat:
Personalities : [raid1]
md1 : active (auto-read-only) raid1 xvda1[0] xvdb1[1]
2097088 blocks [2/2] [UU]
md0 : active raid1 xvda2[0] xvdb2[1]
23068608 blocks [2/2] [UU]
bitmap: 0/1 pages [0KB], 65536KB chunk
unused devices: <none>
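To check quickly which arrays currently carry a write-intent bitmap (for example after the --grow commands above), the mdstat output can be parsed with a short Python sketch like the following. The function name `arrays_with_bitmap` is hypothetical, and the sample text is the /proc/mdstat content shown above.

```python
import re

# Sample copied from the /proc/mdstat section above.
MDSTAT = """\
Personalities : [raid1]
md1 : active (auto-read-only) raid1 xvda1[0] xvdb1[1]
2097088 blocks [2/2] [UU]
md0 : active raid1 xvda2[0] xvdb2[1]
23068608 blocks [2/2] [UU]
bitmap: 0/1 pages [0KB], 65536KB chunk
unused devices: <none>
"""

def arrays_with_bitmap(text):
    """Map each md array name to True if a 'bitmap:' line follows its header."""
    result = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        header = re.match(r"(md\d+) : ", line)
        if header:
            # A new array section starts; assume no bitmap until we see one.
            current = header.group(1)
            result[current] = False
        elif line.startswith("bitmap:") and current is not None:
            result[current] = True
    return result

print(arrays_with_bitmap(MDSTAT))  # {'md1': False, 'md0': True}
```

On a live system the same function could be fed with `open("/proc/mdstat").read()` instead of the sample string.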
--- /proc/partitions:
major minor #blocks name
202 2 23068672 xvda2
202 18 23068672 xvdb2
202 1 2097152 xvda1
202 17 2097152 xvdb1
9 0 23068608 md0
9 1 2097088 md1
--- LVM physical volumes:
LVM does not seem to be used.
--- mount output
udev on /dev type devtmpfs (rw,relatime,size=2062236k,nr_inodes=167373,mode=755)
devpts on /dev/pts type devpts (rw,nosuid,noexec,relatime,gid=5,mode=620,ptmxmode=000)
tmpfs on /run type tmpfs (rw,nosuid,noexec,relatime,size=414912k,mode=755)
/dev/md0 on / type ext3 (rw,relatime,errors=remount-ro,acl,barrier=1,data=ordered)
proc on /proc type proc (rw,nosuid,nodev,noexec,relatime)
sysfs on /sys type sysfs (rw,nosuid,nodev,noexec,relatime)
tmpfs on /run/shm type tmpfs (rw,nosuid,nodev,relatime,size=829820k)
fusectl on /sys/fs/fuse/connections type fusectl (rw,relatime)
--- initrd.img-3.2.0-2-686-pae:
48408 blocks
ba77a6a0a061d45b2220606a3ce377a8 ./etc/mdadm/mdadm.conf
2936a2c3842d5684206d3eb2655691ee ./lib/modules/3.2.0-2-686-pae/kernel/drivers/md/raid456.ko
18ea245dbdef2145f474679ebff738d8 ./lib/modules/3.2.0-2-686-pae/kernel/drivers/md/multipath.ko
43f3e2b6b823c92ae2d5c89babb1a325 ./lib/modules/3.2.0-2-686-pae/kernel/drivers/md/linear.ko
327228eceffcbdf730e4911aad62cfae ./lib/modules/3.2.0-2-686-pae/kernel/drivers/md/md-mod.ko
53d8666d99c83ab0cdd0260bff02d5af ./lib/modules/3.2.0-2-686-pae/kernel/drivers/md/raid1.ko
445ed2de0fa12f1fd4be5bdae6e71744 ./lib/modules/3.2.0-2-686-pae/kernel/drivers/md/raid0.ko
c1446792e8f54560e2d182e97d332b98 ./lib/modules/3.2.0-2-686-pae/kernel/drivers/md/raid10.ko
c363fa521c14d9dea66bf62fb1682ebc ./lib/modules/3.2.0-2-686-pae/kernel/drivers/md/dm-mod.ko
f4fbd9099399ab08ba9b9f6c71d77595 ./scripts/local-top/mdadm
f93000eb15384550f1ec601e01162ed2 ./sbin/mdadm
--- initrd's /conf/conf.d/md:
MD_HOMEHOST='debu32w'
MD_DEVPAIRS='/dev/md0:raid1 /dev/md1:raid1'
MD_LEVELS='raid1'
MD_DEVS='/dev/md0'
MD_MODULES='raid1'
--- /proc/modules:
raid1 26218 2 - Live 0xeda39000
md_mod 85719 3 raid1, Live 0xeda4f000
--- /var/log/syslog:
--- volume detail:
/dev/[hsv]d[a-z]* not readable by user.
--- /proc/cmdline
root=/dev/md0 ro elevator=noop root=/dev/md0 ro ip=89.19.235.142:127.0.255.255:89.19.235.129:255.255.255.128:debu32w:eth0:off md=0,/dev/xvda2,/dev/xvdb2 md=1,/dev/xvda1,/dev/xvdb1
--- grub legacy:
kernel /boot/vmlinuz-3.2.0-2-686-pae root=/dev/md0 ro elevator=noop
kernel /boot/vmlinuz-3.2.0-2-686-pae root=/dev/md0 ro elevator=noop single
--- udev:
ii udev 175-3.1 /dev/ and hotplug management daemon
6df86db16655769fa94086d3ee13453a /lib/udev/rules.d/64-md-raid.rules
--- /dev:
brw-rw---T 1 root disk 9, 0 Mar 30 23:18 /dev/md0
brw-rw---- 1 root disk 9, 1 Mar 30 23:18 /dev/md1
/dev/disk/by-id:
total 0
lrwxrwxrwx 1 root root 9 Mar 30 23:18 md-uuid-29c5fe51:a00cbbfd:641cea22:6a94829c -> ../../md0
lrwxrwxrwx 1 root root 9 Mar 30 23:18 md-uuid-ff7a9373:7f9bf229:641cea22:6a94829c -> ../../md1
/dev/disk/by-path:
total 0
lrwxrwxrwx 1 root root 11 Mar 30 23:19 xen-vbd-51713 -> ../../xvda1
lrwxrwxrwx 1 root root 11 Mar 30 23:18 xen-vbd-51714 -> ../../xvda2
lrwxrwxrwx 1 root root 11 Mar 30 23:19 xen-vbd-51729 -> ../../xvdb1
lrwxrwxrwx 1 root root 11 Mar 30 23:18 xen-vbd-51730 -> ../../xvdb2
/dev/disk/by-uuid:
total 0
lrwxrwxrwx 1 root root 9 Mar 30 23:18 23c321a3-0950-4ffa-bf88-1503a1c1ff46 -> ../../md1
lrwxrwxrwx 1 root root 9 Mar 30 23:18 50050fc5-c007-4e45-a585-14560b062233 -> ../../md0
/dev/md:
total 0
Auto-generated on Fri, 30 Mar 2012 23:27:23 +0200
by mdadm bugscript 3.2.3-2
-- System Information:
Debian Release: wheezy/sid
APT prefers testing
APT policy: (500, 'testing')
Architecture: i386 (i686)
Kernel: Linux 3.2.0-2-686-pae (SMP w/12 CPU cores)
Locale: LANG=de_DE.UTF-8, LC_CTYPE=de_DE.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Versions of packages mdadm depends on:
ii debconf 1.5.42
ii initscripts 2.88dsf-22
ii libc6 2.13-27
ii lsb-base 3.2+Debian31
ii makedev 2.3.1-89
ii udev 175-3.1
Versions of packages mdadm recommends:
ii exim4-daemon-light [mail-transport-agent] 4.77-1+b1
ii module-init-tools 6-2
mdadm suggests no packages.
-- debconf information:
mdadm/initrdstart_msg_errexist:
mdadm/mail_to: root
mdadm/initrdstart_msg_errmd:
* mdadm/initrdstart: /dev/md0
mdadm/initrdstart_msg_intro:
* mdadm/autostart: true
mdadm/initrdstart_msg_errconf:
* mdadm/autocheck: false
mdadm/initrdstart_notinconf: false
mdadm/initrdstart_msg_errblock:
* mdadm/start_daemon: false