Conversation

Notices

  1. @chalkahlom there's still hope :) I had testdisk analyse and re-write partition data, and before rebooting to make that active I'm downloading another liveCD with UFS Explorer (now I need to figure out how to burn that from commandline of current RescueCD)

    Sunday, 05-Jul-15 08:07:56 UTC from oracle.skilledtests.com
    1. @mk @chalkahlom even better: SystemRescueCD comes with Xfburn :)

      Sunday, 05-Jul-15 08:12:02 UTC from oracle.skilledtests.com
      1. @mk !til I need to boot SystemRescueCD with option to load everything into memory, otherwise the optical drive is always busy and Xfburn, cdw and cdburn cannot be used ... burning UFS LiveCD now. :)

        Sunday, 05-Jul-15 09:25:34 UTC from oracle.skilledtests.com
        1. @mk running ddrescue now...

          Sunday, 05-Jul-15 10:40:03 UTC from oracle.skilledtests.com
          1. @mk ddrescue chugging along, outside it's stormy and rainy. Little I can do except be patient, read, and play a game like sudoku every now and then :(

            Sunday, 05-Jul-15 14:24:09 UTC from oracle.skilledtests.com
            1. @mk thunder, too, by now.

              Sunday, 05-Jul-15 15:18:27 UTC from oracle.skilledtests.com
          2. @mk ddrescue still running (11+ hours) but oddly it now seems to read past the end of the partition... off to my # now, I'll see what I find #!

            Sunday, 05-Jul-15 21:31:16 UTC from oracle.skilledtests.com
            1. @mk just when I want to dive in - it's finished! Run time 11.66 h. # I'll see what I have...

              Sunday, 05-Jul-15 22:04:39 UTC from oracle.skilledtests.com
              1. @mk and now, I have run e2fsck on the image file created by ddrescue - in phases, to also check if repairs actually 'stick': and they do! So now I have a HD that (as I suspected already) has somehow turned itself into a ROM, and a repaired image! Next: see what I have by using UFS Explorer on the repaired image. BUT # first! #

                Monday, 06-Jul-15 04:45:01 UTC from oracle.skilledtests.com
                1. @mk so, how on earth does a HD suddenly turn itself into a READ-ONLY drive? I've seen plenty of bad, failed or broken drives, but never a suddenly read-only one before. How do you even know it's read-only, since fsck doesn't tell you the 'repairs' it's doing are not actually written? It actually says 'file system modified' when it isn't!

                  Monday, 06-Jul-15 05:27:36 UTC from oracle.skilledtests.com
                2. @mk so much for (Ubuntu 11.10-based) UFS Explorer: once booted, my two external disks in the JBOD enclosure are 'seen' but not recognised, no partitions or even a serial number! Also, it tries to auto-mount everything - not a good idea if you need to do repairs. Back to SystemRescueCD which recognises them just fine and (via fdisk) tells me their serial numbers, and that their partition table is gpt, with 512-byte sectors. So, I'll try to mount the image ddrescue has written...

                  Monday, 06-Jul-15 06:14:52 UTC from oracle.skilledtests.com
                  1. @mk for the first time I'm missing something in the SystemRescueCD: a way/app to make a screenshot. It has Xfce, but seemingly not all of it (would that be in xfce-extra?). And Gentoo package management is beyond me for now (the documentation does explain how to customize but -while interesting- that's a bridge too far for what I'm trying to do right now). Later, I'll see if I can request some screenshooter to be part of the CD.

                    Monday, 06-Jul-15 07:26:01 UTC from oracle.skilledtests.com
                    1. @mk there doesn't seem to be a graphics viewer either... I'll definitely want to look at extending SystemRescueCD later: the best test to see if a graphics file is uncorrupted or properly repaired is opening it in a viewer or editor. Pretty important since photographs is what I do. :)

                      Monday, 06-Jul-15 08:33:12 UTC from oracle.skilledtests.com
                  2. @mk poking around the mounted 'rescued' image now and so far everything seems OK! \o/

                    Monday, 06-Jul-15 08:35:22 UTC from oracle.skilledtests.com
                    1. @mk cherry-picking most important files and directories now and saving them to a USB flash drive (with plenty of space) - no problems whatsoever, it looks as though the filesystem image is completely repaired! And I've yet to look at disk2 of the RAID set. Plus, I have a spare disk for the set, too. So it looks like I'll get all my data back, all it takes is time (a lot) and grunt work.

                      Monday, 06-Jul-15 10:54:00 UTC from oracle.skilledtests.com
                    2. @mk And, I'll have to try and figure out what caused the problem with the disk (or 2?) suddenly going read-only. Not mounted ro but 'firmware' ro. Overheating? RAID station malfunction?

                      Monday, 06-Jul-15 10:58:35 UTC from oracle.skilledtests.com
                      1. @mk one theoretical possibility is that the RAID station detected (danger of) overheating and sent a hdparm command to the drive(s) to set them read-only to prevent (further) damage. If disk2 is also found to be read-only, that will make this scenario more likely (and it would be the opposite of malfunction).

                        Monday, 06-Jul-15 11:25:03 UTC from oracle.skilledtests.com
                        1. @mk well, so much for that nice idea: querying the drive with 'hdparm -r' it tells me 'readonly' is *not* set. Then how come e2fsck cannot actually write its repairs (and is even ignorant of that fact)?? More and more #: I have never seen this behavior before!

                          Monday, 06-Jul-15 13:40:22 UTC from oracle.skilledtests.com
                      2. @mk and guess what: disk 2 in the enclosure now and it seems to be even sicker than disk 1: I/O errors when GParted scans devices, even after a few retries; filesystem in main partition 'unknown'.

                        Monday, 06-Jul-15 16:36:59 UTC from oracle.skilledtests.com
                        1. @mk testdisk 'hung' in the end, but it did make the filesystem recognizable. Replaced superblock (guessing at number & succeeding @ 2nd try) now checking but denying most repairs. I need a 'badblocks' run before that. After some grumbles e2fsck is now chugging along so it looks like most damage is at the start of the partition... #

                          Monday, 06-Jul-15 20:30:57 UTC from oracle.skilledtests.com
                          1. @mk overnight e2fsck hung, had to be terminated with hw button; it reported 'file system modified' after that anyway. Running it again now with option to check for bad blocks. It's beginning to behave like an actual mirror of disk 1 so there's still hope. :)

                            Tuesday, 07-Jul-15 04:48:25 UTC from oracle.skilledtests.com
                            1. @mk also, I started (yesterday) looking for replacement disks (I have only one) and a replacement RAID box (I don't trust this one any more). Not so much choice here in NL :( need 2-bay, SATA III, eSATA external, not (only) USB. This will be an expensive month, just three disks (WD reds) will already be €300...

                              Tuesday, 07-Jul-15 04:56:27 UTC from oracle.skilledtests.com
                              1. @mk well, this is odd: on disk 2 each attempt to run e2fsck seems to get a little farther but invariably in pass 5 (group summary information) it finds an repairs some blocks counts, then inode counts, and then it 'hangs' with the drive activity light constantly blinking; kill doesn't work, the only way out is turning the dive off with the power button. dmesg has an endless list of I/O errors an write errors, seemingly progressing but never succeeding...

                                Tuesday, 07-Jul-15 18:10:01 UTC from oracle.skilledtests.com
                                1. @mk ...but why the same write errors every time? A bad blocks scan found nothing. Some mechanical with the drive? Basically same behavior as with disk 1 (but that could not write anything at all) - I've never seen this behavior before. I can try to get data off the disk (as for disk 1) but then I guess I must give up both - terminally sick, unknown disease. :(

                                  Tuesday, 07-Jul-15 18:15:42 UTC from oracle.skilledtests.com
                                  1. @mk so much for that brilliant idea - now the partition isn't properly recognised again, so I cannot run ddrescue on it. All very instructive but I'm getting bored... :( Next plan: edit fstab to mount the rescued disk1 image as /home and limp along with that (+ make a proper backup) until I have new disks and a new RAID enclosure. *sigh*

                                    Tuesday, 07-Jul-15 19:35:03 UTC from oracle.skilledtests.com
                                    1. @mk working on my 'Next plan'; problem: the rescue *image* sits in a file on an external disk; that disk needs to be mounted before I can reference it to be mounted - so I added TWO lines to fstab: one for the disk, and one (after that) for the image to be mounted. It fails, because the line for the disk is noted, but it isn't *actually* mounted until accessed; so then the image isn't mounted either, so my home directory doesn't exist, ... and I cannot log in. Unless I log in as root first, mount the disk, mount the image, log out, log in as myself. That is # :( But it does work.

                                      Tuesday, 07-Jul-15 22:58:26 UTC from oracle.skilledtests.com
                                      1. @mk rsync of my 'rescued' home directory to Hal is chugging along nicely: finally some *good* news! :)

                                        Wednesday, 08-Jul-15 05:16:25 UTC from oracle.skilledtests.com
                                        1. @mk it seems the backup is nearing completion - I don't dare to do too much while the backup is incomplete! I may be mistaken, but an image file somehow feels more fragile to me than a HD partition. Probably not true but what is true is that this is not mirrored now, as it was on the RAID disks... Once the backup is complete (and I've done a little administration), I'll go RAID-box-and-HD hunting. :)

                                          Wednesday, 08-Jul-15 11:08:23 UTC from oracle.skilledtests.com
                                          1. @mk it's done! that's a relief. Now I can go shopping for a new RAID box and disks.

                                            Wednesday, 08-Jul-15 15:26:07 UTC from oracle.skilledtests.com
                                            1. @mk yes, creating a backup works fine - with one little annoying exception: newly-created files assume ownership of the remote user, even if I specify -o (and -g and -p and -X) and even if I run the script locally as root. I cannot figure out why, if there's some limitation I'm not aware of, or if I'm doing something wrong... so maybe I should just dive into my #

                                              Wednesday, 08-Jul-15 21:40:07 UTC from oracle.skilledtests.com