VHBA and RocketRaid 2320 (rr232x) modules with Kernel 2.6.37.x
Only a few days ago I was informed by a user that the VHBA kernel module – for which I maintain the SlackBuild script – stopped working with the new 2.6.37.3 and 2.6.37.4 kernels in Slackware 13.37 (actually, it’s still officially -current).
When I say “stopped working”, I’m actually being nice… It caused a fatal crash that requires rebooting the box it runs on
The user sent me a patch that worked, but hacking kernel modules is not to be taken lightly, so I wanted to understand what was going on…
Read the documentation!
It’s something most people don’t like, but it actually is a good idea to read the documentation once in a while ![]()
In /usr/src/linux/Documentation/scsi/scsi_mid_low_api.txt (if you have the 2.6.37.x kernel installed), I found this interesting comment:
Locks: up to and including 2.6.36, struct Scsi_Host::host_lock
held on entry (with “irqsave”) and is expected to be
held on return. From 2.6.37 onwards, queuecommand is
called without any locks held.
Well, that seems to be the culprit then… So now I headed for the kernel git repository and found this interesting commit:
Move the mid-layer’s ->queuecommand() invocation from being locked
with the host lock to being unlocked to facilitate speeding up the
critical path for drivers who don’t need this lock taken anyway.The patch below presents a simple SCSI host lock push-down as an
equivalent transformation. No locking or other behavior should change
with this patch. All existing bugs and locking orders are preserved.Additionally, add one parameter to queuecommand,
struct Scsi_Host *
and remove one parameter from queuecommand,
void (*done)(struct scsi_cmnd *)Scsi_Host* is a convenient pointer that most host drivers need anyway,
and ‘done’ is redundant to struct scsi_cmnd->scsi_done.Minimal code disturbance was attempted with this change. Most drivers
needed only two one-line modifications for their host lock push-down.
I checked some of the patches in the kernel tree and indeed they seemed quite simple. This is the patch for the aha1542 driver:
--- a/drivers/scsi/aha1542.c
+++ b/drivers/scsi/aha1542.c
@@ -558,7 +558,7 @@ static void aha1542_intr_handle(struct Scsi_Host *shost)
};
}
-static int aha1542_queuecommand(Scsi_Cmnd * SCpnt, void (*done) (Scsi_Cmnd *))
+static int aha1542_queuecommand_lck(Scsi_Cmnd * SCpnt, void (*done) (Scsi_Cmnd *))
{
unchar ahacmd = CMD_START_SCSI;
unchar direction;
@@ -718,6 +718,8 @@ static int aha1542_queuecommand(Scsi_Cmnd * SCpnt, void (*done) (Scsi_Cmnd *))
return 0;
}
+static DEF_SCSI_QCMD(aha1542_queuecommand)
+
/* Initialize mailboxes */
static void setup_mailboxes(int bse, struct Scsi_Host *shpnt)
{
Patching the VHBA module
In the SVN repository for VHBA I found this patch:
--- trunk/vhba-module/vhba.c 2010/08/15 20:11:18 691
+++ trunk/vhba-module/vhba.c 2011/02/27 15:56:27 730
@@ -363,7 +363,7 @@
spin_unlock_irqrestore(&vhost->cmd_lock, flags);
}
-static int vhba_queuecommand(struct scsi_cmnd *cmd, void (*done)(struct scsi_cmnd *))
+static int vhba_queuecommand_lck(struct scsi_cmnd *cmd, void (*done)(struct scsi_cmnd *))
{
struct vhba_device *vdev;
int retval;
@@ -388,6 +388,12 @@
return retval;
}
+#ifdef DEF_SCSI_QCMD
+DEF_SCSI_QCMD(vhba_queuecommand)
+#else
+#define vhba_queuecommand vhba_queuecommand_lck
+#endif
+
static int vhba_abort(struct scsi_cmnd *cmd)
{
struct vhba_device *vdev;
(there’s actually a bit more, but that’s just a cosmetic change…)
The #ifdef / #else / #endif structure was included so that it will work on “older” kernels as well.
It looked similar enough to the official kernel changes, so I included that patch in my SlackBuild script and the VHBA module started working again without crashing my system!
RocketRaid module with similar behavior…
And then yesterday I saw a post on LinuxQuestions about similar problems with the RocketRaid rr232x driver. Since it’s also a “SCSI” driver and it also crashed, I imagined it to have the same cause.
I created a patch based on my little research and posted it in the forum:
--- rr232x-linux-src-v1.10/osm/linux/osm_linux.c 2009-07-15 22:28:28.000000000 -0300
+++ rr232x-linux-src-v1.10_patched/osm/linux/osm_linux.c 2011-03-19 20:27:49.000000000 -0300
@@ -874,7 +874,7 @@
}
}
-static int hpt_queuecommand (Scsi_Cmnd * SCpnt, void (*done) (Scsi_Cmnd *))
+static int hpt_queuecommand_lck (Scsi_Cmnd * SCpnt, void (*done) (Scsi_Cmnd *))
{
struct Scsi_Host *phost = sc_host(SCpnt);
PVBUS_EXT vbus_ext = get_vbus_ext(phost);
@@ -1408,6 +1408,12 @@
return 0;
}
+#ifdef DEF_SCSI_QCMD
+DEF_SCSI_QCMD(hpt_queuecommand)
+#else
+#define hpt_queuecommand hpt_queuecommand_lck
+#endif
+
static int hpt_reset (Scsi_Cmnd *SCpnt)
{
PVBUS_EXT vbus_ext = get_vbus_ext(sc_host(SCpnt));
Today the original poster answered that it worked “like a charm” ![]()
This patch can be downloaded from my site.
This entry was posted on Sunday, March 20th, 2011 at 12:50 and is filed under Linux, Slackware, kernel. You can follow any responses to this entry through the RSS 2.0 feed. You can leave a response, or trackback from your own site.

Dutch guy living in Brazil. Has worked in the IT business since it was invented (well, almost...). Interests: Slackware, Lego, Star Wars.
May 5th, 2011 at 17:16
Thanks for the patch! I just tried it out on Gentoo’s 2.6.37-gentoo-r4 version of the kernel with a recently purchased HighPoint 2320 and it did the trick.
May 5th, 2011 at 17:46
Glad I could be of help!
May 12th, 2011 at 16:42
Just found your blog while I was working on the RocketRaid 2340 driver for the new kernel myself. The same changes work beautifully on the rr2340 driver under Ubuntu (and I imagine most Linux’s). Wonderful work!
May 12th, 2011 at 19:37
Good news!
May 17th, 2011 at 21:55
Wow, talk about finding this in the nick of time. I blindly updated my kernel and forgot about the rr232x driver. Highpoint hasn’t updated that driver since 2009, so I’m not all that hopeful that they’ll release a fixed version.
May 17th, 2011 at 22:35
Yeah, that gives an idea on how they think about follow-up service…
August 4th, 2011 at 23:24
Patch works on rr26xx driver also — thanks for the fix.
August 5th, 2011 at 8:24
Good to know, thanks for the info!
September 21st, 2011 at 7:46
I’ve contacted hpt about future support for the 2340 specifically and they said they just want to support new high end products.
So no more hope for support from hpt. If I would have known I would have bought something else.
September 22nd, 2011 at 19:06
Man, you absolutely and totally rock!
I have to admit: I only just installed the driver, but the system – as of yet^^ – didn’t crash and I can now (via linux cmd or webgui) access my rr2320 e.g. my discs!
THANK YOU!
I know now, never to purchase Highpoint ever again, but for the time being you just saved my ass (and bucks
)
Hail to the king
December 24th, 2011 at 2:14
2 words: THANK YOU!!!!
worked on a 2.6.37 kernel (the lines number have changed… but easy to fuzz
)