Subject: Revisited: Hurd on a cluster computer
From: Mark Morgan Lloyd
Date: Sat, 20 May 2017 12:18:01 +0000
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:45.0) Gecko/20100101 Thunderbird/45.8.0
Please excuse my raising my head above the parapet; most of the time I'm
a lurker, but I prefer the idea of a robustly-partitioned system, and I
think the industry-wide events of the last week or so reinforce that.
>> [Brent said] The payoff is a supercomputer operating system that
>> presents an entire cluster as a single POSIX system with hundreds
>> of processors, terabytes of RAM, and petabytes of disk space.
> [Richard replied] Most attempts in the past have failed. It seems
> better to build specialized cluster computers on top of local
> operating systems. Look for "single system image" on a search engine
> for projects with this goal.
Looking at this from an historical POV, I think there's another approach
which is process migration without an SSI. Specifically, I'd highlight
MOSIX and derivatives.
Unlike e.g. Amoeba, which I believe worked by locating a system with
spare capacity and starting a new process there, MOSIX worked by
starting a new process on the local computer and then migrating it to
another system, retaining local stubs for communication with the
kernel. A process could be moved multiple times to track spare
capacity, but it continued to talk to its original kernel via the stubs.
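That home-node arrangement can be sketched in a few lines of Python.
This is purely a toy illustration of my understanding of the design,
not MOSIX code; every name in it is invented. The point is that the
process body moves between nodes, while kernel interaction always goes
through the stub left behind on the home node:

```python
# Toy sketch (invented names, not real MOSIX code): a migrated process
# keeps talking to its original kernel through a stub on the home node.

class HomeNodeStub:
    """Stub left on the home node; performs syscalls on behalf of its
    migrated process against the home kernel's state."""
    def __init__(self):
        self.files = {}          # stand-in for the home kernel's file table

    def syscall(self, op, *args):
        # All kernel interaction happens here, on the home node,
        # regardless of where the process body currently runs.
        if op == "open":
            path, = args
            self.files[path] = []
            return path
        if op == "write":
            path, data = args
            self.files[path].append(data)
            return len(data)
        raise OSError(f"unsupported syscall {op!r}")

class MigratedProcess:
    """Process body that may be moved between nodes; the stub stays put."""
    def __init__(self, stub):
        self.stub = stub
        self.node = "home"

    def migrate(self, node):
        self.node = node         # the body moves; the stub does not

    def run(self):
        fd = self.stub.syscall("open", "/tmp/result")
        return self.stub.syscall("write", fd, f"ran on {self.node}")

stub = HomeNodeStub()
proc = MigratedProcess(stub)
proc.migrate("node-b")           # moved once...
proc.migrate("node-c")           # ...and again, tracking spare capacity
proc.run()
print(stub.files["/tmp/result"])  # → ['ran on node-c']
```

In the real system the stub-to-body link was of course a kernel-level
communication channel, not a method call, but the shape is the same:
the migrated body never sees any kernel other than its original one.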
I had it running something like 15 years ago, and found that it was
robust enough that if a collaborating system were booted during e.g. a
kernel compilation, work would immediately migrate onto it with no
user involvement at all. It was, obviously, sensitive to systems dying
without due warning: there was no checkpointing or program restart. But
even as it stood it was one of the more impressive things I've seen in
the industry.
I believe it was originally research with an anticipated commercial
spinoff. It was open-sourced as OpenMOSIX, and later renamed LinuxPMI
(Process Migration Infrastructure). I put enough time into it a couple
of years ago to determine which versions of the Linux kernel and
compiler it was compatible with (roughly speaking, Debian "Lenny"), and
I don't think anybody has done much more: more than anything else, Linux
is too much of a moving target for something which has fallen behind to
ever catch up, and its monolithic architecture probably doesn't help.
Apart from that, the known problems were that it relied on kernel
extensions written in assembler, it had no negotiation to ensure that
collaborating systems were binary-compatible, it had no authentication
or capability tracking, and it was probably not friendly to
applications which use shared memory for their IPC.
Finally, what is the Hurd portability situation? Way back I worked on a
microkernel in '386 protected mode that used segmentation heavily; am I
correct in assuming that that sort of thing is completely deprecated in
the interest of portability?
When I were a lad we used logic analysers to debug our code...
--
Mark Morgan Lloyd
markMLl .AT. telemetry.co .DOT. uk
[Opinions above are the author's, not those of his employers or colleagues]