[Bioclusters] Newbie question: simple low-admin non-threaded Debian-based cluster solution?

Glen Otero gotero at linuxprophet.com
Thu Jan 20 19:15:15 EST 2005


On Jan 20, 2005, at 4:04 PM, Speakman, John 
H./Epidemiology-Biostatistics wrote:

> Hi Glen
>
>  
>
> Thanks – I agree – Rocks looks great – but I agreed with the users not 
> to consider a non-Debian-based solution unless it simply will not work 
> any other way…  that’ll teach me to make pledges with users.

Bummer. Good luck. Everyone I talk to tells me that FAI is a pain in 
the ass. You might look into running Warewulf 
(http://warewulf-cluster.org/index.html) over Debian. You can pull the 
tarball out of the available srpms or convert the rpm to a .deb 
package.

Glen
>
>  
>
> John
>
>  
>
>
> From: bioclusters-bounces at bioinformatics.org 
> [mailto:bioclusters-bounces at bioinformatics.org] On Behalf Of Glen 
> Otero
> Sent: Thursday, January 20, 2005 6:56 PM
> To: Clustering, compute farming & distributed computing in life 
> science informatics
> Subject: Re: [Bioclusters] Newbie question: simple low-admin 
> non-threaded Debian-based cluster solution?
>
>  
>
> Check out Rocks (http://www.rocksclusters.org). IMHO it is much better 
> than FAI and SIS. It also includes SGE.
>
>   
>
> On Jan 20, 2005, at 3:47 PM, Speakman, John 
> H./Epidemiology-Biostatistics wrote:
>
>   
>
> Hello
>
>  
>
> If anyone can review the below and suggest a way to go, or even better 
> something I have gotten completely wrong, it would be much 
> appreciated!
>
>  
>
> Thanks
>
> John
>
>  
>
> Hardware:
>
>  
>
> Ten HP Proliant nodes, one DL380 and nine DL140.  Each node has two 
> 3.2Ghz Xeon processors.  They do not have a dedicated switch; the 
> infrastructure folks say they want to implement this using a VLAN.  We 
> have some performance concerns here but have agreed to give it a try.
>
>  
>
> User characteristics:
>
>  
>
> The users are biostatisticians who typically program in R; they often 
> use plug-in R modules like bioconductor.  They always want the newest 
> version of R right away.  Also they may also write programs in C or 
> Fortran.  Data files are usually small.  Nothing fancy like BLAST, 
> etc.
>
>  
>
> User concerns:
>
>  
>
> Users require a Linux clustering environment which enables them to 
> interact with the cluster as though it were a single system (via ssh 
> or X) but which will distribute compute-intensive jobs across nodes.  
> As the code is by and large not multithreaded, it is expected that 
> each job will be farmed out to an idle compute node and probably stay 
> there until it is done.   That’s fine.  In other words, to use all 
> twenty CPUs we will need twenty concurrent jobs.
>
>  
>
> Administration concerns:
>
>  
>
> The cluster must require the absolute minimum of configuration and 
> maintenance, because I’ve got to do it and I’m hardly ever around 
> these days.
>
>  
>
> Other concerns:
>
>  
>
> Users and administrators alike have a preference for Debian Linux over 
> other distributions.  Users also have an aversion to non-free 
> software.  Either or both of these considerations could be overridden 
> if the reasons were pressing.
>
>  
>
> Cluster software requirements:
>
>  
>
> (1)
>
>    
>
>  The cluster must have a mean of deploying Linux to the nodes and 
> keeping their configurations (including updates to the operating 
> system and applications, lists of users, printers, etc.) in 
> synchronization.
>
> (2)
>
>    
>
>  The cluster must have a means of transparently distributing jobs to 
> idle CPUs.  It’s not necessarily to actively rebalance this when a job 
> has started – it’s okay if, once tied to a node, it stays there.
>
>  
>
> Potential solutions:
>
>  
>
> We like the look of NPACI Rocks but its non-Debian-ness makes it a 
> last resort only.  What we would really like to try is a Debian 
> version of NPACI Rocks; in its absence we will probably have to use 
> two separate packages to fulfil the requirements of #1 and #2 above.
>
>  
>
> Sensible options for #1 seem to be:
>
> (1)
>
>       
>
>  SystemImager (www.systemimager.org)
>
> (2)
>
>       
>
>  FAI (http://www.informatik.uni-koeln.de/fai/), maybe also involving 
> the use of cfengine2 (http://www.iu.hio.no/cfengine/)
>
>  
>
> SystemImager is the better-established product and looks to be simpler 
> to set up than FAI and/or cfengine2, in both of which the learning 
> curve looks steep.  However, FAI seems more elegant and more like the 
> idea of “NPACI Rocks Debian” that we’re looking for, implying that 
> once set up FAI/cfengine2 will require less ongoing maintenance.
>
>  
>
> Sensible options for #2 seem to be:
>
>  
>
> (1)
>
>    
>
>  OpenMosix
>
> (2)
>
>    
>
>  OpenPBS
>
> (3)
>
>    
>
>  Sun GridEngine N1
>
>  
>
> Note: all of the above have commercial versions; we’d be reluctant to 
> consider them unless it means big savings in administration time and 
> effort.  We get the impression OpenMosix (and, to a lesser extent, 
> OpenPBS) have question marks over how much time and resources the 
> people maintaining these products have, suggesting bugs, instability 
> and not keeping up with kernel/library updates, etc.  Sun GridEngine 
> seems more robust but does not seem to have a big Debian user base.
>
>  
>
> What do you all should we try first?
>
>  
>
> Thanks!
>
> John
>
>  
>
>  
>
>  
>
> John Speakman
>
>  
>
> Manager, Clinical Research Systems
>
>  
>
> Memorial Sloan-Kettering Cancer Center
>
>  
>
> 307 East 63rd Street, New York NY 10021 USA
>
>  
>
> +1 646 735 8187 - SpeakmaJ at mskcc.org
>
>  
>
>  
>
>  
>
> =====================================================================
>
>   
>
> Please note that this e-mail and any files transmitted with it may be
>
>  privileged, confidential, and protected from disclosure under
>
>  applicable law. If the reader of this message is not the intended
>
>  recipient, or an employee or agent responsible for delivering this
>
>  message to the intended recipient, you are hereby notified that any
>
>  reading, dissemination, distribution, copying, or other use of this
>
>  communication or any of its attachments is strictly prohibited. If
>
>  you have received this communication in error, please notify the
>
>  sender immediately by replying to this message and deleting this
>
>  message, any attachments, and all copies and backups from your
>
>  computer.
>
>   
>
> _______________________________________________
>
>  Bioclusters maillist - Bioclusters at bioinformatics.org
>
>  https://bioinformatics.org/mailman/listinfo/bioclusters
>
>   
>
> Glen Otero Ph.D.
>
>  Linux Prophet
>
>   
> _______________________________________________
> Bioclusters maillist  -  Bioclusters at bioinformatics.org
> https://bioinformatics.org/mailman/listinfo/bioclusters
>
Glen Otero Ph.D.
Linux Prophet

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: text/enriched
Size: 24422 bytes
Desc: not available
Url : http://bioinformatics.org/pipermail/bioclusters/attachments/20050120/6b8a9ca5/attachment-0001.bin


More information about the Bioclusters mailing list