This site best when viewed with a modern standards-compliant browser. We recommend Firefox Get Firefox!.

Linux-HA project logo
Providing Open Source High-Availability Software for Linux and other OSes since 1999.

USA Flag UK Flag

Japanese Flag

Homepage

About Us

Contact Us

Legal Info

How To Contribute

Security Issues

1 February 2010 Hearbeat 3.0.2 released see the Release Notes

18 January 2009 Pacemaker 1.0.7 released see the Release Notes

16 November 2009 LINBIT new Heartbeat Steward see the Announcement

Last site update:
2014-11-26 12:39:03

Linux-HA Success Stories

The Linux-HA project has been in production in mission-critical applications since about 1999. Although there are thousands of production sites running it, a few have volunteered to tell their story for the Linux-HA web site.

Although the information below is organized by Industry, you can also view SuccessStoriesByApplication, and SuccessStoriesByProjectFriend (or you will be able to when we fix up those pages ;-)), and of course SuccessStoriesIndex.

Contents

  1. Linux-HA Success Stories
  2. Success Stories in Biosciences
  3. Success Stories in Computers and Electronics
  4. Success Stories in Education
  5. Success Stories in Engineering
  6. Success Stories in Financial Industries
  7. Success Stories in Food Services
  8. Success Stories in Government
  9. Success Stories in Manufacturing
  10. Success Stories in Media
  11. Success Stories in Retail
  12. Success Stories in Telecommunications
  13. Success Stories in Transportation
  14. Success Stories in Utilities
  15. Success Stories in xSPs

Success Stories in Biosciences

Emageon provides medical imaging and archiving solutions for the healthcare industry.

This entry courtesy of BrianTinsley

We use heartbeat to provide resource failover in our Linux cluster solutions that power our medical image archive/distribution systems on IBM x-series machines. We are very proud of what we have been able to accomplish with Linux, but without software like heartbeat and Linux Virtual Server our solutions would not be half as good as what they have become. Perhaps one day your X-Ray or CAT scans will wind up passing through one of these Linux systems!

NDCHealth corporation is a healthcare information services company that services pharmacies, hospitals, physicians, and pharmaceutical manufacturers.

This entry courtesy of StevenWilson

NDCHealth has applied heartbeat to a variety of services creating highly-available Linux solutions. With heartbeat's extensible 'init' style scripting support, it is used as a framework to be applied where high-availability Linux is a requirement.

The Sanger Institute is a genome research institute primarily funded by the Wellcome Trust. Our purpose is to further the knowledge of genomes, particularly through large scale sequencing and analysis.

This entry courtesy of Guy Coates

Ensembl is a joint project between the Sanger Institute and EMBL - European Bioinformatics Institute (EBI) to develop a software system which produces and maintains automatic annotation on selected eukaryotic genomes.

Ensembl releases all of its data into the public domain via a number of different sources, including a publicly available mysql database which contains the entire ensembl dataset (currently 2TB). The database handles 5 million queries and exports 20 Gbytes of data per day.

We use heartbeat to make the mysql service highly available. The database is hosted on a pair of dual Intel Xeon servers, with the data stored on a XFS file-system presented from our SAN. Heartbeat, via an APC MasterSwitch STONITH device, ensures that the database is only active on a single node at a time and provides fast fail-over in the case of hardware or network failures. Heartbeat enables scientists around the world to have continual access to Ensembl data.

Success Stories in Computers and Electronics

Heimdall's Limited provides high level security and networking services in New Zealand.

This entry courtesy of Martin Forest.

Apart from consulting and project services, Heimdall's provides a gui for iptables. We had problems convincing customers that iptables is a powerful and reliable firewall alternative. As a result, we wrote a powerful management interface for iptables. Bifrost manages the incoming and outgoing traffic flows with two rule sets in a very complex and powerful way. After the first public release, it did not take more than a few weeks until there were over 1000 installations of Bifrost (demo).

Based on demands from some of our licensed customers we wanted to implement High Availability for Bifrost. This was done with Heartbeat. It took a fair amount of lab work to get it right and to figure out what to do. We realised that if people have problems configuring iptables, they may have problems configuring Heartbeat. We therefore implemented Heartbeat management in Bifrost

High on the list is to implement VPN (freeswan) with Heartbeat. We have it working stably and reliably in the lab. We hope to have it implemented in V0.9.3 of Bifrost.

We are very happy with the reliability of Linux HA. We are deeply impressed with the work on following standards and high quality of code. Well done and congratulations to a success. And keep up the good work.

Incredimail provides provides Experience Email - an advanced email program that will forever change, both the look and feel of emails, as well as the behavior of the email program itself.

This entry courtesy of Alex Kramarov

w4.incredimail.com and w5.incredimail.com running as a cluster on www2 incredimail.com with heartbeat. They are a web cluster with apache running on IBM x330 servers.

Success Stories in Education

The Natural Resource Ecology Laboratory at Colorado State University centers on multi-disciplinary research in ecosystem science to improve our understanding of the complex interactions between humans, management activities, and ecosystems.

This entry courtesy of Ty! Boyack

We are using heartbeat at NREL, in both academic and research areas. Currently our uses include fully transparent firewalls (used in conjuction with mac spoofing and proxy arp), web servers (apache), and soon mysql servers. We have also been testing the system as a disk server, and expect that our next deployment of disk servers will be built on heartbeat-based failover clusters. This is a great product - I'd love to see it expand to N+1 cluster environments.

The Department of Geological Sciences at Southern Methodist University performs research on geophysical imaging, remote sensing, infrasound and seismo-acoustic sensing, tectonics, and seismicity of industrial mining explosions.

This entry courtesy of David P. Anderson

The Department of Geological Sciences runs a web and ftp server consisting of two inexpensive 933Mz systems configured to operate as a single server using linux virtual server with hardware redundancy and heartbeat monitoring provided by the Linux-HA utilities. This nifty low-cost solution provides a robust redundant, scalable architecture which avoids single points of failure and which incorporates content mirroring and automated fail-over. The system provides continuous infrasound and seismic data streams for the S.M.U. TXAR and NVAR seismographic stations, as well as web service for the Geological Sciences research programs. In that role it has also successfully survived three slashdottings in 2002-2003!

The Department of Informatics at the University of Bergen has as its goal to conduct education and research on a high international level.

This entry courtesy of Jan-Frode Myklebust, system engineer at BCCS

We use heartbeat in two clusters. One is for a two-node cluster doing failover Samba and NFS. The other is a two-node cluster for email (smtp, imap) and printing (cups).

One cluster uses both IBM xSeries machines and ServeRAID disk hardware.

FIXME, write more details ASAP...

Success Stories in Engineering

Heapy Engineering provides mechanical and electrical engineering and design services to architects, developers, governmental agencies, contractors and engineers of other disciplines.

This entry courtesy of Kirk Lawson

Heapy Engineering uses Heartbeat 0.4.7-1 with inclusion of AudibleAlarm on a two node fail-over cluster (no load balancing) Intranet Web Server. Even though it's "old" code, it's worked perfectly for us

Success Stories in Financial Industries

The Citysavingsbank Munich provides financial services in Bavaria, is one of the largest savingsbanks in Germany.

This entry courtesy of Jens Grigel of Citysavingsbank Munich.

We've just recently put our Linux-HA based Intranet DNS Servers in production. The system is built on 2 IBM x340 xSeries each with 1 x 1.0 GHz P3, 512MB Ram, ServeRAID 4LX, 3x18GB HD (Raid-1/HotSpare) and 5 Nics. The System is running under Redhat Linux 8.0, using heartbeat 0.4.9e, compiled from sources. Linux-HA uses DRBD-0.6.1 to sync the DNS data (dynamic DNS for W2K's Active Directory) between the two systems.

Ipfail is used to monitor the network links and the x340 builtin "Advanced System Management Processor" is our stonith device. To use the ASM as STONITH device I modified some existing stonith code (with rudimentary programming knowledge) to call a MPCLI script which logs into the other system via the ASM-Interconnect network and powercycles the x340.

As this combination works perfectly and stable we are migrating more IBM xSeries based standalone systems to Linux-HA systems. The next systems will be Intranet Webservers running on x340 xSeries and some internal firewall systems running on x335 and x345 xSeries. All of our xSeries are equipped with either ASM or "Remote Supervisor Adaptor" which lets us use our above mentioned STONITH implementation.

An anonymous South American company provides financial services in South America, and is part of a major South American financial group.

This entry courtesy of Renzo Alejandro Granados of http://conectiva.com.co/ who planned and installed the systems.

This customer uses Linux-HA to provide a a highly available Internet/Intranet solution (web, mail, webmail, proxy, etc). This system gives service to ~200 Internal Users, and access to their website (internal webmail and external content) to the Internet. Hardware consists of 2 IBM Netfinity 5500 (M10 and M20, 256 Mb RAM, ~18 Gb Hot Swap, etc...). This installation uses Conectiva Linux.

It is now their policy not to allow their internal infrastructure to be documented externally, so their name can no longer appear on this page.

SuccessStories_2fAnonSouthAmerican

Major Asset Manager in Scandanavia

This entry provided courtesy of Sigurd Urdahl of Linpro AS, who implemented this solution for the customer.

One of the largest asset managers in Scandinavia uses Heartbeat to implement failover clusters for three FTP and LPD queuing proxies. The proxies facilitate critical communication with external parties, and at the same time implement strong separation between internal and external networks.

Success Stories in Food Services

The One of the leading Bagel quick service restaurant chains in the US, with approximately 250 neighborhood stores.

This entry courtesy of Rubin Bennett of RB Technologies, who planned and installed the systems.

This customer has a Mandriva Linux cluster running hearbeat and DRBD/ ReiserFS. It serves as the Intranet/ SMTP gateway system for the entire organization (4000+ employees). It also runs an internal website that takes critical data from a legacy DOS system in the stores and imports it via a series of PHP pages and SQL*Plus scripts into the company's Oracle database. That site is also served by the HA cluster, and the data that gets imported is replicated between the systems.

Success Stories in Government

The Lujan Security Systems (Lujsansoft) is a provider of Linux-based badgereader systems.

This entry courtesy of Alan Richter

We are using VNC so that a) the customers can access it from whatever kind of computer they are using and b) Comtrol rocket port serial hubs so we don't have to deal with the headaches associated with long streches of RS232 or RS485 cabling.

We are using heartbeat to run the badge reader program, the driver for the serial hubs and VNC and it all works beautifully. The installation where I installed my two node cluster is a linear accelerator in Los Alamos NM USA and the badge reader system MUST stay up while the "beam" is running, it is a matter of health and safety.

Setting up heartbeat was cake and pie (piece of cake, easy as pie) I can't believe how easy it is to set up and use. Oh yeah it works perfectly too.

The US Fish and Wildlife Service is a division of the US government whose mission is working with others to conserve, protect and enhance fish, wildlife and plants and their habitats for the continuing benefit of the American people.

This entry courtesy of Patrick Scannel.

We use heartbeat and rsync to set up a 2 unit HA cluster that is a Samba PDC for a 100 user network. Samba does not offer BDC capabilities at this time, so without your HA software, I would not have made the switch to open source. Thanks. The machines run RH 7.2 and Samba 2.2.2. They are old desktop reject machines, one of which had no available serial ports, so I stuck 3 nics in each of them. One for the network, and 2 for heartbeat/rsync.

Success Stories in Manufacturing

Agilent Technologies - The premier measurement company -- advancing electronics, communications, life sciences and chemical analysis.

This entry courtesy of Ray Frush

Here at our Fort Collins, CO site we've implemented 3 Linux HA clusters so far.

Cluster 1 is a middle ware Oracle Database and Web server that was implemented when the native Oracle clustering tools failed to provide smooth failover. At the time of deployment, Oracle RAC-9 was a new product on Linux, and turned out to have some early adoptor issues. We switched to a dual active database and Linux-HA to keep the project moving. The cluster has been running for nearly 2 years. Cluster 2 was built as a second middleware set of Web servers.

Both Cluster 1 and Cluster 2 track and report realtime status data for our manufacturing process line. Availabilty of the realtime status data depends on the virtual server to be available to collect, process and present (via the web server) the data to management as well as operations.

Our third cluster is used as our site's e-mail master host. Using the virtual host to float between the two nodes allows for seamless avilabilty to the users while we perform maintainence. Due to the volume of e-mail, we would frequently swamp our upstream e-mail hop with queued mail if our old server was down for several hours.

Heartbeat is being tested for deployment of cluster nodes separated by building at our site. To avoid single room power failures (due to scheduled bi-annual system inspections) we are using Intel fiber LAN cards to drive heartbeat over 50micron fiber links between our buildings.

Gerfor SA is a plastics manufacturer in Colombia.

This entry courtesy of Renzo Alejandro Granados of conectiva.com.co who planned and installed the system.

Gerfor has deployed Linux-HA to provide highly available application servers (xdmcp, vnc) Giving access to applications such as StarOffice, Browsers, and clients of their mission critical system. The cluster has 2 servers (933 Mhz, 1Gb, 18Gb available to the OS after hw raid, 3 NICs) using Conectiva Linux 6.0 (up to date).

It gives service to 90 "thin clients" (old PCs). Normally each server has the half of the clients ("load balancing").

MAN Nutzfahrzeuge AG is one of the most important subsidiary companies of the MAN AG. Our business is it to build trucks and busses, which are shipped to the whole world. IT services and infrastructure across the group are consolidated in our data processing centers.

This entry courtesy of Tobias Mucke who planned, installed, configured and documented this solution.

We are now running Heartbeat in conjunction with DRBD in production since the beginning of 2003. This stable solution, known from expierence, makes it possible to provide highly available services and to guarantee this fact to our customers in service level agreements. The Heartbeat cluster solution is one part of our three tier architecture, with a loadbalanced cluster in front and an Oracle DB at the backend. It provides the application server for our central document management system which powers the whole intranet of our enterprise. Besides that the HA environment consists of an NFS server and an administration tool called CfEngine. For each of these services we configured an own resource group with its own IP addresses, filesystems and so on.

Thanks to the Heartbeat team for this great piece of software and the support given through the mailing list. Keep it up!

Sony is one of the best known names in audio, video, televisions, information systems, communications and electronic components.

The entry is courtesy of Shallyee Shang

Heartbeat is now running on a few production servers, which are very important in the manufacturing of many recent digital products. We have several Heartbeat clusters, in conjunction with DRBD, which are mainly constructed in two type in our systems. One is as a two node cluster running beside product lines, serving the equipment manufacturing process. Our other usage is as the load balancer of a big data warehouse server which run the Linux virtual server package. These clusters had run since Sep 2003, with almost no trouble occured.

As of July 2004, real failover has occured twice in our Heartbeat clusters. One is when the facility staff accidently pulled the power supply to the active server. When this happened, all services failed over to standby side successfully. The other is when a real hardware failure occured on active side. Although cluster did not fail over to the standby side automatically because of the nature of the hardware failure, we are happy that we could recover the services in a very short time.

Success Stories in Media

The Bavarian Radio Station, Munich

This entry courtesy of Werner Schultheiß

olympia.ard.de (=olympia.br-online.de), 1/2002. Web services "Olympic Games, Salt Lake City" for the German ARD with 2 IBM Netfinity a.k.a. x350 (4 CPUs, 2 GB RAM, 280 GB RAID5), heartbeat over 2 serial null modem cables, local iptables(8) firewall, rsync(8)-based data import by the inhouse staging and deploying system . System took a hit because of a typo, and heartbeat(8) took over once during the games. Nobody took any notice of it during the games ;-}. Gone to archive in 6/2002.

LVS evaluation cluster, 5/2002. Heartbeated (redundant) load balancer on 2 IBM x330 (2 CPUs, 1.2 GB RAM) with Linux LVS on Kernel 2.4.18, heartbeat over 1 serial and 1 ethernet cross cable connection which is also used by rsync(1) for data consistency. Worked great and is currently fall back solution if problems with BigIron switch persist.

firewall gateway internet-dmz, 7/2002. Hardware and system id. to "LVS evaluation cluster", plus 7 NIC ports and iptables firewall. It was a bit messy to include patches for 4-port-NIC D-LINK DFE-580TX into Kernel 2.4.18. System is up and running. The systems will migrate from HD to CDROM (diskless).

Several other installations are expected to be completed in 2002 and beyond. These include a firewall router (dmz-intranets), NFS fileserver, Samba file server, Oracle, MySQL and Postgres databases.

GREAT app, works perfectly. I would be happy to contribute with tech and work. Werner Schultheiß, Letras Berlin-Munich, 20020925.

The BBC exists to enrich people's lives with great programmes and services that inform, educate and entertain. Its vision is to be the most creative, trusted organisation in the world.

This entry courtesy of Neil Loffhagen

As an organisation we had the need to replace an old NT 4 Server running as a proxy for several overseas sites. It was decided to go for Squid, on a Linux box, distribution not initially decided. Company wanted some system with failover, so that if one server went down, another would automatically step in and take over the role. We needed a tool that would function over just a network connection, as the Servers would each be in different locations, although in the same site. After looking around the heartbeat tool was chosen. On paper it seemed very much to fill the function of swapping services between servers. Then we needed to chose a distro to run it on.

Initially was a bit hesitant as it seemed purely a text editing job. No real GUI to use for setup. On investigation realised that there was a GUI available with the SuSe, now Novell distribution. So installed SuSe on two Servers and went to use the GUI. Must admit to finding it a bit confusing :( So did a bit more reading of how the text files needed to be set up and found it not difficult at all. Like many an old Windows person, had a bit of an aversion of the command line and configuration text file editing. However, was pleasantly surprised. The two main files we edited are below:

ha.cf

bcast eth0
keepalive 2
warntime 10
deadtime 30
udpport 694
auto_failback on
node msquid01
node msquid02
ping 10.182.64.119
respawn hacluster /usr/lib/heartbeat/ipfail

haresources

msquid01 10.182.64.47 squid

We found the following a good introduction to Linux High-Availability, which should explain our chosen settings in a bit more detail - http://www.linux-mag.com/2003-11/availability_01.html

Once the above files were in place, we rebooted the Servers. Once rebooted, login as normal user and checked that the heartbeat and squid services are both running. And they were. Users were able to browse the web okay, using the virtual IP address. So, now the test. We did a ssh to the virtual IP and ended up logging in the main Server, which was what was expected. We then powered down the main server and users were able to continue browsing the web. We did another ssh to the virtual IP address and this time ended up on the backup server, which again was expected. Failover was working okay.

The above article recommends using serial connection, as well as Ethernet, but due to differing locations that was not possible in our case.

Another reason, we chose heartbeat, was the cost. Although we had to purchase SuSe we felt it was a cost saving compared to another major Linux distos that used their own failover tool. To save even more cost you should be able to use Debian or other "free" distros.

The Weather Channel® brings timely weather information to the world through cable networks, the internet, radio and newspapers.

This entry courtesy of Joe Henggeler

At The Weather Channel, we currently use heartbeat on a two-node cluster which handles transient storage for delivery/receipt of (1) critical weather data for our customers, and (2) video footage from cameramen. The hardware is : 2 Intel 1U's each connected to a 2-channel 350 GB RAID array. The services this cluster offers are ftp, nfs, http, and other custom services. The OS is Mandrake (to take advantage of ReiserFS). A future use for heartbeat within our organization is to port this to other operating systems (eg., Solaris, FreeBSD) for even more mission critical services.

From my experience, I was extremely impressed with the ease of installation/compilation on linux, and the stability of the cluster. This cluster has been running for approximately eight months (with forced manual failovers for updates and maintenance), and heartbeat has been running solid with virtually no interruptions in service.

Success Stories in Retail

Evozon has been offering software development and system administration services since 2005..

This entry courtesy of Tudor Precup, sales and marketing manager for Evozon

We used Heartbeat as a high availability solution for one of our customer at web, database and file system layers. The web and database cluster uses heartbeat to check the availability of the primary node, and to handle IP fail over to a peer node in case of a malfunction. The shared file system is configured with NFS, which is stored on a DRBD partition. Heartbeat manages the mounting of the DRBD partition and the start up of NFS. In case of failure, the services are moved to a peer node. The system was design for high availability at each level and is expected to offer a reliable service to over 1500 users.

In another scenario, we use Heartbeat in combination with DRBD for Mysql highly availability. This setup has been live and working for a long time and Heartbeat is managing the fail over in a professional manner.

Success Stories in Telecommunications

The ADC Telecommunications provides products, services, software and solutions to meet your customers' communication needs.

This entry courtesy of Tony Willoughby

We use Heartbeat (and DRBD) in two products. The first is the CUDA 12000 IP Access Switch. This is a Cable Modem Termination System (CMTS) which uses Linux for service provisioning and network management.

From the web page: Contains integrated service provisioning and element/network management functions eliminating the need for separate systems while capitalizing on the high availability Linux-based management module.

The second product is the FastFlow Broadband Provisioning Manager. The FBPM provides the same provisioning services that are available on the Cuda 12000 on a stand alone Linux platform.

Consultronics is a global leader in the field of telecommunications test equipment.

This entry courtesy of Greg Louis.

Consultronics Limited has at its head office a perimeter network (DMZ), of which the outer and inner bastions are pairs of Linux machines. At the time of writing, these are running kernel 2.4.13-ac4 with heartbeat-0.4.9.0k to provide service IP addresses. Our marketing folks want 24x365 availability of www.consultronics.com (we sell in 80 countries and the sun never sets on our customer base), and the financial and manufacturing staff want to be sure of access (through the tri-homed inner bastion) to the inner sanctum where the company's ERP system is kept. Running heartbeat on the bastions was a first step: the intent is to make the webserver itself an HA pair and to add a secondary route to the Internet to back up the existing T1.

Motorola is a global leader in providing integrated communications solutions and embedded electronic solutions.

This entry courtesy of Damian Ohara

Motorola has 4 HA pairs running in Swindon, UK. Three clusters serve an SMB front for NFS mounted data on EMC Celerras: PII 233MHz, 3GB HD, 96MB RAM, 2x 3com 905, SuSE. The other cluster serves site Anonymous FTP front for NFS mounted data on EMC Celerras and is similar to the other machines.

All machines are reclaimed - obsoleted from the desktop when we moved to Win2K. We moved to Linux for this service because SMB on Solaris is flaky as hell (our experience) and on HP-UX is slow slow slow.

Trans-Video is a small, independent Cable Television Company in Northfield, Vermont and provides video and data services to their customers.

This entry courtesy of Rubin Bennett of RB Technologies, who planned and installed the systems for Trans-Video.

Trans-Video uses a Linux-HA cluster that runs all of the client/ user services for their High Speed Cable Internet Service:

  • DHCP (failover configuration) for provisioning both Cable modems and Customer systems,
  • TFTP (boot file for cable modems),
  • WWW site / home pages for users (Apache),
  • FTP from internal addresses for users to update their sites,
  • SMTP for users' mail.

Success Stories in Transportation

The DFS Deutsche Flugsicherung GmbH (German Air navigation Services) is a government-owned company responsible for all air traffic control in Germany. With headquarters in Langen, near Frankfurt, DFS employs 5,200 staff, runs operations at four control centres and 19 international and regional airports across the country..

This entry courtesy of Alexander Schanz, Head of Linux Competence and Service Centre

Because system availability is vital to the safe handling of air traffic, DFS has implemented a fail-over solution by running Heartbeat on a pair of clustered servers at all international airports in Germany. The solution consists of two servers equipped with HW-Raid, DRBD and Heartbeat 2.x as cluster software. The application on these servers delivers important weather, runway and airport information to the air traffic controllers at 19 international Airports in Germany.

We have ported the application from a proprietary UNIX/Cluster Solution to Linux and Heartbeat and saved a considerable amount of money while maintaining the same level of reliability and gaining flexibility by using the Heartbeat/DRBD combination.

FedEx Inc. provides access to a growing global marketplace through a network of supply chain, transportation, business and related information services.

This entry courtesy of FedEx.

Our Wireless Data Manager (WDM) handles private-MMP and public-GPRS data sent from our trucks worldwide. We have 150+ WDM boxes distributed in most countries. Their primary use is to serve handheld devices for package delivery. I chose HA so that upstream hosts can connect to a single IP address instead of using clustering. This was after much discussion of other solutions such as load balancing (LB), and disaster recovery (DR).

We have a routing table which we could have shared using DRBD, but we chose instead to use anonymous-ftp for three reasons: 1) we are behind a firewall, 2) less key management, 3) we control the read/writes to this shared file.

Linux-HA (heartbeat) has been embraced by our group since the user interface is simple, solid, extendable and gives 15-second failover performance. We will probably add fencing via a STONITH device in the near future.

MAN Nutzfahrzeuge AG is one of the most important subsidiary companies of the MAN AG. Our business is it to build trucks and busses, which are shipped to the whole world. IT services and infrastructure across the group are consolidated in our data processing centers.

This entry courtesy of Tobias Mucke who planned, installed, configured and documented this solution.

We are now running Heartbeat in conjunction with DRBD in production since the beginning of 2003. This stable solution, known from expierence, makes it possible to provide highly available services and to guarantee this fact to our customers in service level agreements. The Heartbeat cluster solution is one part of our three tier architecture, with a loadbalanced cluster in front and an Oracle DB at the backend. It provides the application server for our central document management system which powers the whole intranet of our enterprise. Besides that the HA environment consists of an NFS server and an administration tool called CfEngine. For each of these services we configured an own resource group with its own IP addresses, filesystems and so on.

Thanks to the Heartbeat team for this great piece of software and the support given through the mailing list. Keep it up!

SideStep is a free low-fare travel search tool that helps you comparison shop for the Web's best travel deals.

This entry provided courtesy of Peter Mueller.

SideStep uses heartbeat in an LVS cluster with mon & ldirectord. Future plans for linux clusters include file servers & various other projects. Great product.

The Thessaloniki Port Authority is Greece's second largest port.

This entry courtesy of Giannis Stoilis

Our connection to the internet is handled by 2 Debian GNU/Linux servers(stable edition) communicating with heartbeat 0.4.9.0l-7.2 from Debian repository. In case of a failure at our main transparent proxy/internet gateway server, the backup server takes over the proxy's IP address, thus providing the port with a reliable connection to the internet.

Using a keepalive of 1 second and deadtime of 3 seconds, tests have shown that in case of a failure, the network connection can be offline for as little as half a second or for as long as 5 seconds, for the IP takeover change to finish.

Success Stories in Utilities

ISO New England manages the New England region's electric bulk power generation and transmission systems and associated tariffs. ISO New England also administers the regional wholesale electricity marketplace on an Internet-based markets system

This entry provided by Matt Soffen and Rudy Pawul.

We've been using heartbeat since around June 1999 and have had it in production since sometime around version 0.4.3. Initial use was strictly a webserver using rsync to syncronize the small amount of data involved every 15 minutes or so. We are now using it on 20 production clusters running samba, Oracle, dhcp, many other applications and, of course, web servers. We now synchronize our data using DRBD, as well as rsync. Including development and integration environments, we have over 50 heartbeat clusters on site. Using heartbeat linux clusters provides us the high availability we require, while allowing us to reduce the initial added hardware costs and subsequent support costs associated with commercial Unixes.

Editor's note: ISO-NE was the first production heartbeat customer in the world. Rudy wrote much of the documentation, and Matt acts as release coordinator for FreeBSD and other non-Linux versions, and they even put the Linux-HA logo on their site. Special thanks go out to Rudy and Matt!

Success Stories in xSPs

Allegiance Internet provides enterprise level Internet solutions to customers in the U.S.

This entry courtesy of KevinDwyer, Network Security Engineer.

We deployed several firewall pairs running heartbeat to protect the Network Operations Center and the Network Security/Abuse network. Heartbeat allows us to reuse older hardware without worrying about failures and thus interruptions in service.


Intercom TI is a Linux service company, offering commercial implementations of firewalls, Qmail servers and remote Linux server support and administration.

This entry courtesy of Francisco Jen Ou

We installed an HA pair for an ISP in Sao Paulo, Brazil, who offers wireless Internet access for some 2,500 users. The HA pair runs Conectiva Linux 8.0 (kernel 2.4.18) and heartbeat 0.4.9 over serial connection. Its main applications are:

bandwidth control, based on CQB, for limiting bandwidth usage by single users on the shared radio network.

firewall and NAT for the radio users

Apache with MySQL and Sun ASP (former ChiliSoft ASP), hosting CRM and ERP systems.

DNS cache and server based on djbdns.

Levonline AB is a Stockholm-based webhosting company using the latest technology based on Linux.

This entry courtesy of Jerker Nyberg

At Levonline, we use Heartbeat together with LVS for redundant load balancing of the webservers.

We also use Heartbeat for redundant MySQL (data is written to two servers and read from the active one), DNS and routing on the same machines. Here is a simple illustration of our setup: http://www.levonline.com/support/#lvs

Thank you for great software

The Norcross Group provides a comprehensive set of Internet Engineering resources (consulting services).

This entry courtesy of Greg Freemyer.

We've put in 2 HA installs for 2 different customers. Both are on Alpha's running Tru64 5.x I ported heartbeat 4.7 about 18 months ago, and that was what I used. With both I use externally shared storage, which I move from node to node.

One customer is an ISP and is running the typical suite of ISP apps: radius, ftp, web-server, smtp, pop.

The other is a small e-commerce company, and they are running Oracle DB and Oracle Application Server (OAS).

tummy.com, ltd. has been providing Linux system and network administration consulting services and hosting since 1995.

This entry courtesy of Sean Reifschneider

Heartbeat has allowed us to provide decreases in both planned and unplanned outages. To accomplish this, we created a fairly simple configuration involving switching the gateway IP address between the cluster nodes, and dynamic routing daemons for the links with our upstream ISPs.

Heartbeat and spare hardware allowed us to create a prototype pair of redundant routers at no cost beyond our time. Based on my experience with other commercial high availability solutions, I would estimate that heartbeat took no more time to learn and deploy than other high availability solutions.

After the initial test environment was set up and thoroughly tested, we were able to deploy the redundant routers with no additional cost outlay. We were able to re-purpose a recently released server to run as the standby node, bring that up in our network for a week to ensure there were no problems with it, and then re-load our regular router with the new configuration, and bring it up as the primary.

While we can now survive a number of networking and hardware failures, we're finding that one of the primary benefits of this solution is that we can do hardware and software maintenance on the routers while experiencing no downtime. A clean shutdown of the primary router results in a seamless fail-over to the secondary machine with no lost packets.

Heartbeat has allowed us to provide the benefits of a very expensive Cisco solution for the expense of a couple of dual-port gigabit cards. I haven't priced it out, but based on other hardware we've purchased I would estimate that a similar Cisco solution would have cost at least $20,000 and possibly up to $100,000. Heartbeat has allowed us to provide world-class service to our hosting clients at reasonable costs. Additionally, we were able to prototype the deployment and provide a proof-of-concept at no cost (more info).

Heartbeat is a great addition to the Linux software stack. Thanks Alan.

Editor's note: tummy.com, ltd. is one of the Linux-HA ProjectFriends, providing mailing list, CVS and other services for the project, for which we are most appreciative.

Web-Com is a Santa Clara, CA based Internet development firm specializing in web based applications for small and mid-sized business.

This entry courtesy of Joel Fowler

Web-Com Development is a web systems development firm located in Santa Clara, California. We also offer hosting for sites we develop. Some examples include our own http://www.web-com.com and http://www.democracynow.net.

We are currently in the process of developing webSentry, a web site monitoring and service management application, which will be offered as a service. The application is not yet publically available.

High availability was a requirement of webSentry. It includes both batch and online elements that are managed by heartbeat. HA was also beneficial to the other virtual web sites supported by these clustered servers.

Our application infrastructure consists of the following components: RH Linux, MySQL (with two-way replication between the clustered servers), Apache (including perl cgi, mod_perl, mod_ jk, and mod_ssl), Tomcat

Resources under heartbeat control: production VIP, serviceNanny.pl, webSentry Collectors (batch). ServiceNanny.pl is a perl script which (1) starts Tomcat, (2) monitors the health of important online resources (Apache, Tomcat, MySQL, DNS, and gateway access), and (3) takes appropriate remedial action (e.g. restarts Tomcat, initiates heartbeat failover, and reboots the system) to ensure online availability.