Skip to main content

Cornell University

High Availability Clustering and Data Recovery Options

Take advantage of the these options to keep your managed server available during routine maintenance, unexpected outages, or for recovery after a disaster. 

This article applies to: Managed Servers

On This Page

Options are available to provide for high availability during routine maintenance or unexpected outages and for service recovery in the event of a disaster. 

Select options during server setup on the New Server Configuration checklist. The Systems Support group will work with you to determine the best options based on your availability needs and goals.

Important: For critical services like DNS, servers are spread between both buildings, using different networks, routers, and even off-campus locations.

Using the 3-Tier Server Farm network and other high availability software, following are options that can help achieve your availability goals.

Servers in Different Buildings

What is it? Since the Server Farm network spans two buildings, servers can be installed on the same network in different buildings to allow service IPs, applications, storage, etc. to float from one node to the next. This option uses the high-availability cluster software, SUN clustering for the Solaris operating system and VMware for the Windows operating system. 

How Does it Help? In a disaster in which Rhodes Hall is lost, the Server Farm networks in CCC will be re-routed by hand through another router. IPs, storage, and applications will remain the same.

Multiple Connections per Server

What is it? Sun/Solaris and VMware servers can be connected to the same network through two different NICs (network interface cards) and two different tiered switches to have redundant paths to the network. 

How Does it Help? During routine switch maintenance and upgrades, the Solaris OS uses IPMP (IP network multipathing) to switch connections between tier 1 and tier 2 to provide high availability. IPMP is usually used with the Sun cluster software product, but can be used on stand-alone Sun servers for the same purpose. VMware uses active/active load balancing between the to NIC's for all the vlans it supports. Each virtual server automatically has access to this redundant network path.

Load Balancer and Several Back-end Servers

What is it? A load balancer can be used to direct traffic to more than one server behind the load balancer. The primary load balancer is on a tier 1 network. The backup load balancer is on a tier 2 network. 

How Does it Help? If some of the back-end servers are on a tier 1 network and some are on a tier 2 network, the service remains available during switch maintenance. To provide for disaster recovery, place back-end servers in both data centers as well. For more information, see the load balancer page.

Fault Tolerant Applications

What is it? Fault tolerant applications are designed to continue working despite network outages. Server names for the service are configured into the application on the user's workstation. The application attempts to connect to each server in turn until successful. 

How Does it Help? If your applications are designed for fault tolerance, to provide high availability to your users, distribute the servers on both tier 1 and tier 2 network switches. To provide for disaster recovery, place servers in both data centers as well.

Comments?

To share feedback about this page or request support, log in with your NetID

At Cornell we value your privacy. To view
our university's privacy practices, including
information use and third parties, visit University Privacy.