What Are Survivable Computer Systems

By: Darren Miller


Definition Of A Survivable Computer System

----------------------------

A computer system designed to provide mission-critical services, which may be made up of multiple individual systems and components, must be able to perform in a consistent and timely manner under various operating conditions. It must be able to meet its goals and objectives whether it is operating normally, under stress, or in a hostile environment. A discussion of survivable computer systems can be very complex and far-reaching; in this article we will touch on just a few of the basics.



Computer Security And Survivable Computer Systems

--------------------------------------------------

Survivable computer systems and computer security are related in many ways, but at a low level they are very different. For instance, hardening a particular system against intelligent attacks may be one component of a survivable computer system. Hardening alone, however, does not address the ability of a computer system to fulfill its purpose when it is impacted by an event such as a deliberate attack, a natural disaster or accident, or a general failure. A survivable computer system must be able to adapt and perform its primary critical functions even in a hostile environment and even if various components of the computer system are incapacitated, in some cases even if the entire "primary" system has been destroyed.



As an example, suppose a system designed to provide real-time critical information regarding the analysis of specialized medications ceases to function for a few hours because of a widespread loss of communication. However, it maintains the validity of its data when communication is restored and systems come back online. This computer system could be considered to have survived under conditions outside of its control.



On the other hand, if the same system fails to provide continuous access to information under normal circumstances or in its normal operating environment because of a localized failure, it may not be judged to have fulfilled its purpose or met its objective.



Fault Tolerant And Highly Available Computer Systems

----------------------------

Many computer systems are designed with fault tolerant components so they continue to operate when key portions of the system fail. Examples include multiple power supplies, redundant disk drives or arrays, and even multiple processors and system boards, each of which can continue to function even if its peer component is destroyed or fails. The probability of all redundant components failing at the same time may be quite low. However, a malicious entity that knows how the redundant components are configured may be able to engineer critical failures across the board, rendering the fault tolerant components ineffective.
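To put rough numbers on the probability argument above, here is a minimal sketch in Python. It assumes that component failures are statistically independent, and it models a targeted attack as a single "common cause" that disables every redundant component at once. The function names and all of the probabilities are invented for illustration; they are not measurements of any real hardware.

```python
from math import prod


def prob_all_fail_independent(failure_probs):
    """Probability that every redundant component fails in the same period,
    assuming the individual failures are independent of one another."""
    return prod(failure_probs)


def prob_all_fail_with_common_cause(failure_probs, common_cause_prob):
    """Probability of total loss when one event (for example, an engineered
    attack) can disable all components at once with common_cause_prob."""
    independent = prob_all_fail_independent(failure_probs)
    # Total loss occurs if the common cause fires, or if it does not fire
    # and every component still happens to fail on its own.
    return common_cause_prob + (1 - common_cause_prob) * independent


if __name__ == "__main__":
    # Hypothetical failure probabilities for two redundant power supplies.
    power_supplies = [0.01, 0.01]
    print(prob_all_fail_independent(power_supplies))
    print(prob_all_fail_with_common_cause(power_supplies, 0.005))
```

Under these assumed figures, the common cause term dominates the result, which is the point of the paragraph above: redundancy lowers the odds of coincidental failure but does little against an event that hits every copy at once.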



High availability also plays a role in a survivable computer system. However, this design component may not maintain computer system survivability during certain events, such as various forms of malicious attack. An example of this might be a critical web service that has been duplicated across multiple machines to allow continuous functionality if one or more of the individual web servers were to fail. The problem is that many implementations of high availability use the same components and methodology on all of the individual systems. If an intelligent attack or malicious event is directed at a specific set of vulnerabilities on one of the individual systems, it is reasonable to assume that the remaining computer systems participating in the highly available implementation are susceptible to the same or similar vulnerabilities. A certain degree of variance must be achieved in how the systems participate in the highly available implementation.
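The concern about uniform components can be checked with a simple inventory comparison. The following Python sketch assumes each node in a highly available group is described by a list of component names; the node names, component names, and helper functions are all hypothetical and exist only to illustrate the idea.

```python
def shared_components(nodes):
    """Return the components that every node in the group has in common."""
    stacks = [set(stack) for stack in nodes.values()]
    if not stacks:
        return set()
    return set.intersection(*stacks)


def nodes_exposed(nodes, vulnerable_component):
    """List the nodes that run the given vulnerable component."""
    return sorted(
        name for name, stack in nodes.items() if vulnerable_component in stack
    )


if __name__ == "__main__":
    # Invented inventories: every name and version here is an assumption.
    uniform = {
        "web1": ["os-a 1.0", "httpd-x 2.4"],
        "web2": ["os-a 1.0", "httpd-x 2.4"],
    }
    varied = {
        "web1": ["os-a 1.0", "httpd-x 2.4"],
        "web2": ["os-b 3.2", "httpd-y 1.9"],
    }
    print(shared_components(uniform))
    print(nodes_exposed(uniform, "httpd-x 2.4"))
    print(nodes_exposed(varied, "httpd-x 2.4"))
```

In the uniform group, a flaw in one shared component exposes every node, while in the varied group the same flaw exposes only one. This is only a rough indicator, since different products can still share similar vulnerabilities.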



What's The Difference Between An Attack, Failure, And Accident?

How Do These Differences Impact A Survivable Computer System

----------------------------------------------------------

In many cases when I am discussing the security of systems with customers, the question of business continuity and disaster recovery comes up. Most companies that provide a service they deem critical simply know that the system needs to be operational in a consistent manner. However, there is typically little discussion about the various events or scenarios surrounding this, and that can lead to great disappointment in the future when what the customer thought was a "survivable computer system" does not meet their expectations. Some of the items I like to bring up during these conversations are what their computer system's goals and objectives are, what continuous operation specifically means to them, and what specifically constitutes an attack, failure, or accident that can cause loss of operation or failure to meet objectives.



A failure may be defined as a localized event that impacts the operation of a system and its ability to deliver services or meet its objectives. An example might be the failure of one or more critical or non-critical functions that affects the performance or overall operation of the system: say, the failure of a module of code that causes a cascading event and prevents redundant modules from performing properly, or a localized hardware failure that incapacitates the computer system.



An accident is typically an event that is outside the control of the system and the administrators of a local or private system. Examples include natural disasters such as hurricanes (if you live in south Florida like I do) or floods, or a widespread loss of power because the utility provider cut the wrong power lines during an upgrade to the grid. About two years ago, a client of mine who provides web-based document management services could not deliver revenue-generating services to their customers because a telecommunications engineer cut through a major phone trunk six blocks away from their office. They lost phone and data services for nearly a week.



And now we come to "attack". We all know accidents will happen and that everything fails at one time or another, and we can typically speculate on how these things will happen. An attack executed by an intelligent, experienced individual or group can be very hard to predict. There are many well-known and documented forms of attack. The problem is that intelligence and human imagination continuously advance the form of malicious attacks and can seriously threaten even the most advanced survivable computer system designs. An accident or failure does not have the ability to think outside the box or realize that a highly available design is flawed because all participants use the same design. The probability that an attack will occur and succeed may be quite low, but the impact may be devastating.
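One common way to weigh a low-probability, high-impact event against more frequent but milder ones is to compare expected loss, that is, probability multiplied by impact. The Python sketch below applies that idea; the event names, probabilities, and cost figures are invented purely for illustration and are not taken from any real assessment.

```python
def expected_loss(probability, impact):
    """Expected loss of an event: chance of occurrence times cost if it occurs."""
    return probability * impact


def rank_by_expected_loss(events):
    """Return event names ordered from highest to lowest expected loss.

    events maps a name to a (probability, impact) pair.
    """
    return sorted(
        events,
        key=lambda name: expected_loss(*events[name]),
        reverse=True,
    )


if __name__ == "__main__":
    # Hypothetical yearly probabilities and costs, chosen only as an example.
    events = {
        "localized hardware failure": (0.20, 10_000),
        "regional power outage": (0.05, 50_000),
        "targeted attack": (0.01, 1_000_000),
    }
    for name in rank_by_expected_loss(events):
        print(name, expected_loss(*events[name]))
```

With these assumed figures the rare attack ranks first, even though it is the least likely event. Expected loss is only one lens, and it does not capture how hard an intelligent attack is to predict in the first place.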



Conclusion

-----------------------------------------------

One of the reasons I wrote this article was to illustrate that it's not all about prevention. Although prevention is a big part of survivable computer system design, a critical computer system must be able to meet its objectives even when operating under hostile or stressful circumstances, or when the steps taken for prevention ultimately prove inadequate. It may be impossible to think of all the various events that can impact a critical computer system, but it is possible to reasonably define the possibilities.



The subject of survivable computer systems is one of complexity and ever-evolving technology. This article has only touched on a few of the basic aspects of computer system survivability. I intend to continue this article and delve deeper into the subject of survivable computer systems.



You may reprint or publish this article free of charge as long as the bylines are included.



Original URL (The Web version of the article)

------------

http://www.defendingthenet.com/NewsLetters/WhatAreSurvivableComputerSystems.htm


Article keywords: Survivable Computer Systems, Business Continuity, Disaster Recovery, Fault Tolerance, High Availability

Article Source: http://www.articles32.com

Darren Miller is an Information Security Consultant with over seventeen years of experience. He has written many technology and security articles, some of which have been published in nationally circulated magazines and periodicals. If you would like to contact Darren, you can e-mail him at Darren.Miller@defendingthenet.com. If you would like to know more about computer security, please visit us at www.defendingthenet.com.






