All Topics  
Safety engineering

 

   Email Print
   Bookmark   Link






 

Safety engineering



 
 
Safety engineering is an applied science strongly related to systems engineering
Systems engineering

Systems engineering is an interdisciplinary field of engineering that focuses on how complex engineering projects should be designed and managed....
 and the subset System Safety Engineering. Safety engineering assures that a life-critical system
Life-critical system

A life-critical system or safety-critical system is a system whose failure ormalfunction may result in:* death or serious injury to people, or...
 behaves as needed even when pieces fail.

The process Ideally, safety-engineers take an early design of a system, analyze it to find what faults can occur, and then propose safety requirements in design specifications up front and changes to existing systems to make the system safer.






Discussion
Ask a question about 'Safety engineering'
Start a new discussion about 'Safety engineering'
Answer questions from other users
Full Discussion Forum



Recent Posts









Encyclopedia


Safety engineering is an applied science strongly related to systems engineering
Systems engineering

Systems engineering is an interdisciplinary field of engineering that focuses on how complex engineering projects should be designed and managed....
 and the subset System Safety Engineering. Safety engineering assures that a life-critical system
Life-critical system

A life-critical system or safety-critical system is a system whose failure ormalfunction may result in:* death or serious injury to people, or...
 behaves as needed even when pieces fail.

Overview


The process

Ideally, safety-engineers take an early design of a system, analyze it to find what faults can occur, and then propose safety requirements in design specifications up front and changes to existing systems to make the system safer. In an early design stage, often a fail-safe system can be made acceptably safe with a few sensors and some software to read them. Probabilistic fault-tolerant systems can often be made by using more, but smaller and less-expensive pieces of equipment.

Far too often, rather than actually influencing the design, safety engineer
Safety engineer

Scope of a Safety EngineerTo perform their professional functions, safety engineering professionals must have education, training and experience in a common body of knowledge....
s are assigned to prove that an existing, completed design is safe. If a safety engineer then discovers significant safety problems late in the design process, correcting them can be very expensive. This type of error has the potential to waste large sums of money.

The exception to this conventional approach is the way some large government agencies approach safety engineering from a more proactive and proven process perspective. This is known as System Safety
System safety

The system safety concept calls for a risk management strategy based on identification, analysis of hazards and application of remedial controls using a systems-based approach....
. The System Safety philosophy, supported by the System Safety Society and many other organizations, is to be applied to complex and critical systems, such as commercial airliners, military aircraft, munitions and complex weapon systems, spacecraft and space systems, rail and transportation systems, air traffic control system and more complex and safety-critical industrial systems. The proven System Safety methods and techniques are to prevent, eliminate and control hazards and risks through designed influences by a collaboration of key engineering disciplines and product teams. Software safety is a fast growing field since modern systems functionality are increasingly being put under control of software. The whole concept of system safety and software safety, as a subset of systems engineering, is to influence safety-critical systems designs by conducting several types of hazard analyses to identify risks and to specify design safety features and procedures to strategically mitigate risk to acceptable levels before the system is certified.

Additionally, failure mitigation can go beyond design recommendations, particularly in the area of maintenance. There is an entire realm of safety and reliability engineering known as "Reliability Centered Maintenance" (RCM), which is a discipline that is a direct result of analyzing potential failures within a system and determining maintenance actions that can mitigate the risk of failure. This methodology is used extensively on aircraft and involves understanding the failure modes of the serviceable replaceable assemblies in addition to the means to detect or predict an impending failure. Every automobile owner is familiar with this concept when they take in their car to have the oil changed or brakes checked. Even filling up one's car with gas is a simple example of a failure mode (failure due to fuel starvation), a means of detection (fuel gauge
Fuel gauge

A fuel gauge is an Measuring instrument used to indicate the level of fuel contained in a tank. Commonly used in cars, these may also be used for any tank including underground storage tanks....
), and a maintenance action (fill 'er up!).

For large scale complex systems, hundreds if not thousands of maintenance actions can result from the failure analysis. These maintenance actions are based on conditions (e.g., gauge reading or leaky valve), hard conditions (e.g., a component is known to fail after 100 hrs of operation with 95% certainty), or require inspection to determine the maintenance action (e.g., metal fatigue). The Reliability Centered Maintenance concept then analyzes each individual maintenance item for its risk contribution to safety, mission, operational readiness, or cost to repair if a failure does occur. Then the sum total of all the maintenance actions are bundled into maintenance intervals so that maintenance is not occurring around the clock, but rather, at regular intervals. This bundling process introduces further complexity, as it might stretch some maintenance cycles, thereby increasing risk, but reduce others, thereby potentially reducing risk, with the end result being a comprehensive maintenance schedule, purpose built to reduce operational risk and ensure acceptable levels of operational readiness and availability.

Analysis techniques

The two most common fault modeling techniques are called "failure modes and effects analysis" and "fault tree analysis". These techniques are just ways of finding problems and of making plans to cope with failures, as in Probabilistic Risk Assessment
Probabilistic risk assessment

Probabilistic risk assessment is a systematic and comprehensive methodology to evaluate risks associated with a complex engineered technological entity ....
 (PRA or PSA). One of the earliest complete studies using PRA techniques on a commercial nuclear plant was the Reactor Safety Study (RSS), edited by Prof. Norman Rasmussen (see WASH-1400
WASH-1400

WASH-1400, 'The Reactor Safety Study was a report produced in 1975 for the Nuclear Regulatory Commission by a committee of specialists under Professor Norman Carl Rasmussen....
)

Failure modes and effects analysis

In the technique known as "failure mode and effects analysis
Failure mode and effects analysis

A failure modes and effects analysis is a procedure for analysis of potential failure modes within a system for classification by severity or determination of the effect of failures on the system....
" (FMEA), an engineer starts with a block diagram of a system. The safety engineer
Safety engineer

Scope of a Safety EngineerTo perform their professional functions, safety engineering professionals must have education, training and experience in a common body of knowledge....
 then considers what happens if each block of the diagram fails. The engineer then draws up a table in which failures are paired with their effects and an evaluation of the effects. The design of the system is then corrected, and the table adjusted until the system is not known to have unacceptable problems. It is very helpful to have several engineers review the failure modes and effects analysis.

Fault tree analysis

First a little history to put FTA into perspective. It came out of work on the Minuteman Missile System. All the digital circuits used in the Minuteman Missile System were designed and tested extensively. The failure probabilities as well as failure modes well understood and documented for each circuit. I believe it was GTE/Sylvania, one of the prime contractors, discovered that the probability of failure for various components were easily constructed from the boolean expressions for those components. [Note there was one complex digital system constructed by GTE/Sylvania about that time with no logic diagrams only pages of boolean expressions. These worked out nicely because logic diagrams are designed to be read left to right the way the engineer creates the design. But when they fail the technicians must read them from right to left.] In any case this analysis of hardware lead to the use of the same symbology and thinking for what (with additional symbols) is now known as a Fault Tree. Note the de Morgan's equivalent of a fault tree is the success tree.

In the technique known as "fault tree analysis", an undesired effect is taken as the root ('top event') of a tree of logic. There should be only one Top Event and all concerns must tree down from it. This is also a consequence of another Minuteman Missile System requirement that all analysis be Top Down. By fiat there was to be no bottom up analysis. Then, each situation that could cause that effect is added to the tree as a series of logic expressions. When fault trees are labeled with actual numbers about failure probabilities, which are often in practice unavailable because of the expense of testing, computer program
Computer program

Computer programs are Instruction for a computer. A computer requires programs to function. Moreover, a computer program does not run unless its instructions are executed by a Central processing unit; however, a program may communicate an Algorithm#Formalization of algorithms to people without running....
s can calculate failure probabilities from fault trees.

Fault Tree
The Tree is usually written out using conventional logic gate
Logic gate

A logic gate performs a logical operation on one or more logic inputs and produces a single logic output. The logic normally performed is Boolean logic and is most commonly found in digital circuits....
 symbols. The route through a Tree between an event and an initiator in the tree is called a Cutset. The shortest credible way through the tree from Fault to initiating Event is called a Minimal Cutset.

Some industries use both Fault Trees and Event Trees (see Probabilistic Risk Assessment
Probabilistic risk assessment

Probabilistic risk assessment is a systematic and comprehensive methodology to evaluate risks associated with a complex engineered technological entity ....
). An Event Tree starts from an undesired initiator (loss of critical supply, component failure etc) and follows possible further system events through to a series of final consequences. As each new event is considered, a new node on the tree is added with a split of probabilities of taking either branch. The probabilities of a range of 'top events' arising from the initial event can then be seen.

Classic programs include the Electric Power Research Institute
Electric Power Research Institute

The Electric Power Research Institute conducts research on issues of interest to the electric power industry in the USA. EPRI is an independent, nonprofit organization funded by the electric utility industry....
's (EPRI) CAFTA software, which is used by almost all the US nuclear power plants and by a majority of US and international aerospace manufacturers, and the Idaho National Laboratory
Idaho National Laboratory

The Idaho National Laboratory is an 890-square-mile complex located in the desert land of eastern Idaho, between the town of Arco, Idaho and the city of Idaho Falls, at ....
's SAPHIRE
SAPHIRE

SAPHIRE is a probabilistic risk and reliability assessment software tool. SAPHIRE stands for Systems Analysis Programs for Hands-on Integrated Reliability Evaluations....
, which is used by the U.S. Government to evaluate the safety and reliability
Reliability engineering

Reliability engineering is an engineering field, that deals with the study of reliability: the ability of a system or component to perform its required functions under stated conditions for a specified period of time....
 of nuclear reactor
Nuclear reactor

A nuclear reactor is a device in which nuclear chain reactions are initiated, controlled, and sustained at a steady rate, as opposed to a nuclear bomb, in which the chain reaction occurs in a fraction of a second and is uncontrolled causing an explosion....
s, the Space Shuttle
Space Shuttle

NASA's Space Shuttle, officially called the Space Transportation System , is the spacecraft currently used by the United States government for its human spaceflight missions....
, and the International Space Station
International Space Station

The International Space Station is a research facility Assembly of the International Space Station in outer space. On-orbit construction of the station began in 1998, and is scheduled to be complete by 2011, with operations continuing until around 2015....
.

Safety certification

Usually a failure in safety-certified
Product certification

Product certification or product qualification is the process of certifying that a certain product has passed performance and quality assurance tests or qualification requirements stipulated in regulations such as a building code and nationally accredited test standards, or that it complies with a set of regulations governing quality an...
 systems is acceptable if, on average, less than one life per 109 hours of continuous operation is lost to failure. Most Western nuclear reactors, medical equipment, and commercial aircraft
Aircraft

An aircraft is a vehicle which is able to flight by being supported by the air, or in general, the atmosphere, of a planet. Examples include balloons, airplanes and helicopters....
 are certified to this level. The cost versus loss of lives has been considered appropriate at this level (by FAA for aircraft under Federal Aviation Regulations
Federal Aviation Regulations

The Federal Aviation Regulations, or FARs, are rules prescribed by the Federal Aviation Administration governing all aviation activities in the United States....
).

Preventing failure


Probabilistic fault tolerance: adding redundancy to equipment and systems


Once a failure mode is identified, it can usually be prevented entirely by adding extra equipment to the system. For example, nuclear reactors contain dangerous radiation
Radiation

In physics, radiation describes any process in which energy emitted by one body travels through a medium or through space, ultimately to be absorbed by another body....
, and nuclear reactions can cause so much heat
Heat

In physics and thermodynamics, heat is any transfer of energy from one body or thermodynamic system to another due to a difference in temperature....
 that no substance might contain them. Therefore reactors have emergency core cooling systems to keep the temperature down, shielding to contain the radiation, and engineered barriers (usually several, nested, surmounted by a containment building
Containment building

A containment building, in its most common usage, is a steel or Reinforced concrete structure enclosing a nuclear reactor. It is designed to, in any emergency, contain the escape of radiation to a maximum pressure in the range of 60 to 200 psi ....
) to prevent accidental leakage.

Most biological
Biology

Biology is a branch of the natural sciences concerned with the study of living organisms and their interaction with each other and their environment ....
 organisms have a certain amount of redundancy: multiple organs, multiple limbs, etc.

For any given failure, a fail-over, or redundancy can almost always be designed and incorporated into a system.

When does safety stop, where does reliability begin?


Assume there is a new design for a submarine
Submarine

A submarine is a watercraft capable of independent operation below water. It differs from a submersible, which has only limited underwater capability....
. In the first case, as the prototype
Prototype

A prototype is an original type, form, or instance of something serving as a typical example, basis, or standard for other things of the same category....
 of the submarine is being moved to the testing tank, the main hatch
Trapdoor

A trapdoor is a door set into a floor or ceiling . An exposed trapdoor could also be called a hatch, although hatches may not be necessarily horizontal....
 falls off. This would be easily defined as an unreliable hatch. Now the submarine is submerged to 10,000 feet
Foot

The foot is an anatomical structure found in many animals. It is the terminal portion of a limb which bears weight and allows locomotion. In many animals with feet, the foot is a separate organ at the terminal part of the leg made up of one or more segments or bones, generally including claws or nails....
, whereupon the hatch falls off again, and all on board are killed. The failure is the same in both cases, but in the second case it becomes a safety issue. Most people tend to judge risk on the basis of the likelihood of occurrence. Other people judge risk on the basis of their magnitude of regret, and are likely unwilling to accept risk no matter how unlikely the event. The former make good reliability engineers, the latter make good safety engineers.

Now let us say there is a need to design a Humvee
High Mobility Multipurpose Wheeled Vehicle

The High Mobility Multipurpose Wheeled Vehicle is a military Four-wheel drive motor vehicle created by AM General. It has largely supplanted the roles formerly served by the M151 1/4 ton MUTT, the Gama Goat, their M718A1 and M792 ambulance versions, the CUCV, and other light trucks with the Military of the United States, as well as being...
 with a rocket launcher attached. The reliability engineer could make a good case for installing launch switches all over the vehicle, making it very likely someone can reach one and launch the rocket. The safety engineer could make an equally compelling case for putting only two switches at opposite ends of the vehicle which must both be thrown to launch the rocket, thus ensuring the likelihood of an inadvertent launch was small. An additional irony is that it is unlikely that the two engineers can reconcile their differences, in which case a manager who doesn't understand the technology could choose one design over the other based on other criteria, like cost of manufacturing.

Inherent fail-safe design

When adding equipment is impractical (usually because of expense), then the least expensive form of design is often "inherently fail-safe". The typical approach is to arrange the system so that ordinary single failures cause the mechanism to shut down in a safe way. (For nuclear power plants, this is termed a passively safe design, although more than ordinary failures are covered.)

One of the most common fail-safe systems is the overflow tube in baths and kitchen sink
Kitchen sink

Kitchen sink may refer to:* Everything but the kitchen sink, an expression denoting excess* Kitchen Sink, a 1989 short film by Alison Maclean...
s. If the valve sticks open, rather than causing an overflow and damage, the tank spills into an overflow.

Another common example is that in an elevator
Elevator

An elevator or lift is a vertical transport vehicle that efficiently moves people or goods between floors of a building. They are generally powered by electric motors that either drive traction cables and counterweight systems, or pump hydraulic fluid to raise a cylindrical piston....
 the cable supporting the car keeps spring-loaded brake
Brake

A brake is a device for applying a force against the friction of the road, slowing or stopping the motion of a machine or vehicle, or alternatively a device to restrain it from starting to move again....
s open. If the cable breaks, the brakes grab rails, and the elevator cabin does not fall.

Inherent fail-safes are common in medical equipment, traffic and railway signals, communications equipment, and safety equipment.

Containing Failure

It is also common practice to plan for the failure of safety systems through containment and isolation methods. The use of isolating valves, also known as the Block and bleed manifold
Block and bleed manifold

A block and bleed manifold is a hydraulic manifold that combines one or more block/isolate hydraulic valve, usually ball valves, and one or more bleed/vent valves, usually ball or needle valves, into one component, for interface with other components of a hydraulics system....
, is very common in isolating pumps, tanks, and control valves that may fail or need routine maintenance. In addition, nearly all tanks containing oil or other hazardous chemicals are required to have containment barriers set up around them to contain 100% of the volume of the tank in the event of a catastrophic tank failure. Similarly, long pipelines have remote-closing valves periodically installed in the line so that in the event of failure, the entire pipeline is not lost. The goal of all such containment systems is to provide means of limiting the damage done by a failure to a small localized area.

See also

  • Earthquake engineering
    Earthquake engineering

    Earthquake engineering is the study of the behavior of buildings and structures subject to seismic loading. It is a subset of both structural engineering and civil engineering....
  • Forensic engineering
    Forensic engineering

    Forensics engineering is the investigation of material science, product , structures or components that fail or do not operate/function as intended, causing personal injury or damage to property....
  • Public safety
    Public Safety

    Public safety involves the prevention of and protection from events that could endanger the safety of the Public from significant danger, Injury, or Property damage, such as crimes or disasters ....
  • Safety engineer
    Safety engineer

    Scope of a Safety EngineerTo perform their professional functions, safety engineering professionals must have education, training and experience in a common body of knowledge....
  • System safety
    System safety

    The system safety concept calls for a risk management strategy based on identification, analysis of hazards and application of remedial controls using a systems-based approach....
  • Nuclear safety
    Nuclear safety

    Nuclear safety covers the actions taken to prevent nuclear and radiation accidents or to limit their consequences. This covers nuclear power plants as well as all other nuclear facilities, the transportation of nuclear materials, the use and storage of nuclear materials for medical, power, industry, and military uses....
  • Life-critical (also safety-critical)
  • Reliability engineering
    Reliability engineering

    Reliability engineering is an engineering field, that deals with the study of reliability: the ability of a system or component to perform its required functions under stated conditions for a specified period of time....
  • Reliability theory
    Reliability theory

    Reliability theory developed apart from the mainstream of probability and statistics. It was originally a tool to help nineteenth centuryMarine insurance and life insurance companies compute profitable rates to charge their customers....
  • Reliability theory of aging and longevity
    Reliability theory of aging and longevity

    Reliability theory of aging and longevity is a scientific approach aimed to gain theoretical insights into mechanisms of biological aging and species survival patterns by applying a general theory of systems failure, known as reliability theory....
  • Human reliability
    Human reliability

    Human reliability is related to the field of human factors engineering, and refers to the reliability of humans in fields such as manufacturing, transportation, the military, or medicine....
  • Risk assessment
    Risk assessment

    Risk assessment is a step in a risk management process. Risk assessment is the determination of quantitative or qualitative value of risk related to a concrete situation and a recognized threat ....
  • Risk management
    Risk management

    Risk management is activity directed towards the assessing, mitigating and monitoring of risks. In some cases the acceptable risk may be near zero....
  • Air brake (rail)
    Air brake (rail)

    An air brake is a conveyance brake applied by means of Gas compressor. Modern trains rely upon a fail-safe air brake system that is based upon a design patented by George Westinghouse on March 5, 1872....
  • Biomedical engineering
    Biomedical engineering

    Biomedical engineering is the application of engineering principles and techniques to the medical field. It combines the design and problem solving skills of engineering with medical and biological sciences to help improve patient health care and the quality of life of individuals....
  • SAPHIRE
    SAPHIRE

    SAPHIRE is a probabilistic risk and reliability assessment software tool. SAPHIRE stands for Systems Analysis Programs for Hands-on Integrated Reliability Evaluations....
     (risk analysis
    Risk analysis

    Risk Analysis can refer to:*Risk analysis **Probabilistic risk assessment, an engineering safety analysis*Risk analysis * Certified Risk Analyst...
     software)
  • Some of the techniques of safety engineering have been applied to the field of security engineering
    Security engineering

    Security engineering is a specialized field of engineering that deals with the development of detailed engineering plans and designs for security features, controls and systems....
    .
  • Redundancy (engineering)
    Redundancy (engineering)

    In engineering, redundancy is the duplication of critical wikt:Components of a system with the intention of increasing reliability of the system, usually in the case of a backup or fail-safe....
  • Double switching
    Double switching

    Double switching is the practice of using a multipole switch to close or open both the positive and negative sides of a Direct current electrical circuit, or both the hot and neutral sides of an Alternating current circuit....
  • Workplace safety
    Workplace safety

    Workplace safety is a category of management responsibility in places of employment.To ensure the safety and health of workers, managers establish a focus on safety that can include elements such as:...
  • DO-178B
    DO-178B

    DO-178B, Software Considerations in Airborne Systems and Equipment Certification is a guidance for software development published by RTCA, Incorporated....
  • DO-254
    DO-254

    DO-254, Design Assurance Guidance for Airborne Electronic Hardware is a standard for complex electronic hardware development published by RTCA, Incorporated....
  • ARP4761
    ARP4761

    ARP4761, Guidelines and Methods for Conducting the Safety Assessment Process on Civil Airborne Systems and Equipment is a standard from the Society of Automotive Engineers....
  • Hazard analysis
    Hazard analysis

    A hazard analysis is a process used to assess risk. The results of a hazard analysis is the identification of unacceptable risks and the selection of means of controlling or eliminating them....
  • Hazop
    Hazop

    Hazard and operability studies are a methodology for identifying and dealing with potential problems in industrial processes, particularly those which would create a hazardous situation or a severe impairment of the process....
  • Process Safety Management
    Process Safety Management

    Process Safety Management is a regulation, promulgated by the U.S. Occupational Safety and Health Administration , intended to prevent an incident like the 1984 Bhopal Disaster....


Further reading



External links

  • (official website)
  • (official website)
  • (official website)
  • (official website)
  • (official website)
  • Institution of Chemical Engineers
    Institution of Chemical Engineers

    Sorry, no overview for this topic
     (IChemE)