All Topics  
Software testing

 

   Email Print
   Bookmark   Link






 

Software testing



 
 
Software Testing is an empirical
Empirical

The word empirical denotes information gained by means of observation, experience, or experiment, as opposed to theory. A central concept in science and the scientific method is that all evidence must be empirical, or empirically based, that is, dependent on evidence or Logical consequence that are observable by the senses....
 investigation conducted to provide stakeholders with information about the quality of the product or service under test , with respect to the context in which it is intended to operate. This includes, but is not limited to, the process of executing a program or application with the intent of finding software bugs.

ing can never completely establish the correctness of computer software.






Discussion
Ask a question about 'Software testing'
Start a new discussion about 'Software testing'
Answer questions from other users
Full Discussion Forum



Encyclopedia


Software Testing is an empirical
Empirical

The word empirical denotes information gained by means of observation, experience, or experiment, as opposed to theory. A central concept in science and the scientific method is that all evidence must be empirical, or empirically based, that is, dependent on evidence or Logical consequence that are observable by the senses....
 investigation conducted to provide stakeholders with information about the quality of the product or service under test , with respect to the context in which it is intended to operate. This includes, but is not limited to, the process of executing a program or application with the intent of finding software bugs.

Overview

Testing can never completely establish the correctness of computer software. Instead, it furnishes a criticism or comparison that compares the state and behaviour of the product against oracles
Oracle machine

In computational complexity theory and Computability theory , an oracle machine is an abstract machine used to study decision problems. It can be visualized as a Turing machine with a black box, called an oracle, which is able to decide certain decision problems in a single operation....
—principles or mechanisms by which someone might recognize a problem. These oracles may include (but are not limited to) specifications, comparable products, past versions of the same product, inferences about intended or expected purpose, user or customer expectations, relevant standards, applicable laws, or other criteria.

Over its existence, computer software has continued to grow in complexity and size. Every software product has a target audience. For example, the audience for video game software is completely different from banking software. Therefore, when an organization develops or otherwise invests in a software product, it presumably must assess whether the software product will be acceptable to its end users, its target audience, its purchasers, and other stakeholders. Software testing is the process of attempting to make this assessment.

A study conducted by NIST in 2002 reports that software bugs cost the U.S. economy $59.5 billion annually. More than a third of this cost could be avoided if better software testing was performed.

History

The separation of debugging from testing was initially introduced by Glenford J. Myers in 1979. Although his attention was on breakage testing ("a successful test is one that finds a bug"), it illustrated the desire of the software engineering community to separate fundamental development activities, such as debugging, from that of verification. Dave Gelperin
Dave Gelperin

Doctor Dave Gelperin was the chair of the IEEE 829-1989 . With Jerry E. Durant he went on to develop High Impact Inspection Technology that builds upon traditional inspections but utilizes a test driven additive....
 and William C. Hetzel
William C. Hetzel

Doctor William C. Hetzel wrote the book The Complete Guide to Software Testing it continues to be pivotal in the software testing culture and remains a consistent source of reference....
 classified in 1988 the phases and goals in software testing in the following stages:

  • Until 1956 - Debugging oriented
  • 1957-1978 - Demonstration oriented
  • 1979-1982 - Destruction oriented
  • 1983-1987 - Evaluation oriented
  • 1988-2000 - Prevention oriented


Software testing topics


Scope

A primary purpose for testing is to detect software failures so that defects may be uncovered and corrected. This is a non-trivial pursuit. Testing cannot establish that a product functions properly under all conditions but can only establish that it does not function properly under specific conditions. The scope of software testing often includes examination of code as well as execution of that code in various environments and conditions as well as examining the aspects of code: does it do what it is supposed to do and do what it needs to do. In the current culture of software development, a testing organization may be separate from the development team. There are various roles for testing team members. Information derived from software testing may be used to correct the process by which software is developed.

Defects and failures

Not all software defects are caused by coding errors. One common source of expensive defects is caused by requirements gaps, e.g., unrecognized requirements, that result in errors of omission by the program designer. A common source of requirements gaps is non-functional requirements
Non-functional requirements

In systems engineering and requirements engineering, a non-functional requirement is a requirement that specify criteria that can be used to judge the operation of a system, rather than specific behaviors....
 such as testability
Testability

Testability, a property applying to an empirical hypothesis, involves two components: the logical property that is variously described as contingency, defeasibility, or falsifiability, which means that counterexamples to the hypothesis are logically possible, and the practical feasibility of observing a reproducibility series of such count...
, scalability
Scalability

In telecommunications and software engineering, scalability is a desirable property of a system, a network, or a process, which indicates its ability to either handle growing amounts of work in a graceful manner, or to be readily enlarged....
, maintainability
Maintainability

In software engineering, the ease with which a software product can be modified in order to:* correct defects* meet new requirements* make future maintenance easier, or...
, usability
Usability

Usability is a term used to denote the ease with which people can employ a particular tool or other human-made object in order to achieve a particular goal....
, performance
Computer performance

Computer performance is characterized by the amount of useful work accomplished by a computer system compared to the time and resources used.Depending on the context, good computer performance may involve one or more of the following:...
, and security
Computer security

Computer security is a branch of technology known as information security as applied to computers. The objective of computer security can include protection of information from theft or corruption, or the preservation of availability, as defined in the security policy....
.

Software faults occur through the following process. A programmer makes an error
Human Error

Human Error is the stage name of Rafal Kuczynski , a Poland electronic musician, working mostly in the ambient music genre, produced only with a computer....
 (mistake), which results in a defect
Fault (technology)

In document ISO/CD 10303-226, a fault is defined as an abnormal condition or defect at the component, equipment, or sub-system level which may lead to a failure....
 (fault, bug) in the software source code
Source code

In computer science, source code is any collection of statements or declarations written in some human-readable computer programming language....
. If this defect is executed, in certain situations the system will produce wrong results, causing a failure
Failure

Failure in general refers to the state or condition of not meeting a desirable or intended objective. It may be viewed as the opposite of success....
. Not all defects will necessarily result in failures. For example, defects in dead code
Dead code

Dead code is a computer programming term for code in the source code of a program which is executed but whose result is never used in any other computation....
 will never result in failures. A defect can turn into a failure when the environment is changed. Examples of these changes in environment include the software being run on a new hardware
Hardware

Hardware is a general term that refers to the physical cultural artifacts of a technology. It may also mean the physical components of a computer system, in the form of computer hardware....
 platform, alterations in source data
Source data

Source data is the origin of information found in electronic media.Often when data is captured in one electronic system and then transferred to another, there is a loss of audit trail or the inherent data cannot be absolutely verified....
 or interacting with different software. A single defect may result in a wide range of failure symptoms.

Compatibility

A frequent cause of software failure is compatibility
Computer compatibility

A family of computer models is said to be compatible if certain software that runs on one of the models can also be run on all other models of the family....
 with another application or new operating system
Operating system

An operating system is an interface between hardware and applications; it is responsible for the management and coordination of activities and the sharing of the limited resources of the computer....
 (or, increasingly web browser
Web browser

A Web browser is a application software which enables a user to display and interact with text, images, videos, music, games and other information typically located on a Web page at a website on the World Wide Web or a local area network....
 version). In the case of lack of backward compatibility
Backward compatibility

In technology, for example in telecommunications and computing, a device or technology is said to be backwards compatible if it allows input generated by older devices....
 this can occur because the programmers have only considered coding the programs for, or testing the software, on the latest operating system they havemay not be fully compatible with earlier mixtures of software/hardware. This could be considered a prevention oriented strategy that fits well with the latest testing phase suggested by Dave Gelperin
Dave Gelperin

Doctor Dave Gelperin was the chair of the IEEE 829-1989 . With Jerry E. Durant he went on to develop High Impact Inspection Technology that builds upon traditional inspections but utilizes a test driven additive....
 and William C. Hetzel
William C. Hetzel

Doctor William C. Hetzel wrote the book The Complete Guide to Software Testing it continues to be pivotal in the software testing culture and remains a consistent source of reference....
 cited below .

Input combinations and preconditions

A problem with software testing is that testing under all combinations of inputs and preconditions (initial state) is not feasible, even with a simple product. This means that the number of defect
Software bug

A software bug is an error, flaw, mistake, failure, or fault in a computer program that prevents it from behaving as intended . Most bugs arise from mistakes and errors made by people in either a program's source code or its software architecture, and a few are caused by compilers producing incorrect code....
s in a software product can be very large and defects that occur infrequently are difficult to find in testing. More significantly, non-functional
Non-functional requirements

In systems engineering and requirements engineering, a non-functional requirement is a requirement that specify criteria that can be used to judge the operation of a system, rather than specific behaviors....
 dimensions of quality (how it is supposed to be versus what it is supposed to do) -- for example, usability, scalability
Scalability

In telecommunications and software engineering, scalability is a desirable property of a system, a network, or a process, which indicates its ability to either handle growing amounts of work in a graceful manner, or to be readily enlarged....
, performance
Computer performance

Computer performance is characterized by the amount of useful work accomplished by a computer system compared to the time and resources used.Depending on the context, good computer performance may involve one or more of the following:...
, compatibility
Backward compatibility

In technology, for example in telecommunications and computing, a device or technology is said to be backwards compatible if it allows input generated by older devices....
, reliability -- can be highly subjective; something that constitutes sufficient value to one person may be intolerable to another.

Static vs. dynamic testing

There are many approaches to software testing. Reviews
Code review

Code review is systematic examination of computer source code intended to find and fix Software bug overlooked in the Software development, improving both the overall software quality and the developers' skills....
, walkthrough
Software walkthrough

In software engineering, a walkthrough or walk-through is a form of software peer review "in which a designer or programmer leads members of the development team and other interested parties through a software product, and the participants ask questions and make comments about possible errors, violation of development standards, and oth...
s or inspections
Software inspection

Inspection in software engineering, refers to peer review of any work product by trained individuals who look for defects using a well defined process....
 are considered as static testing
Static testing

Static testing is a form of software testing where the software isn't actually used. This is in contrast to dynamic testing. It is generally not detailed testing, but checks mainly for the sanity of the code, algorithm, or document....
, whereas actually executing programmed code with a given set of test case
Test case

A test case in software engineering is a set of conditions or variables under which a tester will determine whether an software application or software system meets specifications....
s is referred to as dynamic testing
Dynamic testing

Dynamic testing is a term used in software engineering to describe the testing of the dynamic behavior of code. That is, dynamic analysis refers to the examination of the physical response from the system to variables that are not constant and change with time....
. The former can be, and unfortunately in practice often is, omitted, whereas the latter takes place when programs begin to be used for the first time - which is normally considered the beginning of the testing stage. This may actually begin before the program is 100% complete in order to test particular sections of code (modules or discrete functions). For example, Spreadsheet
Spreadsheet

A spreadsheet is a computer application that simulates a paper worksheet. It displays multiple cells that together make up a grid consisting of rows and columns, each cell containing either alphanumeric text or numeric values....
 programs are, by their very nature, tested to a large extent "on the fly
On the fly

Colloquial usageIn colloquial use, on the fly means something created when needed. The phrase is used: to explain that something wasn't planned ahead, or...
" during the build process as the result of some calculation or text manipulation is shown interactively immediately after each formula is entered.

Software verification and validation

Software testing is used in association with verification and validation:
  • Verification
    Verification

    selfref|For Wikipedia's verification policy see...
    : Have we built the software right (i.e., does it match the specification?)? It is process based.
  • Validation
    Validation

    The word validation has several uses:* In common usage, validation is the process of checking if something satisfies a certain criterion. Examples would include checking if a statement is true , if an appliance works as intended, if a computer system is secure, or if computer data are compliant with an open standard....
    : Have we built the right software (i.e., is this what the customer wants?)? It is product based.


The software testing team

Software testing can be done by software testers. Until the 1950s the term "software tester" was used generally, but later it was also seen as a separate profession. Regarding the periods and the different goals in software testing there have been established different roles: test lead/manager, test designer, tester, test automater/automation developer, and test administrator.

Software Quality Assurance (SQA)

Though controversial, software testing may be viewed as an important part of the software quality assurance
Software quality assurance

Software quality assurance consists of a means of monitoring the software engineering processes and methods used to ensure quality. The methods by which this is accomplished are many and varied, and may include ensuring conformance to one or more standards, such as ISO 9000 or CMMI....
 (SQA) process. In SQA, software process specialists and auditors take a broader view on software and its development. They examine and change the software engineering process itself to reduce the amount of faults that end up in defect rate. What constitutes an acceptable defect rate depends on the nature of the software. An arcade video game designed to simulate flying an airplane would presumably have a much higher tolerance for defects than mission critical
Mission Critical

The term mission critical refers to any factor which is crucial to the successful completion of an entire project. It may also refer to a project the success of which is vital to the mission of the organization which attempts it....
 software such as that used to control the functions of an airliner. Although there are close links with SQA, testing departments often exist independently, and there may be no SQA function in some companies.

Software Testing is a task intended to detect defects in software by contrasting a computer program's expected results with its actual results for a given set of inputs. By contrast, QA is the implementation of policies and procedures intended to prevent defects from occurring in the first place.

Testing methods

Software testing methods are traditionally divided into black box testing
Black box testing

Black box testing takes an external perspective of the test object to derive test cases. These tests can be functional or non-functional, though usually functional....
 and white box testing
White box testing

White box testing uses an internal perspective of the system to design test cases based on internal structure. It requires programming skills to identify all paths through the software....
. These two approaches are used to describe the point of view that a test engineer takes when designing test cases.

Black box testing

Black box testing
Black box testing

Black box testing takes an external perspective of the test object to derive test cases. These tests can be functional or non-functional, though usually functional....
 treats the software as a black box without any knowledge of internal implementation. Black box testing methods include equivalence partitioning
Equivalence partitioning

Equivalence partitioning is a software testing technique in which test cases are designed to execute representatives from each equivalence partition, i.e....
, boundary value analysis
Boundary value analysis

Boundary value analysis is a software testing design technique used to determine test cases covering off-by-one errors....
, all-pairs testing
All-pairs testing

All-pairs testing or pairwise testing is a combinatorial software testing method that, for each pair of input parameters to a system , tests all possible discrete combinations of those parameters....
, fuzz testing
Fuzz testing

Fuzz testing, fuzzing, Robustness Testing or Negative Testing is a software testing technique that provides random data to the inputs of a computer program....
, model-based testing
Model-based testing

Model-based testing is software testing in which test cases are derived in whole or in part from a model that describes some aspects of the system under test ....
, traceability matrix
Traceability matrix

A traceability matrix is a table that correlates any two Baseline that require a many to many relationship to determine the completeness of the relationship....
, exploratory testing and specification-based testing.

Specification-based testing : Specification-based testing aims to test the functionality according to the requirements. Thus, the tester inputs data and only sees the output from the test object. This level of testing usually requires thorough test cases to be provided to the tester who then can simply verify that for a given input, the output value (or behavior), is the same as the expected value specified in the test case.

Specification-based testing is necessary but insufficient to guard against certain risks.


Advantages and disadvantages : The black box tester has no "bonds" with the code, and a tester's perception is very simple: a code MUST have bugs. Using the principle, "Ask and you shall receive," black box testers find bugs where programmers don't. BUT, on the other hand, black box testing is like a walk in a dark labyrinth without a flashlight, because the tester doesn't know how the back end was actually constructed. That's why there are situations when 1. A black box tester writes many test cases to check something that can be tested by only one test case and/or 2. Some parts of the back end are not tested at all

Therefore, black box testing has the advantage of an unaffiliated opinion on the one hand and the disadvantage of blind exploring on the other.

White box testing

White box testing
White box testing

White box testing uses an internal perspective of the system to design test cases based on internal structure. It requires programming skills to identify all paths through the software....
, by contrast to black box testing, is when the tester has access to the internal data structures and algorithms (and the code that implement these)

Types of white box testing
The following types of white box testing exist:
  • api testing - Testing of the application using Public and Private APIs.
  • code coverage
    Code coverage

    Code coverage is a measure used in software testing. It describes the degree to which the source code of a computer program has been tested. It is a form of testing that inspects the code directly and is therefore a form of white box testing....
     - creating tests to satisfy some criteria of code coverage. For example, the test designer can create tests to cause all statements in the program to be executed at least once.
  • fault injection
    Fault injection

    In software testing, fault injection is a technique for improving the Code coverage of a test by introducing faults in order to test code paths, in particular error handling code paths, that might otherwise rarely be followed....
     methods.
  • mutation testing methods.
  • static testing
    Static testing

    Static testing is a form of software testing where the software isn't actually used. This is in contrast to dynamic testing. It is generally not detailed testing, but checks mainly for the sanity of the code, algorithm, or document....
     - White box testing includes all static testing.


Code completeness evaluation
White box testing methods can also be used to evaluate the completeness of a test suite that was created with black box testing methods. This allows the software team to examine parts of a system that are rarely tested and ensures that the most important function points have been tested.


Two common forms of code coverage are:
  • function coverage, which reports on functions executed
  • and statement coverage, which reports on the number of lines executed to complete the test.
They both return a coverage metric, measured as a percentage.

Grey Box Testing

In recent years the term grey box testing has come into common usage. This involves having access to internal data structures and algorithms for purposes of designing the test cases, but testing at the user, or black-box level.Manipulating input data and formatting output do not qualify as grey-box because the input and output are clearly outside of the black-box we are calling the software under test. This is particularly important when conducting integration testing
Integration testing

'Integration testing' is the phase of software testing in which individual software modules are combined and tested as a group. It follows unit testing and precedes system testing....
 between two modules of code written by two different developers, where only the interfaces are exposed for test. Grey box testing may also include reverse engineering
Reverse engineering

Reverse engineering is the process of discovering the technological principles of a device, object or system through analysis of its structure, function and operation....
 to determine, for instance, boundary values or error messages.

Acceptance testing


Acceptance testing can mean one of two things:
  1. A smoke test
    Smoke test

    Smoke testing is a term used in plumbing, woodwind repair, electronics, computer software development, and the entertainment industry. It refers to the first test made after repairs or first assembly to provide some assurance that the system under test will not catastrophically fail....
     is used as an acceptance test prior to introducing a build to the main testing process.
  2. Acceptance testing performed by the customer is known as user acceptance testing (UAT).


Regression Testing

Regression testing is any type of software testing
Software testing

Software Testing is an empirical investigation conducted to provide stakeholders with information about the quality of the product or service under test , with respect to the context in which it is intended to operate....
 that seeks to uncover software regressions
Software regression

A software regression is a software bug which makes a feature stop functioning as intended after a certain event . A software performance regression is a situation where the software still functions correctly, but performs slowly or uses more memory when compared to previous versions....
. Such regressions occur whenever software functionality that was previously working correctly stops working as intended. Typically regressions occur as an unintended consequence
Unintended consequence

Unintended consequences are outcomes that are not the results originally intended in a particular situation. The unintended results may be foreseen or unforeseen, but they should be the logical or likely results of the action....
 of program changes. Common methods of regression testing include re-running previously run tests and checking whether previously fixed faults have re-emerged.

Non Functional Software Testing


Special methods exist to test non-functional aspects of software.
  • Performance testing checks to see if the software can handle large quantities of data or users
    Load testing

    Load testing is the process of putting demand on a system or device and measuring its response.In mechanical systems it refers to the testing of a system to certify it under the appropriate regulations, e.g....
    . This is generally referred to as software scalability
    Scalability

    In telecommunications and software engineering, scalability is a desirable property of a system, a network, or a process, which indicates its ability to either handle growing amounts of work in a graceful manner, or to be readily enlarged....
    . This activity of Non Functional Software Testing is often times referred to as Load Testing.
  • Usability testing
    Usability testing

    Usability testing is a technique used to evaluate a product by testing it on users. This can be seen as an irreplaceable usability practice, since it gives direct input on how real users use the system....
     is needed to check if the user interface is easy to use and understand.
  • Security testing
    Security Testing

    Security Testing: Process to determine that an IS protects data and maintains functionality as intended.The six basic security concepts that need to be covered by security testing are: confidentiality, integrity, authentication, authorization, availability and non-repudiation....
     is essential for software which processes confidential data and to prevent system intrusion by hackers
    Hacker (computer security)

    In common usage, a hacker is a person who breaks into computers. The subculture that has evolved around hackers is often referred to as the computer underground....
    .
  • Internationalization and localization
    Internationalization and localization

    In computing, internationalization and localization are means of adapting computer software to different languages and regional differences. Internationalization is the process of designing a software application so that it can be adapted to various languages and regions without engineering changes....
     is needed to test these aspects of software, for which a pseudolocalization
    Pseudolocalization

    Pseudolocalization is a software testing method that is used to test Internationalization and localization aspects of software. Specifically, it brings to light potential difficulties with localization by replacing localizable text with text that imitates the most problematic characteristics of text from a wide variety of languages, and by...
     method can be used.


In contrast to functional testing, which establishes the correct operation of the software (correct in that it matches the expected behavior defined in the design requirements), non-functional testing verifies that the software functions properly even when it receives invalid or unexpected inputs. Software fault injection
Fault injection

In software testing, fault injection is a technique for improving the Code coverage of a test by introducing faults in order to test code paths, in particular error handling code paths, that might otherwise rarely be followed....
, in the form of fuzzing
Fuzz testing

Fuzz testing, fuzzing, Robustness Testing or Negative Testing is a software testing technique that provides random data to the inputs of a computer program....
 is an example of non-functional testing. Non-functional testing, especially for software, is designed to establish whether the device under test can tolerate invalid or unexpected inputs, thereby establishing the robustness of input validation routines as well as error-handling routines. Various commercial non-functional testing tools are linked from the Software fault injection
Fault injection

In software testing, fault injection is a technique for improving the Code coverage of a test by introducing faults in order to test code paths, in particular error handling code paths, that might otherwise rarely be followed....
 page; there are also numerous open-source and free software tools available that perform non-functional testing.

Testing process

A common practice of software testing is performed by an independent group of testers after the functionality is developed before it is shipped to the customer. This practice often results in the testing phase being used as project
Project management

Project management is the List of academic disciplines of planning, organizing and managing resources to bring about the successful completion of specific project goals and objectives....
 buffer to compensate for project delays, thereby compromising the time devoted to testing. Another practice is to start software testing at the same moment the project starts and it is a continuous process until the project finishes.

In counterpoint, some emerging software disciplines such as extreme programming
Extreme Programming

Extreme Programming is a software engineering methodology prescribing a set of daily stakeholder Extreme Programming#Practices that embody and encourage particular Extreme Programming#XP values ....
 and the agile software development
Agile software development

Agile software development is a group of software development methodologies that are based on similar principles. Agile methodologies generally promote a project management process that encourages frequent inspection and adaptation, a leadership philosophy that encourages teamwork, self-organization and accountability, a set of engineering be...
 movement, adhere to a "test-driven software development
Test-driven development

Test-driven development is a software development technique that uses short development iterations based on pre-written test cases that define desired improvements or new functions....
" model. In this process unit tests are written first, by the software engineers
Software engineering

Software engineering is the application of a systematic, disciplined, quantifiable approach to the development, operation, and maintenance of software, and the study of these approaches....
 (often with pair programming
Pair programming

Pair programming is a software development technique in which two computer programmer work together at one keyboard. One types in code while the other code review each line of code as it's typed in....
 in the extreme programming methodology). Of course these tests fail initially; as they are expected to. Then as code is written it passes incrementally larger portions of the test suites. The test suites are continuously updated as new failure conditions and corner cases are discovered, and they are integrated with any regression tests that are developed. Unit tests are maintained along with the rest of the software source code and generally integrated into the build process (with inherently interactive tests being relegated to a partially manual build acceptance process).

Testing can be done on the following levels:
  • Unit testing tests the minimal software component, or module. Each unit (basic component) of the software is tested to verify that the detailed design for the unit has been correctly implemented. In an object-oriented environment, this is usually at the class level, and the minimal unit tests include the constructors and destructors.
  • Integration testing
    Integration testing

    'Integration testing' is the phase of software testing in which individual software modules are combined and tested as a group. It follows unit testing and precedes system testing....
     exposes defects in the interfaces and interaction between integrated components (modules). Progressively larger groups of tested software components corresponding to elements of the architectural design are integrated and tested until the software works as a system.
  • System testing
    System testing

    System testing of software or hardware is testing conducted on a complete, integrated system to evaluate the system's compliance with its specified requirements....
     tests a completely integrated system to verify that it meets its requirements.
  • System integration testing
    System integration testing

    System Integration Testing , in the context of software systems and software engineering, is a testing process that exercises a software system's coexistence with others....
     verifies that a system is integrated to any external or third party systems defined in the system requirements.


Before shipping the final version of software, alpha and beta testing are often done additionally:
  • Alpha testing is simulated or actual operational testing by potential users/customers or an independent test team at the developers' site. Alpha testing is often employed for off-the-shelf software as a form of internal acceptance testing, before the software goes to beta testing.
  • Beta testing comes after alpha testing. Versions of the software, known as beta versions, are released to a limited audience outside of the programming team. The software is released to groups of people so that further testing can ensure the product has few faults or bugs. Sometimes, beta versions are made available to the open public to increase the feedback
    Feedback

    Feedback describes the situation when output from an event or phenomenon in the past will influence the same event/phenomenon in the present or future....
     field to a maximal number of future users.


Finally, acceptance testing can be conducted by the end-user, customer, or client to validate whether or not to accept the product. Acceptance testing may be performed as part of the hand-off process between any two phases of development.

Regression testing

After modifying software, either for a change in functionality or to fix defects, a regression test
Regression testing

Regression testing is any type of software testing which seeks to uncover Software regression. Such regressions occur whenever software functionality that was previously working correctly stops working as intended....
 re-runs previously passing tests on the modified software to ensure that the modifications haven't unintentionally caused a regression
Software regression

A software regression is a software bug which makes a feature stop functioning as intended after a certain event . A software performance regression is a situation where the software still functions correctly, but performs slowly or uses more memory when compared to previous versions....
 of previous functionality. Regression testing can be performed at any or all of the above test levels. These regression tests are often automated
Test automation

Test automation is the use of software to control the execution of software testings, the comparison of actual outcomes to predicted outcomes, the setting up of test preconditions, and other test control and test reporting functions....
.

More specific forms of regression testing are known as sanity test
Sanity test

A sanity test or sanity check is a basic test to quickly evaluate the validity of a claim or calculation. In mathematics, for example, when multiplying by three or nine, verifying that the sum of digits, in the result, is a multiple of 3 or 9 , respectively, is a sanity test....
ing, when quickly checking for bizarre behavior, and smoke testing when testing for basic functionality.

Benchmark
Benchmark (computing)

In computing, a benchmark is the act of running a computer program, a set of programs, or other operations, in order to assess the relative performance of an object, normally by running a number of standard tests and trials against it....
s
may be employed during regression testing to ensure that the performance of the newly modified software will be at least as acceptable as the earlier version or, in the case of code optimization
Optimization (computer science)

In computing, optimization is the process of modifying a system to make some aspect of it work more efficiently or use fewer resources. For instance, a computer program may be optimized so that it executes more rapidly, or is capable of operating with less Computer data storage or other resources, or draw less power....
, that some real improvement has been achieved.

Finding faults


Finding faults early

It is commonly believed that the earlier a defect is found the cheaper it is to fix it. The following table shows the cost of fixing the defect depending on the stage it was found. For example, if a problem in the requirements is found only post-release, then it would cost 10–100 times more to fix than if it had already been found by the requirements review.

Time Detected
RequirementsArchitectureConstructionSystem TestPost-Release
Time Introduced Requirements 5–10× 10× 10–100×
Architecture - 10× 15× 25–100×
Construction - - 10× 10–25×


Testing Tools

Program testing and fault detection can be aided significantly by testing tools and debugger
Debugger

A debugger is a computer program that is used to test and debug other programs. The code to be examined might alternatively be running on an Instruction Set Simulator, a technique that allows great power in its ability to halt when specific conditions are encountered but which will typically be much slower than executing the code directly on...
s. Types of testing/debug tools include features such as:
  • Program monitors, permitting full or partial monitoring of program code including:
    • Instruction Set Simulator
      Instruction Set Simulator

      An instruction set simulator is a simulation model , usually coded in a high-level programming language, which mimics the behavior of a mainframe or microprocessor by "reading" instructions and maintaining internal variables which represent the processor's registers....
      , permitting complete instruction level monitoring and trace facilities
    • Program animation
      Program animation

      Techniques for program animationProgram animation is a particular feature of some Testing tools allowing programs to be executed step-by-step at either source code level or machine code level....
      , permitting step-by-step execution and conditional breakpoint
      Breakpoint

      A breakpoint, in software development, is an intentional stopping or pausing place in a computer program, put in place for debugging purposes. More generally, a breakpoint is a means of acquiring knowledge about a program during its execution....
       at source level or in machine code
      Machine code

      Machine code or machine language is a system of instructions and data executed directly by a computer's central processing unit. Machine code may be regarded as a primitive programming language or as the lowest-level representation of a compiled and/or assembly language computer program....
    • code coverage
      Code coverage

      Code coverage is a measure used in software testing. It describes the degree to which the source code of a computer program has been tested. It is a form of testing that inspects the code directly and is therefore a form of white box testing....
       reports
  • Formatted dump
    Dump

    Dump may refer to:* Landfill, a site for the disposal of waste * Dump , a Unix program for backing up file systems* Core dump, the recorded state of the working memory of a computer program at a specific time, generally when the program has terminated abnormally ...
     or Symbolic debugging, tools allowing inspection of program variables on error or at chosen points
  • Benchmark
    Benchmark

    The term benchmark originates from the chiseled horizontal marks that surveyors made, into which an angle-iron could be placed to bracket a levelling rod, thus ensuring that the levelling rod can be repositioned in exactly the same place in the future....
    s, allowing run-time performance comparisons to be made
  • Performance analysis
    Performance analysis

    In software engineering, performance analysis, more commonly today known as profiling, is the investigation of a program's behavior using information gathered as the program executes ....
    , or profiling tools that can help to highlight hot spots and resource usage


Some of these features may be incorporated into an Integrated development environment
Integrated development environment

An integrated development environment also known as integrated design environment or integrated debugging environment is a software application that provides comprehensive facilities to computer programmers for software development....
 (IDE).

Measuring software testing

Usually, quality is constrained to such topics as correctness
Correctness

In theoretical computer science, correctness of an algorithm is asserted when it is said that the algorithm is correct with respect to a program specification....
, completeness, security
Computer security audit

A computer security audit is a manual or systematic measurable technical assessment of a system or application. Manual assessments include interviewing staff, performing security vulnerability scans, reviewing application and operating system access controls, and analyzing physical access to the systems....
, but can also include more technical requirements as described under the ISO
International Organization for Standardization

The International Organization for Standardization , widely known as ISO , is an international standard-setting body composed of representatives from various national standards organizations....
 standard ISO 9126
ISO 9126

ISO 9126 is an international standard for the evaluation of software quality. The fundamental objective of this standard is to address some of the well known human biases that can adversely effect the delivery and perception of a software development project....
, such as capability, reliability
Reliability engineering

Reliability engineering is an engineering field, that deals with the study of reliability: the ability of a system or component to perform its required functions under stated conditions for a specified period of time....
, efficiency
Algorithmic efficiency

In computer science, efficiency is used to describe properties of an algorithm relating to how much of various types of resources it consumes. The two most frequently encountered are...
, portability
Porting

In computer science, porting is the process of adapting software so that an executable Computer program can be created for a computing environment that is different from the one for which it was originally designed ....
, maintainability
Maintainability

In software engineering, the ease with which a software product can be modified in order to:* correct defects* meet new requirements* make future maintenance easier, or...
, compatibility, and usability
Usability

Usability is a term used to denote the ease with which people can employ a particular tool or other human-made object in order to achieve a particular goal....
.

There are a number of common software measures, often called "metrics", which are used to measure the state of the software or the adequacy of the testing.

Testing artifacts

Software testing process can produce several artifacts
Artifact (software development)

The term artifact in connection with software development is largely associated with specific development methods or processes e.g., Unified Process....
.

Test case : A test case
Test case

A test case in software engineering is a set of conditions or variables under which a tester will determine whether an software application or software system meets specifications....
 in software engineering normally consists of a unique identifier, requirement references from a design specification, preconditions, events, a series of steps (also known as actions) to follow, input, output, expected result, and actual result. Clinically defined a test case is an input and an expected result. This can be as pragmatic as 'for condition x your derived result is y', whereas other test cases described in more detail the input scenario and what results might be expected. It can occasionally be a series of steps (but often steps are contained in a separate test procedure that can be exercised against multiple test cases, as a matter of economy) but with one expected result or expected outcome. The optional fields are a test case ID, test step or order of execution number, related requirement(s), depth, test category, author, and check boxes for whether the test is automatable and has been automated. Larger test cases may also contain prerequisite states or steps, and descriptions. A test case should also contain a place for the actual result. These steps can be stored in a word processor document, spreadsheet, database, or other common repository. In a database system, you may also be able to see past test results and who generated the results and the system configuration used to generate those results. These past results would usually be stored in a separate table.

Test script : The test script
Test script

A test script in software testing is a set of instructions that will be performed on the system under test to test that the system functions as expected....
 is the combination of a test case, test procedure, and test data. Initially the term was derived from the product of work created by automated regression test tools. Today, test script
Test script

A test script in software testing is a set of instructions that will be performed on the system under test to test that the system functions as expected....
s can be manual, automated, or a combination of both.

Test data : The most common test manually or in automation is retesting and regression testing
Regression testing

Regression testing is any type of software testing which seeks to uncover Software regression. Such regressions occur whenever software functionality that was previously working correctly stops working as intended....
, In most cases, multiple sets of values or data are used to test the same functionality of a particular feature. All the test values and changeable environmental components are collected in separate files and stored as test data. It is also useful to provide this data to the client and with the product or a project.

Test suite : The most common term for a collection of test cases is a test suite
Test suite

In software development, a test suite, less commonly known as a validation suite, is a collection of test cases that are intended to be used to test a software program to show that it has some specified set of behaviours....
. The test suite often also contains more detailed instructions or goals for each collection of test cases. It definitely contains a section where the tester identifies the system configuration used during testing. A group of test cases may also contain prerequisite states or steps, and descriptions of the following tests.

Test plan : A test specification is called a test plan
Test plan

A test plan is a systematic approach to testing a system such as a machine or software. The plan typically contains a detailed understanding of what the eventual workflow will be....
. The developers are well aware what test plans will be executed and this information is made available to the developers. This makes the developers more cautious when developing their code. This ensures that the developers code is not passed through any surprise test case or test plans.

Test harness : The software, tools, samples of data input and output, and configurations are all referred to collectively as a test harness
Test harness

In software testing, a test harness or automated test framework is a collection of software and test data configured to test a program unit by running it under varying conditions and monitoring its behavior and outputs....
.

A sample testing cycle

Although variations exist between organizations, there is a typical cycle for testing:
  • Requirements analysis
    Requirements analysis

    Requirements analysis in systems engineering and software engineering, encompasses those tasks that go into determining the needs or conditions to meet for a new or altered product, taking account of the possibly conflicting requirements of the various Stakeholder , such as beneficiaries or users....
    : Testing should begin in the requirements phase of the software development life cycle. During the design phase, testers work with developers in determining what aspects of a design are testable and with what parameters those tests work.
  • Test planning: Test strategy, test plan
    Test plan

    A test plan is a systematic approach to testing a system such as a machine or software. The plan typically contains a detailed understanding of what the eventual workflow will be....
    , testbed
    Testbed

    A testbed is a platform for experimentation for large New product development projects. Testbeds allow for rigorous, transparent and replicable testing of scientific theories, computational tools, and other new technologies....
     creation. A lot of activities will be carried out during testing, so that a plan is needed.
  • Test development: Test procedures, test scenarios
    Scenario test

    Scenario testing is a software testing activity that uses scenario tests, or simply scenarios, which are based on a hypothetical story to help a person think through a complex problem or system for a testing environment or they could be a description written in prose....
    , test case
    Test case

    A test case in software engineering is a set of conditions or variables under which a tester will determine whether an software application or software system meets specifications....
    s, test datasets, test scripts to use in testing software.
  • Test execution: Testers execute the software based on the plans and tests and report any errors found to the development team.
  • Test reporting: Once testing is completed, testers generate metrics and make final reports on their test effort
    Test effort

    In software development, test effort refers to the expenses for tests. There is a relation with test costs and failure costs . Some factors which influence test effort are: maturity of the Software Development Process, quality and testability of the testobject, test infrastructure, skills of staff members, quality goals and test strategy....
     and whether or not the software tested is ready for release.
  • Test result analysis: Or Defect Analysis, is done by the development team usually along with the client, in order to decide what defects should be treated, fixed, rejected (i.e. found software working properly) or deferred to be dealt with at a later time.
  • Retesting the resolved defects. Once a defect has been dealt with by the development team, it is retested by the testing team.
  • Regression testing: It is common to have a small test program built of a subset of tests, for each integration of new, modified or fixed software, in order to ensure that the latest delivery has not ruined anything, and that the software product as a whole is still working correctly.
  • Test Closure:Once the test meets the exit criteria, the activities such as capturing the key outputs, lessons learned, results, logs, documents related to the project are archived and used as a reference for future projects.


Certification

Several certification programs exist to support the professional aspirations of software testers and quality assurance specialists. No certification currently offered actually requires the applicant to demonstrate the ability to test software. No certification is based on a widely accepted body of knowledge. This has led some to declare that the testing field is not ready for certification. Certification itself cannot measure an individual's productivity, their skill, or practical knowledge, and cannot guarantee their competence, or professionalism as a tester.

Software testing certification types
Certifications can be grouped into: exam-based and education-based.
  • Exam-based certifications: For these there is the need to pass an exam, which can also be learned by self-study: e.g. for ISTQB or QAI.
  • Education-based certifications: Education based software testing certifications are instructor-led sessions, where each course has to be passed, e.g. IIST (International Institute for Software Testing).


Testing certifications
  • CATe offered by the International Institute for Software Testing
  • CBTS offered by the Brazilian Certification of Software Testing (ALATS)
  • Certified Software Tester (CSTE) offered by the Quality Assurance Institute (QAI)
  • Certified Software Test Professional (CSTP) offered by the International Institute for Software Testing
  • CSTP (TM) (Australian Version) offered by K. J. Ross & Associates
  • ISEB offered by the Information Systems Examinations Board
    ISEB

    ISEB is a worldwide exam body and a part of the British Computer Society.It began its life as a single qualification, the NCC and BCS collaborated on developing the 'Certificate in Systems Analysis and Design' for the Systems Analysis Examination Board....
  • ISTQB Certified Tester, Foundation Level (CTFL) offered by the International Software Testing Qualification Board
    ISTQB

    The ISTQB was founded in Edinburgh in November 2002. ISTQB A.I.S.B.L is as a legal entity based in Belgium.The ISTQB is responsible for the international qualification scheme called "ISTQB Certified Tester"....
     
  • ISTQB Certified Tester, Advanced Level (CTAL) offered by the International Software Testing Qualification Board
    ISTQB

    The ISTQB was founded in Edinburgh in November 2002. ISTQB A.I.S.B.L is as a legal entity based in Belgium.The ISTQB is responsible for the international qualification scheme called "ISTQB Certified Tester"....
     
  • CBTS offered by the Brazilian Certification of Software Testing (ALATS)
  • TMPF TMap® Next Foundation offered by theExamination Institute for Information Science


Quality assurance certifications
  • CSQE offered by the American Society for Quality
    American Society for Quality

    American Society for Quality , formerly known as American Society for Quality Control , is a knowledge-based global community of quality control experts, with nearly 85,000 members dedicated to the promotion and advancement of quality tools, principles, and practices in their workplaces and in their communities....
     (ASQ)
  • CSQA offered by the Quality Assurance Institute (QAI)
  • CQIA offered by the American Society for Quality
    American Society for Quality

    American Society for Quality , formerly known as American Society for Quality Control , is a knowledge-based global community of quality control experts, with nearly 85,000 members dedicated to the promotion and advancement of quality tools, principles, and practices in their workplaces and in their communities....
     (ASQ)
  • CMSQ offered by the Quality Assurance Institute (QAI)

Controversy

Some of the major software testing controversies
Software testing controversies

There is considerable variety among software testing writers and consultants about what constitutes responsible software testing. Members of the "context-driven" school of testing believe that there are no "best practices" of testing, but rather that testing is a set of skills that allow the tester to select or invent testing practices...
 include:

What constitutes responsible software testing : Members of the "context-driven" school of testing believe that there are no "best practices" of testing, but rather that testing is a set of skills that allow the tester to select or invent testing practices to suit each unique situation.

Agile vs. traditional : Should testers learn to work under conditions of uncertainty and constant change or should they aim at process "maturity"
Capability Maturity Model

The Capability Maturity Model in software engineering is a model of the maturity of the capability of certain business processes. A maturity model can be described as a structured collection of elements that describe certain aspects of maturity in an organization, and aids in the definition and understanding of an organization's processes....
? The agile testing
Agile testing

Agile testing is a software testing practice that follows the statutes of the agile software development, treating software development as the customer of testing....
 movement has received growing popularity since 2006 mainly in commercial circles , whereas government and military software providers are slow to embrace this methodology, and mostly still hold to CMMI.

Exploratory test vs. scripted : Should tests be designed at the same time as they are executed or should they be designed beforehand?

Manual testing vs. automated : Some writers believe that test automation
Test automation

Test automation is the use of software to control the execution of software testings, the comparison of actual outcomes to predicted outcomes, the setting up of test preconditions, and other test control and test reporting functions....
 is so expensive relative to its value that it should be used sparingly. Others, such as advocates of agile development, recommend automating 100% of all tests. More in particular, test-driven development
Test-driven development

Test-driven development is a software development technique that uses short development iterations based on pre-written test cases that define desired improvements or new functions....
 states that developers should write unit-tests of the x-unit type before coding the functionality. The tests then can be considered as a way to capture and implement the requirements.

Software design vs. software implementation : Should testing be carried out only at the end or throughout the whole process?

Who watches the watchmen? : The idea is that any form of observation is also an interaction, that the act of testing can also affect that which is being tested.

See also

  • Dynamic program analysis
    Dynamic program analysis

    Dynamic program analysis is the Program analysis that is performed by executing programs built from that software system on a real or virtual processor....
  • Formal verification
    Formal verification

    In the context of hardware and software systems, formal verification is the act of Mathematical proof or disproving the correctness of intended algorithms underlying a system with respect to a certain formal specification or property, using formal methods of mathematics....
  • Reverse semantic traceability
    Reverse semantic traceability

    Reverse Semantic Traceability is a quality control method for verification improvement that helps to insure high quality of Artifact s by backward translation at each stage of the software development process....
  • Static code analysis
    Static code analysis

    Static code analysis is the Program analysis that is performed without actually executing programs built from that software . In most cases the analysis is performed on some version of the source code and in the other cases some form of the object code....
  • GUI software testing
    GUI software testing

    In computer science, GUI software testing is the process of testing a product that uses a graphical user interface, to ensure it meets its written specifications....
  • Web testing
    Web testing

    Web testing is the name given to software testing that focuses on web applications, and is one of the fastest growing areas of software testing....
  • Pair Testing
    Pair Testing

    Pair Testing is a software development technique in which two team members work together at one keyboard to test the software application. One do the testing and other analyze the testing or review the testing.This can be done between one Tester and Developer or Business Analyst...


External links