Behaviorally anchored rating scales - AbsoluteAstronomy.com

Behaviorally anchored rating scales (BARS) are scales

Scale (social sciences)

In the social sciences, scaling is the process of measuring or ordering entities with respect to quantitative attributes or traits. For example, a scaling technique might involve estimating individuals' levels of extraversion, or the perceived quality of products...

used to rate performance

Job performance

Job performance is a commonly used, yet poorly defined concept in industrial and organizational psychology, the branch of psychology that deals with the workplace. It's also part of Human Resources Management. It most commonly refers to whether a person performs their job well...

. BARS are normally presented vertically with scale points ranging from five to nine. It is an appraisal method that aims to combine the benefits of narratives, critical incidents, and quantified ratings by anchoring a quantified scale with specific narrative examples of good, moderate, and poor performance.

Background

BARS were developed in response to dissatisfaction with the subjectivity involved in using traditional rating scales such as the graphic rating scale

Scale (social sciences)

. A review of BARS concluded that the strength of this rating format may lie primarily in the performance dimensions which are gathered rather than the distinction between behavioral and numerical scale anchors.

Benefits of BARS

BARS are rating scales that add behavioral scale anchors to traditional rating scales (e.g., graphic rating scales). In comparison to other rating scales, BARS are intended to facilitate more accurate ratings of the target person's behavior or performance. However, whereas the BARS is often regarded as a superior performance appraisal method, BARS may still suffer from unreliability

Reliability (statistics)

In statistics, reliability is the consistency of a set of measurements or of a measuring instrument, often used to describe a test. Reliability is inversely related to random error.-Types:There are several general classes of reliability estimates:...

, leniency bias and lack of discriminant validity

Discriminant validity

In psychology, discriminant validity tests whether concepts or measurements that are supposed to be unrelated are, in fact, unrelated.Campbell and Fiske introduced the concept of discriminant validity within their discussion on evaluating test validity. They stressed the importance of using both...

between performance dimensions.

Developing BARS

BARS can be developed using data collected through the critical incident technique

Critical Incident Technique

The Critical Incident Technique is a set of procedures used for collecting direct observations of human behavior that have critical significance and meet methodically defined criteria. These observations are then kept track of as incidents, which are then used to solve practical problems and...

, or through the use of comprehensive data about the tasks performed by a job incumbent, such as might be collected through a task analysis. In order to construct BARS, several basic steps, outlined below, are followed.

Examples of effective and ineffective behavior related to job are collected from people with knowledge of job using the critical incident technique. Alternatively, data may be collected through the careful examination of data from a recent task analysis.
These data are then converted into performance dimensions. To convert these data into performance dimensions, examples of behavior (such as critical incidents) are sorted into homogeneous groups using the Q-sort technique. Definitions for each group of behaviors are then written to define each grouping of behaviors as a performance dimension
A group of subject matter experts (SMEs) are asked to re-translate the behavioral examples back into their respective performance dimensions. At this stage the behaviors for which there is not a high level of agreement (often 50–75%) are discarded while the behaviors which were re-translated back into their resepctive performance dimensions with a high level of SME agreement are retained. The re-translation process helps to ensure that behaviors are readily identifiable with their respective performance dimensions.
The retained behaviors are then scaled by having SMEs rate the effectiveness of each behavior. These ratings are usually done on a 5- to 9-point Likert-type scale.
Behaviors with a low standard deviation
Standard deviation
Standard deviation is a widely used measure of variability or diversity used in statistics and probability theory. It shows how much variation or "dispersion" there is from the average...

(for examples, less than 1.50) are retained while behaviors with a higher standard deviation are discarded. This step helps to ensure SME agreement about the rating of each behavior.
Finally, behaviors for each performance dimensions, all meeting re-translation and criteria, will be used as scale anchors.

The source of this article is wikipedia, the free encyclopedia. The text of this article is licensed under the GFDL.