In the late nineteenth century, psychologist Edward Thorndike proposed the law of effect. The law of effect states that any behavior that has good consequences will tend to be repeated, and any behavior that has bad consequences will tend to be avoided. In the 1930s, another psychologist, B. F. Skinner, extended this idea and began to study operant conditioning. Operant conditioning is a type of learning in which responses come to be controlled by their consequences. Operant responses are often new responses. Just as Pavlov’s fame stems from his experiments with salivating dogs, Skinner’s fame stems from his experiments with animal boxes. Skinner used a device called the Skinner box to study operant conditioning. A Skinner box is a cage set up so that an animal can automatically get a food reward if it makes a particular kind of response. The box also contains an instrument that records the number of responses an animal makes. Psychologists use several key terms to discuss operant conditioning principles, including reinforcement and punishment. ReinforcementReinforcement is delivery of a consequence that increases the likelihood that a response will occur. Positive reinforcement is the presentation of a stimulus after a response so that the response will occur more often. Negative reinforcement is the removal of a stimulus after a response so that the response will occur more often. In this terminology, positive and negative don’t mean good and bad. Instead, positive means adding a stimulus, and negative means removing a stimulus. PunishmentPunishment is the delivery of a consequence that decreases the likelihood that a response will occur. Positive and negative punishments are analogous to positive and negative reinforcement. Positive punishment is the presentation of a stimulus after a response so that the response will occur less often. Negative punishment is the removal of a stimulus after a response so that the response will occur less often. Reinforcement helps to increase a behavior, while punishment helps to decrease a behavior.
Did you know you can highlight text to take a note? x
Operant conditioning, sometimes referred to as instrumental conditioning, is a method of learning that employs rewards and punishments for behavior. Through operant conditioning, an association is made between a behavior and a consequence (whether negative or positive) for that behavior. For example, when lab rats press a lever when a green light is on, they receive a food pellet as a reward. When they press the lever when a red light is on, they receive a mild electric shock. As a result, they learn to press the lever when the green light is on and avoid the red light. But operant conditioning is not just something that takes place in experimental settings while training lab animals. It also plays a powerful role in everyday learning. Reinforcement and punishment take place in natural settings all the time, as well as in more structured settings such as classrooms or therapy sessions. Operant conditioning was first described by behaviorist B.F. Skinner, which is why you may occasionally hear it referred to as Skinnerian conditioning. As a behaviorist, Skinner believed that it was not really necessary to look at internal thoughts and motivations in order to explain behavior. Instead, he suggested, we should look only at the external, observable causes of human behavior. Through the first part of the 20th century, behaviorism became a major force within psychology. The ideas of John B. Watson dominated this school of thought early on. Watson focused on the principles of classical conditioning, once famously suggesting that he could take any person regardless of their background and train them to be anything he chose. Early behaviorists focused their interests on associative learning. Skinner was more interested in how the consequences of people's actions influenced their behavior.
Skinner used the term operant to refer to any "active behavior that operates upon the environment to generate consequences." Skinner's theory explained how we acquire the range of learned behaviors we exhibit every day. His theory was heavily influenced by the work of psychologist Edward Thorndike, who had proposed what he called the law of effect. According to this principle, actions that are followed by desirable outcomes are more likely to be repeated while those followed by undesirable outcomes are less likely to be repeated. Operant conditioning relies on a fairly simple premise: Actions that are followed by reinforcement will be strengthened and more likely to occur again in the future. If you tell a funny story in class and everybody laughs, you will probably be more likely to tell that story again in the future. If you raise your hand to ask a question and your teacher praises your polite behavior, you will be more likely to raise your hand the next time you have a question or comment. Because the behavior was followed by reinforcement, or a desirable outcome, the preceding action is strengthened. Conversely, actions that result in punishment or undesirable consequences will be weakened and less likely to occur again in the future. If you tell the same story again in another class but nobody laughs this time, you will be less likely to repeat the story again in the future. If you shout out an answer in class and your teacher scolds you, then you might be less likely to interrupt the class again. Skinner distinguished between two different types of behaviors
While classical conditioning could account for respondent behaviors, Skinner realized that it could not account for a great deal of learning. Instead, Skinner suggested that operant conditioning held far greater importance. Skinner invented different devices during his boyhood and he put these skills to work during his studies on operant conditioning. He created a device known as an operant conditioning chamber, often referred to today as a Skinner box. The chamber could hold a small animal, such as a rat or pigeon. The box also contained a bar or key that the animal could press in order to receive a reward. In order to track responses, Skinner also developed a device known as a cumulative recorder. The device recorded responses as an upward movement of a line so that response rates could be read by looking at the slope of the line. There are several key concepts in operant conditioning. Reinforcement is any event that strengthens or increases the behavior it follows. There are two kinds of reinforcers. In both of these cases of reinforcement, the behavior increases.
Punishment is the presentation of an adverse event or outcome that causes a decrease in the behavior it follows. There are two kinds of punishment. In both of these cases, the behavior decreases.
Reinforcement is not necessarily a straightforward process, and there are a number of factors that can influence how quickly and how well new things are learned. Skinner found that when and how often behaviors were reinforced played a role in the speed and strength of acquisition. In other words, the timing and frequency of reinforcement influenced how new behaviors were learned and how old behaviors were modified. Skinner identified several different schedules of reinforcement that impact the operant conditioning process:
We can find examples of operant conditioning at work all around us. Consider the case of children completing homework to earn a reward from a parent or teacher, or employees finishing projects to receive praise or promotions. More examples of operant conditioning in action include:
In some of these examples, the promise or possibility of rewards causes an increase in behavior. Operant conditioning can also be used to decrease a behavior via the removal of a desirable outcome or the application of a negative outcome. For example, a child may be told they will lose recess privileges if they talk out of turn in class. This potential for punishment may lead to a decrease in disruptive behaviors. While behaviorism may have lost much of the dominance it held during the early part of the 20th century, operant conditioning remains an important and often used tool in the learning and behavior modification process. Sometimes natural consequences lead to changes in our behavior. In other instances, rewards and punishments may be consciously doled out in order to create a change. Operant conditioning is something you may immediately recognize in your own life, whether it is in your approach to teaching your children good behavior or in training the family dog. Remember that any type of learning takes time. Consider the type of reinforcement or punishment that may work best for your unique situation and assess which type of reinforcement schedule might lead to the best results. |