Home › Concepts › Operant Conditioning

Operant Conditioning

B.F. Skinner's account of how the consequences of our actions, reward and punishment, shape which behaviors we repeat and which we drop.

Reviewed by Michael Callans, MSW · 9 min read

Published July 16, 2026 · Last reviewed July 16, 2026

Illustration representing operant conditioning and reward-based learning

In short

Operant conditioning is a type of learning, developed by B.F. Skinner, in which behavior is shaped by its consequences. Behaviors followed by reinforcement become more likely to happen again, while behaviors followed by punishment become less likely. It distinguishes positive and negative reinforcement and positive and negative punishment, and it explains much of how habits, skills, and behavior are learned and changed.

What operant conditioning is

Operant conditioning is a form of learning in which the consequences of a behavior change how likely that behavior is to occur in the future. Unlike classical conditioning, which deals with automatic, reflexive responses, operant conditioning concerns voluntary behavior: the things an organism chooses to do.

The core principle is simple and powerful. Behavior that leads to favorable outcomes tends to be repeated, and behavior that leads to unfavorable outcomes tends to drop away. This is sometimes called the law of effect, an idea first proposed by Edward Thorndike before it was developed into a full system.

The American psychologist B.F. Skinner is the figure most associated with operant conditioning. Building on Thorndike's work, Skinner studied behavior with carefully controlled experiments and gave the field its central vocabulary of reinforcement and punishment.

Skinner and the operant chamber

Skinner devised a controlled apparatus, often called a Skinner box or operant chamber, to study learning precisely. A typical chamber contained a lever or key an animal could press, a mechanism to deliver food, and sometimes a floor that could deliver a mild electric shock.

Inside, an animal such as a rat or pigeon would explore and, by chance, perform a target action like pressing a lever. If the press delivered a food pellet, the animal pressed more often. If pressing the lever switched off an unpleasant shock, the animal again learned to press. By varying these consequences, Skinner could shape behavior reliably and measure it over time.

Skinner also described shaping, a technique of reinforcing successive approximations toward a goal behavior. By rewarding closer and closer attempts, an experimenter could train complex behaviors that the animal would never have produced on its own at first.

Reinforcement and punishment

Two concepts sit at the heart of the theory. Reinforcement makes a behavior more likely to recur. Punishment makes a behavior less likely to recur. Each can be positive, meaning something is added, or negative, meaning something is removed. The words positive and negative here mean added and subtracted, not good and bad.

Positive reinforcement adds something pleasant to encourage a behavior, such as praise or a treat for finishing homework. Negative reinforcement removes something unpleasant to encourage a behavior, such as the annoying seatbelt alarm stopping once you buckle up.

Positive punishment adds something unpleasant to discourage a behavior, such as a fine for speeding. Negative punishment removes something pleasant to discourage a behavior, such as losing screen time for breaking a rule. A common mistake is to treat negative reinforcement as a form of punishment. It is not; reinforcement always strengthens behavior.

Schedules of reinforcement

One of Skinner's most important contributions was showing that the timing and pattern of reinforcement, not just its presence, strongly affect learning. These patterns are called schedules of reinforcement.

Continuous reinforcement, rewarding every single instance of a behavior, produces fast learning but also fast extinction once the rewards stop. Partial or intermittent schedules, rewarding only some instances, produce slower learning but much more durable behavior.

Partial schedules come in several forms based on whether reinforcement depends on the number of responses (ratio) or the passage of time (interval), and whether it is fixed or variable. Variable-ratio schedules, where reinforcement comes after an unpredictable number of responses, produce the highest and steadiest rates of behavior. This is the principle behind gambling and slot machines, which is part of why such behaviors can be so persistent.

Operant conditioning in everyday life

Operant principles appear throughout daily life. Parents shape children's behavior with praise and consequences. Workplaces use bonuses, recognition, and performance reviews. Apps and games are engineered around variable rewards to keep users engaged. Habit formation can be understood as behavior that is repeatedly reinforced until it becomes automatic.

The same principles inform clinical and educational practice. Behavior therapy and applied behavior analysis use structured reinforcement to build skills and reduce harmful behaviors. Token economies, where desirable behavior earns tokens exchangeable for rewards, have been used in classrooms and treatment settings.

Research and ethics have refined how these tools are applied. Reinforcement is generally found to be more effective and less harmful than punishment for building lasting behavior, since punishment can suppress behavior without teaching what to do instead and may produce fear or resentment. Modern practice leans heavily on reinforcing desired behavior rather than punishing unwanted behavior.

Feature	Reinforcement (strengthens behavior)	Punishment (weakens behavior)
Positive (something added)	Add a reward, e.g. praise for chores	Add something unpleasant, e.g. a speeding fine
Negative (something removed)	Remove something unpleasant, e.g. alarm stops once buckled	Remove something pleasant, e.g. losing screen time
Effect on behavior	More likely to recur	Less likely to recur
Generally recommended?	Yes, builds lasting behavior	Less so, can suppress without teaching

Key takeaways

Operant conditioning, from B.F. Skinner, holds that behavior is shaped by its consequences.
Reinforcement makes a behavior more likely; punishment makes it less likely.
Positive means something is added and negative means something is removed; they do not mean good and bad.
Schedules of reinforcement affect how durable learning is, with variable-ratio schedules producing the most persistent behavior.
Reinforcement is generally more effective than punishment for building lasting behavior change.

Infographic explaining operant conditioning with reinforcement and punishment — Behavior is shaped by its consequences.

Trying to change a habit that won't budge?

A therapist can help you apply behavior-change principles to your own life. Search our directory for free.

Find a therapist

Frequently asked questions

What is operant conditioning in simple terms?

It is learning from consequences. Behaviors that are rewarded tend to be repeated, and behaviors that are punished tend to fade. B.F. Skinner developed the idea and gave it the vocabulary of reinforcement and punishment.

What is the difference between negative reinforcement and punishment?

Negative reinforcement removes something unpleasant to strengthen a behavior, so the behavior happens more. Punishment weakens a behavior so it happens less. The key is that reinforcement always increases behavior, while punishment decreases it.

What are the four types of operant conditioning?

Positive reinforcement (add a reward), negative reinforcement (remove something unpleasant), positive punishment (add something unpleasant), and negative punishment (remove something pleasant). Positive means added and negative means removed.

What is a schedule of reinforcement?

It is the pattern by which a behavior is rewarded. Continuous reinforcement rewards every instance, while partial schedules reward only some. Variable-ratio schedules, which reward after an unpredictable number of responses, produce the most persistent behavior.

How is operant conditioning different from classical conditioning?

Classical conditioning, studied by Pavlov, links a signal to an automatic, involuntary response. Operant conditioning shapes voluntary behavior through its consequences, using reinforcement and punishment.

Related concepts

Reviewed by Michael Callans, MSW
Reviewed for fidelity to Skinner's work and standard reinforcement terminology.

References

Skinner BF. The Behavior of Organisms: An Experimental Analysis. Appleton-Century; 1938.
Skinner BF. Science and Human Behavior. Macmillan; 1953.
Thorndike EL. Animal Intelligence: Experimental Studies. Macmillan; 1911.
Ferster CB, Skinner BF. Schedules of Reinforcement. Appleton-Century-Crofts; 1957.
Domjan M. The Principles of Learning and Behavior. 7th ed. Cengage Learning; 2015.

Important: This article is educational information, not a substitute for professional care or a diagnosis. If you are struggling, reach out to a licensed mental-health professional. In an emergency, call your local emergency number or, in the US, call or text 988.