Title | 12 Goal directed Behaviour |
---|---|
Course | Psychobiology |
Institution | University of Sussex |
Pages | 6 |
File Size | 305.3 KB |
File Type | |
Total Downloads | 583 |
Total Views | 1,025 |
Learning and memory IIGoal directed action Ak Operant (or type -2) conditioning Pavlobian instrumental interactions Violation of basic assumptions of associative learning Functions of Learning and Memory: Allow for adaptive behaviour within individual lifetime. Non-Associative Learning. Learning tha...
Learning and memory II Goal directed action Ak -
Operant (or type -2) conditioning
-
Pavlobian instrumental interactions
-
Violation of basic assumptions of associative learning
Functions of Learning and Memory: -
Allow for adaptive behaviour within individual lifetime.
-
Non-Associative Learning.
-
Associative Learning.
-
Learning that stimuli exist in the world.
-
Learning associations between stimuli / events.
-
Learning association between actions and stimuli / events.
Thorndike’s Puzzle Box: Edward Thorndike (1874 - 1949)
-
Lever to open the door.
-
Used with cats.
-
Cats would move around until they accidentally hit the lever and the door opened.
-
The cats learned how to get out of the box.
-
As the number of trials increased, the time taken for the cat to get out decreased.
-
Operant Conditioning.
-
Law of Effect
Thorndike’s Laws of Learning: -
Behaviour that leads to a positive outcome is more likely to occur in the future.
-
Law of Exercise
-
Law of Readiness
-
Connections between responses and outcomes are strengthened by repetition.
-
Learning is motivated by an internal state.
-
Ex of internal state = hunger. -
If ur hungry and level gives food, u learn about the level faaster
Instrumental / Operant Conditioning: Frederick skinner 1904 - 1990 Skinner/ operant chamber; -
Stimulus light
-
Food hopper
-
Level that gives them food.
-
Learning adaptive behaviour. -
-
-
Through experience of success, failure. (Trial and Error)
Organism operates on the environment. -
Behaviour changes the environment.
-
(it explores the environment and learns (diff from pavlovs dogs cause they were given treats w/ bell where they didnt learn about environment)
Behaviour is instrumental or Goal-Directed. -
Obtains desired effect and is goal-directed.
-
Reward or avoid negative consequence -
-
Pavlovian conditions= stimuli associated w stimulus or stimuli and outcomes (S-S or S-O)
Associations between response and outcome (R-O Learning) -
Response outcome = R-O
Skinner’s Operant Behaviourism: Behaviour is shaped and maintained by its consequences
Reinforcer Stimulus / event that increases the likelihood of the preceding behaviour to occur. -
Positive Reinforcer -
Stimulus (usually positive) produced by the behaviour that increases the likelihood of the preceding behaviour to occur.
-
Negative Reinforcer -
Stimulus (usually negative) eliminated by the behaviour that increases the likelihood of the preceding behaviour to occur.
-
Punishment -
-
Negative stimulus / event that decreases the likelihood of the preceding behaviour to occur.
Omission -
Elimination of positive reinforcer decreases the likelihood of preceding behaviour
-
If pigeon is getting a bit of food and pressing the lever stops getting the food, it will stop pressing the level
Schedules of Reinforcement: A schedule of how they will reinforce a behavior (peck at lever = food every other time vs every time) Continuous Reinforcement -
Each behavioural response is reinforced.
Partial Reinforcement -
Behaviour is reinforced only part of the time.
Ratio Schedules -
Reinforcement given after every nth response. -
Fixed = response requirement always constant.
-
Variable = response requirement varies around average.
Interval Schedules -
Reinforcement given after a certain amount of time.
(responds at least once in __ seconds = reinforcements) -
Fixed = reward intervals constant. (the animal has to press it 10 times to get reward)
-
Variable = reward interval varies around mean time. (on average it has to respond a certain number of times (sometimes 5, sometimes 7 sometimes 3 but average of 5)
-
Persistence reinforcement -
Once in a while = people keep trying even if they’re losing (ex: gambling)
Operant Conditioning: Learning association between behaviour and consequences. -
Learning based on positive or negative consequences.
-
Behaviours shaped by schedules of reinforcement.
-
Ubiquitous -
See it in many species
-
Thorndike and Skinner: all learning is instrumental / operant?
-
Behaviourist Revolution -
Anti-mentalism
-
You can understand most behaviors by understanding the actions and the behaviors based on the outcome (which have been reinforced)
Behavior = reinforced. Freedom and dignity stand in the way of understanding work.
Token-economies or Contingency Management: 2006 stizer M perry Annu rev. - Widely used in substance abuse treatment to reduce relapse. - People are being positively reinforced for not relapsing
Associative Learning: -
Respondent (Pavlovian) Conditioning: -
Learning associations between stimuli in the environment.
-
Reinforcement not contingent on response.
-
Behaviour is elicited (respondent)
-
Involuntary behaviours (reflexes?)
-
-
Operant (Skinnerian) Conditioning: -
Learning associations between actions and stimuli/outcomes.
-
Reinforcement contingent on response.
Behaviour is emitted (operant) -
Voluntary (spontaneous)
Dual-Process Approach: both pavlovian and skinnerian Example: Avoidance Learning:
-> Shuttle box
-
Rat placed in a chamber with 2 compartments w/ a door.
-
One chamber has obnoxious stimulus => mild food shocks
-
Tone that precedes the eclectic shock.
-
The animal will run to the other side => negative reinforcements bc avoids that part The minute the tone is played, the animal runs to the other side.
Barrier for escape or avoidance. -
Escape following US.
-
Avoidance following CS -
Classical Conditioning -
-
Tone leads to shock.
Operant Conditioning -
Escape / Avoidance leads to safety.
Principles of Associative Learning: -
Learning through reinforcement.
-
Association by contiguity.
-
-
Co-occurrence in space and time.
-
The reinforcement should be in the same area and around the same time
Arbitrariness. -
-
Any stimulus, any response.
Empty Organism. -
Organism is black box – collection of associations.
-
You are the sum of our experiences
Passive Organism. -
Learning happens TO the organism. (pavlovian)
Summary table Respondent (pavlovian) conditioning -
Learning associations between stimuli in the environment (S-S learning)
-
Reinforcement not possibly likely to happen based on response -
Operant (skinnerian) conditioning -
Not dependent on the animal doing a behavior
Learning associations between actions and stimuli/outcomes (A-O learning) Reinforcement possibly likely to happen based on response -
Dependent on the animal doing a behavior
-
Behavior being elicited (respondent behavior)
-
Behavior is emitted (operant behavior)
-
Involuntary behaviors (reflexes)
-
Voluntary (spontaneous)...