12 Goal directed Behaviour PDF

Title	12 Goal directed Behaviour
Course	Psychobiology
Institution	University of Sussex
Pages	6
File Size	305.3 KB
File Type	PDF
Total Downloads	583
Total Views	1,025

Preview

CLICK TO PREVIEW PDF

Summary

Learning and memory IIGoal directed action Ak Operant (or type -2) conditioning Pavlobian instrumental interactions Violation of basic assumptions of associative learning Functions of Learning and Memory: Allow for adaptive behaviour within individual lifetime. Non-Associative Learning. Learning tha...

Description

Learning and memory II Goal directed action Ak -

Operant (or type -2) conditioning

-

Pavlobian instrumental interactions

-

Violation of basic assumptions of associative learning

Functions of Learning and Memory: -

Allow for adaptive behaviour within individual lifetime.

-

Non-Associative Learning.

-

Associative Learning.

-

Learning that stimuli exist in the world.

-

Learning associations between stimuli / events.

-

Learning association between actions and stimuli / events.

Thorndike’s Puzzle Box: Edward Thorndike (1874 - 1949)

-

Lever to open the door.

-

Used with cats.

-

Cats would move around until they accidentally hit the lever and the door opened.

-

The cats learned how to get out of the box.

-

As the number of trials increased, the time taken for the cat to get out decreased.

-

Operant Conditioning.

-

Law of Effect

Thorndike’s Laws of Learning: -

Behaviour that leads to a positive outcome is more likely to occur in the future.

-

Law of Exercise

-

Law of Readiness

-

Connections between responses and outcomes are strengthened by repetition.

-

Learning is motivated by an internal state.

-

Ex of internal state = hunger. -

If ur hungry and level gives food, u learn about the level faaster

Instrumental / Operant Conditioning: Frederick skinner 1904 - 1990 Skinner/ operant chamber; -

Stimulus light

-

Food hopper

-

Level that gives them food.

-

Learning adaptive behaviour. -

-

-

Through experience of success, failure. (Trial and Error)

Organism operates on the environment. -

Behaviour changes the environment.

-

(it explores the environment and learns (diff from pavlovs dogs cause they were given treats w/ bell where they didnt learn about environment)

Behaviour is instrumental or Goal-Directed. -

Obtains desired effect and is goal-directed.

-

Reward or avoid negative consequence -

-

Pavlovian conditions= stimuli associated w stimulus or stimuli and outcomes (S-S or S-O)

Associations between response and outcome (R-O Learning) -

Response outcome = R-O

Skinner’s Operant Behaviourism: Behaviour is shaped and maintained by its consequences

Reinforcer Stimulus / event that increases the likelihood of the preceding behaviour to occur. -

Positive Reinforcer -

Stimulus (usually positive) produced by the behaviour that increases the likelihood of the preceding behaviour to occur.

-

Negative Reinforcer -

Stimulus (usually negative) eliminated by the behaviour that increases the likelihood of the preceding behaviour to occur.

-

Punishment -

-

Negative stimulus / event that decreases the likelihood of the preceding behaviour to occur.

Omission -

Elimination of positive reinforcer decreases the likelihood of preceding behaviour

-

If pigeon is getting a bit of food and pressing the lever stops getting the food, it will stop pressing the level

Schedules of Reinforcement: A schedule of how they will reinforce a behavior (peck at lever = food every other time vs every time) Continuous Reinforcement -

Each behavioural response is reinforced.

Partial Reinforcement -

Behaviour is reinforced only part of the time.

Ratio Schedules -

Reinforcement given after every nth response. -

Fixed = response requirement always constant.

-

Variable = response requirement varies around average.

Interval Schedules -

Reinforcement given after a certain amount of time.

(responds at least once in __ seconds = reinforcements) -

Fixed = reward intervals constant. (the animal has to press it 10 times to get reward)

-

Variable = reward interval varies around mean time. (on average it has to respond a certain number of times (sometimes 5, sometimes 7 sometimes 3 but average of 5)

-

Persistence reinforcement -

Once in a while = people keep trying even if they’re losing (ex: gambling)

Operant Conditioning: Learning association between behaviour and consequences. -

Learning based on positive or negative consequences.

-

Behaviours shaped by schedules of reinforcement.

-

Ubiquitous -

See it in many species

-

Thorndike and Skinner: all learning is instrumental / operant?

-

Behaviourist Revolution -

Anti-mentalism

-

You can understand most behaviors by understanding the actions and the behaviors based on the outcome (which have been reinforced)

Behavior = reinforced. Freedom and dignity stand in the way of understanding work.

Token-economies or Contingency Management: 2006 stizer M perry Annu rev. - Widely used in substance abuse treatment to reduce relapse. - People are being positively reinforced for not relapsing

Associative Learning: -

Respondent (Pavlovian) Conditioning: -

Learning associations between stimuli in the environment.

-

Reinforcement not contingent on response.

-

Behaviour is elicited (respondent)

-

Involuntary behaviours (reflexes?)

-

-

Operant (Skinnerian) Conditioning: -

Learning associations between actions and stimuli/outcomes.

-

Reinforcement contingent on response.

Behaviour is emitted (operant) -

Voluntary (spontaneous)

Dual-Process Approach: both pavlovian and skinnerian Example: Avoidance Learning:

-> Shuttle box

-

Rat placed in a chamber with 2 compartments w/ a door.

-

One chamber has obnoxious stimulus => mild food shocks

-

Tone that precedes the eclectic shock.

-

The animal will run to the other side => negative reinforcements bc avoids that part The minute the tone is played, the animal runs to the other side.

Barrier for escape or avoidance. -

Escape following US.

-

Avoidance following CS -

Classical Conditioning -

-

Tone leads to shock.

Operant Conditioning -

Escape / Avoidance leads to safety.

Principles of Associative Learning: -

Learning through reinforcement.

-

Association by contiguity.

-

-

Co-occurrence in space and time.

-

The reinforcement should be in the same area and around the same time

Arbitrariness. -

-

Any stimulus, any response.

Empty Organism. -

Organism is black box – collection of associations.

-

You are the sum of our experiences

Passive Organism. -

Learning happens TO the organism. (pavlovian)

Summary table Respondent (pavlovian) conditioning -

Learning associations between stimuli in the environment (S-S learning)

-

Reinforcement not possibly likely to happen based on response -

Operant (skinnerian) conditioning -

Not dependent on the animal doing a behavior

Learning associations between actions and stimuli/outcomes (A-O learning) Reinforcement possibly likely to happen based on response -

Dependent on the animal doing a behavior

-

Behavior being elicited (respondent behavior)

-

Behavior is emitted (operant behavior)

-

Involuntary behaviors (reflexes)

-

Voluntary (spontaneous)...