THE SUBJECT'S HYPOTHESIS: ITS DETERMINANTS AND ITS EFFECT ON RESEARCH DATA

by

RALPH KIRK SAFFORD

B.A., Beloit College, 1964

A THESIS SUBMITTED IN PARTIAL FULFILMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF ARTS

in the Department of Psychology

We accept this thesis as conforming to the required standard

THE UNIVERSITY OF BRITISH COLUMBIA

January, 1971

In presenting this thesis in partial fulfilment of the requirements for an advanced degree at the University of British Columbia, I agree that the Library shall make it freely available for reference and study. I further agree that permission for extensive copying of this thesis for scholarly purposes may be granted by the Head of my Department or by his representatives. It is understood that copying or publication of this thesis for financial gain shall not be allowed without my written permission.

Department of Psychology
The University of British Columbia
Vancouver 8, Canada

ABSTRACT

Subjects' hypotheses about the purposes of experiments, regardless of the accuracy of such hypotheses, may contaminate research data. Experiment I was designed to assess how readily subjects generate hypotheses about experiments and what effect such hypotheses have on performance. In the context of a personality impressions experiment, forty subjects participated either in one of two bogus hypothesis conditions, in which they were given unauthorized information about the purpose of the experiment by an accomplice posing as a subject, or in a no bogus hypothesis condition, in which they received no unauthorized information but were interviewed at the experiment's termination for self-generated hypotheses. Bogus hypotheses were not found to have affected subjects' task performance and only two of the fourteen subjects interviewed reported an attempt to generate an hypothesis.
This latter result was interpreted as contradicting the notion that subjects are strongly motivated to figure out the purpose of an experiment. It was hypothesized that subjects are indifferent towards the purposes of research generally, and that certain types of experimental stimuli must be present in an experiment in order to arouse subjects to speculate about research purposes. Experiment II, designed to test this hypothesis, investigated the speculation arousal function of two such types of stimuli - experimental rationales and sensitization tasks. Thirty-six subjects participated in one of four conditions provided by orthogonal manipulation of the two treatment variables and had their level of suspicion, apprehension, and speculation about the experiment assessed by a brief post-experimental questionnaire. Neither factor was shown to have an arousal function, as no significant differences were obtained. Other factors possibly involved in arousing subject speculation were discussed.

TABLE OF CONTENTS

Chapter I. Introduction
Chapter II. Experiment I
  - Method
  - Results
  - Discussion
Chapter III. Experiment II
  - Method
  - Results
  - Discussion
Footnotes
References
Appendices

LIST OF TABLES

Table I. Means and standard deviations for hypothesis conditions
Table II. Summary of analysis of variance: Experiment I
Table III. Means and standard deviations for suspicion, apprehension, and purpose questions
Table IV. Summary of analysis of variance: Experiment II

CHAPTER I

INTRODUCTION

Investigators (Kelman, 1967; Orne, 1962; Riecken, 1962; Schultz, 1969) have argued that subjects who participate in psychological research do not merely passively respond to the experimental stimuli but rather actively attempt to ascertain the true purpose of an experiment.
Orne has stated, "The subject's performance in an experiment might almost be conceptualized as problem-solving behavior; that is, at some level he sees it as his task to ascertain the true purpose of the experiment and respond in a manner that will support the hypothesis being tested" (1962, p. 779). Such attempts on the part of subjects to obtain insight into an experiment pose methodological problems for psychologists because they may be a source of artifact. A subject's "hunch" or hypothesis about the real purpose of an experiment may significantly determine his performance in that experiment irrespective of the experimental treatment provided.

Investigation of the subject's hypothesis has in practice meant investigation of only a subset of hypotheses - hypotheses which indicate awareness of the experimental contingencies due to demand characteristics (Page, 1968, 1969, in press; Page and Lumia, 1968; Sherman, 1967; Silverman, 1968; Silverman and Regula, 1968). Research on awareness due to demand characteristics is based upon a paper by Orne (1962), in which he proposes that demand characteristics are a source of artifact in psychological experimentation. Demand characteristics are defined as "the totality of cues which convey an experimental hypothesis to the subject" (1962, p. 779). In demand awareness research, the typical approach is to re-examine some experiment or experimental paradigm the results of which have been "rather uncritically accepted into the psychological body of knowledge" (Page, 1968, p. 59), in an attempt to show that the results can be attributed to subjects' awareness of the experimental hypothesis.
The procedure is to interview subjects after the experiment and classify them as "aware" or "unaware" based upon their ability to verbalize the experimental hypothesis, and on the basis of this classification determine whether or not the experimental effect can be accounted for by aware subjects alone. (This procedure has been followed in all demand awareness studies except those of Sherman [1967] and Silverman [1968], in which no measure of awareness was obtained.)

In all of the demand awareness studies done to date, investigators have found that experimental effects formerly attributed to some treatment variable can be accounted for by subject awareness of demand characteristics. Subjects have been found to be demand-aware in the following experimental paradigms: figure-ground perception (Page, 1968); classical conditioning of attitudes (Page, 1969); communicator credibility (Page, in press); verbal operant conditioning (Page & Lumia, 1968); and the effects of distraction on persuasibility (Silverman & Regula, 1968). Further, attitude change has been shown to be a function of demand characteristics in two studies in which demand characteristics were manipulated (Sherman, 1967; Silverman, 1968). These studies show rather strikingly that demand characteristics are present in a variety of experimental situations and that such cues can affect subjects' experimental performance.

Worthy as research on demand awareness has been, it has unfortunately been the only work done on subjects' hypotheses. The possibility that inaccurate hypotheses (i.e., hypotheses which do not correspond to the experimental hypothesis) may be an important source of artifact has not been investigated. That all subjects' hypotheses - regardless of accuracy - ought to be investigated is indicated by the following considerations.
First, there is no a priori reason for supposing that an inaccurate hypothesis is any less a potential data contaminant than an accurate hypothesis. All subjects who formulate hypotheses believe with varying degrees of assuredness that their hypotheses are accurate. Since demand awareness research has shown that accurate hypotheses can contaminate data, it is only plausible to assume that inaccurate hypotheses can do likewise.

Secondly, there is reason to believe that subjects are less likely to generate correct hypotheses in experiments than they are to generate incorrect ones. The concentration of research on demand awareness may in part be the result of a belief that subjects can generate accurate hypotheses (i.e., become demand aware) in most of the experiments they encounter. It can be argued, however, that subjects are not likely to be able to generate accurate hypotheses very often. For demand awareness to be considered the rule in psychological research it must be true that demand characteristics are present in most experiments and that subjects are perceptive to the extent of being able to distinguish demand characteristics from irrelevant stimuli.

There is little reason to suppose that demand characteristics are present in most experiments. Demand characteristics, being cues which convey to the subject the experimenter's intentions, are precisely those cues which experimenters characteristically take care to eliminate from their designs. Experimenters must succeed in this enterprise at least occasionally. Indeed, it seems to the author that the most reasonable position to take on this matter is that demand characteristics are present in some, but not most, studies, their presence being primarily a function of the care taken by the experimenter to eliminate such cues.
Demand awareness research to date provides evidence to support the notion that demand characteristics are present in some studies, but it cannot be construed as evidence that demand characteristics are present in most studies.

When demand characteristics are present in a study (and even if they were present in all studies), the generation of accurate hypotheses is still not assured. Demand characteristics have the potential for conveying to the subject the experimenter's hypothesis; other experimental events just as surely have the potential of misleading the subject about the experimenter's hypothesis. Subjects have no way of knowing which are the "right" cues. For these reasons then, the generation of accurate hypotheses appears improbable.

It can be argued on the other hand that erroneous hypotheses are more common in experiments. Orne (1962), Kelman (1967), Riecken (1962), and Schultz (1969) all for various reasons state that subjects are highly motivated to ascertain the true purposes of research. Desire for awareness does not necessarily, as the demand characteristics advocates would have it, ensure awareness. What it may ensure, however, is that subjects will have hypotheses. That most of these hypotheses are incorrect is not important; it is only important that subjects think them to be correct. Thus it appears plausible that psychological research is contaminated not so often by subjects who are aware of an experiment's purpose as by subjects who think (mistakenly) that they are aware.

Because inaccurate hypotheses are just as likely to contaminate research data as accurate hypotheses and because inaccurate hypotheses may occur more often in experiments, research in this area should not be restricted to accurate hypotheses alone. All the hypotheses of subjects deserve investigation.
If the subject's hypothesis is to be considered a data contaminant of some significance it must be shown, first, that subjects will readily engage in speculation about an experiment's purpose in a variety of experimental situations (indicating that motivation to speculate is strong and remains so more or less independently of the type of experiment), and second, that such speculation affects subjects' experimental performance. The demand awareness literature contains ample evidence that a "correct" hypothesis can affect performance. It remains to be determined whether or not an incorrect hypothesis can do likewise. By studying the performance consequences of several hypotheses generated about the same experiment, this determination can perhaps be made.

In a preliminary study on the subject's hypothesis, the author attempted to determine the nature and extent of subject speculation in a given experimental setting as well as its consequences for subject experimental performance. In the context of a pretest-posttest opinion change experiment, some subjects were surreptitiously given information ("hypotheses") about what type of change was expected and some subjects, given no information, were queried about spontaneously generated hypotheses by means of an extensive post-experimental questionnaire. The resultant opinion change for subjects in the hypothesis manipulation conditions was in the predicted direction but the difference was not statistically significant. The amount of subject speculation in the no information condition was quite high - approximately 90% of those subjects had some ideas concerning the purpose of the experiment. Not only was there some variety in the hypotheses generated, but more than a few of them were based upon experimental events that had no demand characteristics function. Some subjects simply attended to the "wrong" cues.
It seems then that even in experiments the purpose of which ought to be fairly obvious to subjects, it is possible to misread the signs and to generate an inaccurate hypothesis.

As a result of these findings the need for further studies was indicated. The two studies reported below were undertaken for this purpose. The first experiment, designed partially as a replication of the author's preliminary study, attempted to determine whether or not subjects will try to figure out the purpose of an experiment and whether or not hypotheses affect performance. It was predicted that subjects would readily generate hypotheses and that such hypotheses would affect performance. The second experiment was concerned with factors that cause subject speculation. As the second experiment was based upon interpretation of the findings of the first experiment, discussion of its rationale and purposes will be deferred until later.

CHAPTER II

EXPERIMENT I

The purpose of this experiment was to determine a) to what extent subjects generate hypotheses about the purpose of experiments and b) whether or not such hypotheses affect performance. The context in which these two factors, amount and effects of subject speculation, were tested consisted of an attitude change paradigm described to subjects as an experiment on personality impressions. Subjects were instructed to read a written communication in order to obtain an impression of the author's personality and then asked to fill out two forms, one related to their obtained personality impression and one, presented as a "control" measure, related to their opinions about the communication. One group of subjects (No Bogus Hypothesis condition) after completing the experiment were interviewed individually in order to ascertain how many of them generated hypotheses about the purpose of the study.
A second and third group of subjects (Bogus Hypothesis conditions) were, during the course of the experiment, given unauthorized information about the purpose of the experiment by an accomplice posing as a subject (these bogus hypotheses were designed to simulate hypotheses that subjects might be expected to generate on their own). As the accomplice's information had a direct bearing on the form that was used to measure subject opinion, this form served as the dependent measure in the study.

One fault with the preliminary study was that the design was such that subjects could generate an appropriate hypothesis relatively easily. As ease of guessing an experiment's purpose is considered to be a factor in influencing only the accuracy and not the quantity of hypotheses, a design in which purposes were more difficult for subjects to guess was used in this study. It was felt that by measuring opinions once only and emphasizing the personality impressions rationale, the experiment's "real" purpose would be rendered relatively obscure.

It was predicted, first, that a considerable number of subjects in the No Bogus Hypothesis condition would report hypotheses about the experiment's purpose (although no precise prediction was made here, it was expected that at least half of the subjects would report hypotheses). It was expected that these hypotheses would be variable in content and based, not upon a few "significant" cues as might be predicted from a demand characteristics viewpoint, but upon a variety of experimental stimuli. Secondly, it was predicted that subjects in the two Bogus Hypothesis conditions would express opinions that differed from those expressed by subjects in the No Bogus Hypothesis condition, opinions that would be a function of the bogus hypothesis to which they were exposed.
METHOD

Subjects

Each of forty volunteer subjects enrolled in introductory psychology courses during summer session at Vancouver City College participated in one of the three conditions of the experiment. The sample consisted of an equal number of males and females. Subjects were run in small groups ranging in size from three to six persons. Only two subjects reported that they had had prior experimental experience, and no subject reported any foreknowledge of this experiment.

Procedure

The experimental procedure outlined below was the same for all subjects participating in the experiment. At the start of the experiment subjects were given a "cover story" about the experiment's purpose and procedures. Subjects were told that the experiment was concerned with how "personality impressions are formed." They were told that they would be given a written article to read from which they were to attempt to obtain an impression of the author's personality, and that their impressions would be assessed on a Personality Impressions Scale. Subjects were also informed that they would be given an additional written task the purpose of which was to "control factors which may have unintentionally influenced" their obtained personality impression.

Subjects then read a 550-word communication (Appendix A) arguing against the idea that college graduates would soon have to undertake advanced graduate training to obtain employment positions open to them today. This communication was previously used by Papageorgis (1970).

After subjects read the communication they completed the Personality Impressions Scale (Appendix B). Subjects had three tasks to perform on it: respond to twelve seven-interval scales with adjectival opposites descriptive of personality traits at the endpoints (e.g., "Assertive-Submissive," "Intelligent-Unintelligent," etc.); rate the strength of the personality impression they obtained from reading the communication; and write a three-sentence description of the author. The Scale itself was preceded by a set of written instructions.

Next a five-item Opinionnaire (Appendix C), described to subjects as a control task, was administered. The Opinionnaire consisted of five statements about the communication, each statement followed by a seven-interval scale with endpoints labelled "agree-disagree." The five statements were as follows: "The article, on the whole, was a good one"; "The evidence presented by the author was objective and factual"; "The arguments used by the author were designed to play on the reader's emotions"; "The belief that advanced graduate study will soon be essential for employment is false"; and "The author of the article was sincere."

At the end of the experiment all subjects were asked not to discuss the experiment with fellow students.

Treatment conditions

Subjects participating in the Bogus Hypothesis (BH) conditions received (to them) apparently unauthorized information about the purpose of the experiment from a female accomplice posing as a subject. Half of the subjects received positive information about the experimenter's intentions (Pro Bogus Hypothesis), and half received negative information (Con Bogus Hypothesis). The accomplice made her "pitch" just prior to the administration of the Opinionnaire while the experimenter was absent from the experimental room on the pretext that he had forgotten to bring the Opinionnaire forms with him. The accomplice was instructed to impart the following information to the respective groups:

PRO BOGUS HYPOTHESIS

I have a friend who was in this experiment before and she told me what it's all about. This guy isn't testing personality. He's really testing our opinions about the article. Apparently he wants to see how open minded students are, see if they're willing to be convinced by new ideas, like in the article. That's why he's going to test our opinions.
CON BOGUS HYPOTHESIS

I have a friend .... opinions about the article. Apparently he wants to see how gullible students are, see how easily they can be made to believe a foolish idea, like in the article. That's why he's going to test our opinions.

The accomplice's information was designed to sensitize subjects to an alternative interpretation of the purpose of the experiment and to simulate the sort of hypothesis that a subject might generate if left to his own devices - that persuasion was the real purpose of the study. The experimenter returned to the experimental room after a two-minute absence and administered the Opinionnaire.

Subjects in the No Bogus Hypothesis (NBH) condition were given no unauthorized information about the purpose of the experiment and were interviewed individually afterwards to determine what hypotheses, if any, they had generated on their own during the course of the experiment. At the end of the experiment proper, subjects were requested to remain for an interview. The experimenter then told subjects that his interest in conducting the experiment was not to study personality impressions but to find out what their personal reactions to such an experiment would be. They were told that the sole function of the experiment was to give them experimental experience for the forthcoming interview. The experimenter then discussed (without revealing his interest in hypotheses) the importance of studying subjects' "reactions to experiments" and asked subjects to return to the experimental room at ten-minute intervals for the interviews. The purpose of the interview was explained to subjects in this way so that obstacles to frank discussion might be eliminated and the possibility of an experimenter-subject "pact of ignorance" (Orne, 1959a) avoided.

Each interview took about ten minutes and was tape recorded.
All subjects in the No Bogus Hypothesis condition were interviewed, and no subject had to wait more than thirty minutes for his interview session. During the interview, subjects were questioned about their prior attitudes toward and knowledge of experiments, their reactions to each of the various tasks in this experiment, and specifically about any suspicions or hypotheses they had about the experiment's real purpose. If a subject reported an hypothesis, he was asked when he had formulated the hypothesis and what had caused him to formulate it. The questioning was designed to be thorough and the interviewer attempted to follow up any promising leads with further questions.

Analysis

The Opinionnaire responses of subjects in both Pro and Con BH conditions were summed across statements for each subject and the summed scores were compared with the summed Opinionnaire scores of subjects in the NBH condition. To facilitate analysis, scoring for statement #3 was reversed. It was predicted that, in line with their respective bogus hypotheses, subjects in the Pro Hypothesis group would agree more and subjects in the Con Hypothesis group would agree less with Opinionnaire statements than subjects in the No Bogus Hypothesis condition.

A considerable number of subjects in the NBH condition were expected to report hypotheses. A subject was not classified as having an hypothesis unless all four of the following criteria were met: the subject must report that he had attempted to figure out the experiment's purpose (thus indicating that he had been sceptical of the experiment's stated purpose); he must be able to report an hypothesis that was specific enough to affect his experimental behavior (vague hunches were not acceptable); his belief in his hypothesis must not be tentative; and he must have arrived at his hypothesis prior to the experiment's termination.
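The scoring rule and the four classification criteria described above can be sketched in a few lines of Python. This is only an illustrative sketch and not the procedure actually used by the author: the function and field names, the assumption that each item is coded from 1 to 7 with 7 as the strongest agreement, and the sample responses are all hypothetical and are not taken from the study's data.

```python
from dataclasses import dataclass

SCALE_MAX = 7          # seven-interval scale; the 1..7 coding is an assumption
REVERSED_ITEMS = {3}   # statement #3 is reverse-scored, as stated in the text


def opinionnaire_score(responses):
    """Sum one subject's Opinionnaire ratings, reversing statement #3.

    `responses` maps statement number -> rating (1..7, higher = more agreement).
    """
    total = 0
    for item, rating in responses.items():
        if not 1 <= rating <= SCALE_MAX:
            raise ValueError(f"rating out of range for statement {item}: {rating}")
        if item in REVERSED_ITEMS:
            rating = SCALE_MAX + 1 - rating   # 1 <-> 7, 2 <-> 6, and so on
        total += rating
    return total


@dataclass
class InterviewReport:
    """Hypothetical coding of one interview against the four criteria."""
    attempted_to_figure_out_purpose: bool
    hypothesis_specific: bool
    belief_not_tentative: bool
    formed_before_termination: bool


def has_hypothesis(report):
    """Classify a subject as having an hypothesis only if all four criteria hold."""
    return (report.attempted_to_figure_out_purpose
            and report.hypothesis_specific
            and report.belief_not_tentative
            and report.formed_before_termination)


# Hypothetical example data (not from the thesis).
example_responses = {1: 5, 2: 4, 3: 2, 4: 6, 5: 5}
example_score = opinionnaire_score(example_responses)

vague_hunch = InterviewReport(True, False, False, True)
firm_hypothesis = InterviewReport(True, True, True, True)
```

Requiring all four criteria at once makes the classification deliberately conservative: a report that fails any single criterion, such as a vague or tentative hunch, is not counted as an hypothesis.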
Speculation that fails to meet these criteria will not, theoretically at least, have the power to contaminate performance, and was not considered an hypothesis.

Two problems were encountered in the study - one problem having to do with implanting the hypothesis, the other having to do with the Opinionnaire. During one of the sessions of the Con Hypothesis condition, while the accomplice was in the act of telling subjects what the experiment was about, a subject questioned her authenticity, implying that she was a "plant." Devastating as this accusation ought to have been for the accomplice's ability to maintain verisimilitude, she reported that she was able to manage the situation so that, in the end, the subject was himself discredited rather than the accomplice. (He was apparently playing a hunch and was easily dissuaded.) A welcome assist was even provided the accomplice by several subjects who ridiculed the "plant" notion. Since all subjects, including even the one who spoke out, appeared to side with the accomplice in the end, the data from this session were not excluded from the analysis. No such problems were encountered in the other bogus hypothesis sessions and for them all indications are that the accomplice's words went unquestioned (for example, following the accomplice's disclosure a subject in one session said, "Yeah, that's the way they [psychologists] do things").

The second difficulty had to do with one statement on the Opinionnaire (#4). After the experiment was completed it was discovered that this statement ("The belief that advanced graduate study will soon be essential for employment is false.") may have been a source of confusion to subjects. The statement was worded rather awkwardly ("The belief that advanced graduate study will be essential ... is false" is an awkward way of saying "Advanced graduate study will not be essential ..."), and therefore some subjects may have inadvertently indicated an attitude toward the statement that was opposite from the one they actually held. To check on this possibility, the statement was tested for clarity on ten subjects not associated with the original experiment. Six subjects were confused by the statement. Since subjects in the experiment may have been similarly confused, the data from this statement were not used in the analysis.

RESULTS

Opinionnaire Data

The Opinionnaire data for the hypothesis comparison are summarized in Table I (one subject from the NBH condition was randomly eliminated to provide equal-sized groups for the analysis). The higher the mean score, the greater the agreement with the Opinionnaire statement(s). It was predicted that Opinionnaire scores for the Pro and Con BH conditions would be higher and lower respectively than Opinionnaire scores for the NBH condition. Note that the overall means bear out only half of this prediction: the Con Hypothesis mean (x̄ = 3.75) is lower than the NBH mean (x̄ = 4.94), but the Pro Hypothesis mean (x̄ = 3.87) is not higher than the NBH mean. An analysis of variance performed on the Opinionnaire data (Table II) showed that these means did not differ at conventional levels of significance (F = 2.62, p < .10). Thus the bogus hypothesis manipulations did not have the predicted differential effect on Opinionnaire performance.

Although the bogus hypothesis manipulation did not have the predicted effect on Opinionnaire performance, the manipulations nonetheless may have had some other effect. Not only are both BH means lower than the NBH mean, but the means for individual Opinionnaire statements in both Pro and Con BH groups are consistently lower than the corresponding means for the NBH condition, sometimes strikingly so (see particularly statement #3).
Both Pro and Con bogus hypotheses seem to have affected performance (that is, if the borderline significance is taken to reflect a true difference); contrary to expectations, however, the effect of both bogus hypotheses was the same.

TABLE I

MEANS AND STANDARD DEVIATIONS FOR HYPOTHESIS CONDITIONS

                        Bogus Hypothesis Condition
Opinionnaire      Con            No             Pro
Statement         Hypothesis     Hypothesis     Hypothesis

1                 4.08           4.23           2.92
2                 3.31           4.15           3.31
3                 3.31           5.62           4.15
5                 4.31           5.77           5.08

x̄                 3.75           4.94           3.87
SD                4.81           6.04           6.58
N                 13             13             13

TABLE II

SUMMARY OF ANALYSIS OF VARIANCE: EXPERIMENT I

Source            df       MS        F

Total             38
Hypothesis         2       89.95     2.62*
Error             36       34.32

* p < .10

Interview Data

Fourteen subjects participating in the No Bogus Hypothesis condition were interviewed for their hypotheses (the subject excluded from the Opinionnaire data analysis was not excluded here). In all but a few cases, the indications are that subjects responded to the interviewer's questions honestly. For example, the reaction of subjects to questions about their suspicions and hunches was in many cases that of surprise - a difficult response to fabricate effectively. Also, subjects reporting experimental behavior "out of the ordinary" did not display any apparent reluctance to disclose that behavior to the interviewer. Those few subjects whose interview responses were "guarded" were noted by the interviewer, and their reports were given little credence.

Contrary to expectations, no subject had an hypothesis about the purpose of the experiment that measured up to the criteria. One subject came to the experiment with an elaborate hypothesis, but rejected it as unsuitable when faced with experimental events. This subject (#36) thought he was going to be given an intelligence test.
When queried further, he said that he thought the purpose of the experiment was a comparison of the IQ of students at his school with the IQ of students at a nearby university. How he arrived at this hypothesis could not be determined, but it apparently arose out of a knowledge, and perhaps apprehension, of deception experiments. He realized part way through the experiment, however, that his hypothesis was not tenable, and rejected it: "After having finished the [personality impressions] test I really didn't see how it could be an intelligence test and I really didn't see what else you'd be testing."

It is interesting that this subject's belief in his hypothesis - unsuitable though it was - remained unshaken until after he had completed the Personality Impressions Scale. He obviously did not believe the experimenter's introductory remarks about the experiment's purpose and procedure, and required a wealth of disconfirming evidence (communication, Personality Impressions Scale instructions, etc.) before his belief in his hunch was shaken. When an hypothesis can be maintained in the face of such disconfirming evidence, belief in that hypothesis must be strong. This suggests that subjects whose hypotheses do not conflict with experimental events may place a considerable amount of faith in their hypotheses' correctness.

Subject #36 displayed all the attributes of a subject with an hypothesis except one: he failed to maintain belief in his hypothesis. Therefore he could not be classified as having an hypothesis by the criteria set forth in this study. The experimental performance of a subject who has and then rejects an hypothesis is not likely to be contaminated by that hypothesis.

Two subjects (#20 and #25) reported that during the course of the experiment they thought it might be about something other than personality impressions. However, their hunches were both vague and transitory in nature.
Not only could they not say what that "something else" might be, but the thought that the experiment was about something else apparently just "crossed their minds." Aside from such momentary doubts they seemed to maintain belief in the stated purpose of the experiment. These subjects therefore were not classified as having an hypothesis.

None of the other eleven subjects reported anything that could be even remotely considered an hypothesis. No subject interviewed in this experiment, therefore, was classified as having an hypothesis.

Not only were hypotheses apparently absent from subjects in this experiment, but suspicion of the experimenter's intentions, a factor which ought to lead subjects to entertain hypotheses, also appeared to be quite low. Only one subject was considered to be suspicious. This subject (#30) had apparently been forewarned about deception experiments by her psychology instructor, and heeded his warning. The relevant part of the interview follows:

Interviewer: When I gave you the instructions at the beginning, did you believe them?
Subject: No. Our psychology professor said that you never know what somebody is testing ... like you may be asked to do a certain task and they can be testing something else.
Interviewer: So when I gave you the instructions you were a little suspicious?
Subject: No. I don't mind. [But a moment later:] I was wondering what your whole test was based on and what you were trying to find.

Although this subject was suspicious, she was unable to come up with an hypothesis about the experiment's purpose: "I tried to figure it out, but didn't have much luck." With a little more psychological and experimental sophistication, perhaps, this subject might have been able to figure it out.

One other subject (#22) reported that she was suspicious in the experiment, but her words are suspect.
Not only were her interview responses guarded, but she seemed to be making an effort to give only those responses that the interviewer would want to hear. Because her responses to other interview questions did not appear to be forthright, the truthfulness of her reported suspiciousness was cast in doubt. Therefore she was not classified as suspicious.

An interesting interview finding related to subject suspicion and hypothesis testing was the number of subjects reporting awareness of experimental deceptive practises who were not suspicious. Six of the fourteen subjects interviewed reported that they knew about deception experiments (how much they knew about them is another matter; none were truly sophisticated in their knowledge). Of these, only two reported that they suspected that the experimenter was testing something other than personality impressions (subjects #30 and #36, previously discussed). The other four subjects said that, despite their knowledge of deception, they were not suspicious. These subjects gave one of two reasons for their lack of suspicion. One reason was that they came to the experiment expecting something "horrible" only to find that the experiment was not as they expected, and therefore dismissed the thought of deception from their minds. The other reason given was that to attempt to "second guess" the experimenter was "wrong". It appears then that subject knowledge of experimental deceptive practises, although perhaps a necessary condition for the arousal of suspicion and hypotheses, is not a sufficient condition for their arousal. When an experiment is innocuous or when subjects adopt a "proper" frame of mind, forewarned is not necessarily forearmed.

In conclusion, it is apparent from the interview data that subjects did not generate hypotheses about the purpose of the experiment as expected.
Out of fourteen subjects interviewed, no subjects were classified as having an hypothesis and only two subjects were classified as suspicious (assuming that the subject with the aborted "intelligence test" hypothesis ought to be placed in the "suspicious" category). A majority of subjects reported that they accepted the personality impressions rationale completely and that their behavior in the experiment was task-oriented. The remainder of the subjects reported nothing more substantial than vague and fleeting doubts. The occurrence of hypotheses in subjects appears by these data to be rare.

DISCUSSION

Opinionnaire data

The failure of the bogus hypothesis manipulation to produce the predicted effect was surprising, especially since a similar manipulation in the author's preliminary study produced results which, although not significant, were at least in the predicted direction. Two possible explanations can be given for the failure of the manipulation: that the bogus hypotheses were not effectively communicated to subjects, or that not all subjects were desirous of confirming the experimenter's expectations.

If subjects understood that the experimenter was really testing "open mindedness" or "gullibility," some of them may have decided to be negative responders (Masling, 1966). To be negative responders (i.e., to give responses known to be the opposite of what the experimenter desires or, in Masling's terminology, to give responses which "screw" the experimenter), subjects in the Pro Bogus Hypothesis condition would have had to show minimal agreement with statements on the Opinionnaire, and subjects in the Con Bogus Hypothesis condition would have had to show maximal agreement. As subjects in both BH conditions showed (when compared with the mean of subjects in the NBH condition) minimal agreement with the Opinionnaire statements, it appears unlikely that these data are an instance of Masling's "screw you" effect.
Subjects in the Con Bogus Hypothesis condition did not give the appropriate "screw you" response. Unless negative responding was used only by subjects in the Pro Bogus Hypothesis condition (possibly as a result of a less plausible rationale in the "open mindedness" instructions), the Masling interpretation may be ruled out.

It seems much more likely that the results can be accounted for by some sort of communication breakdown. The accomplice did report, however, that in no case did she have any difficulty in imparting a bogus hypothesis to subjects. The key to what happened therefore may lie in understanding the conditions under which subjects received the accomplice's information. It was determined from the interviews that most of the subjects in the NBH condition gave credence to the personality impressions rationale provided by the experimenter and were primarily task oriented. Subjects in the BH conditions, prior to receiving the accomplice's information, probably placed a similar amount of faith in that rationale. Discovering that the experimenter was not testing personality impressions probably took them by surprise. Given this situation, it seems intuitively plausible that subjects were more interested in determining the extent of the experimenter's deceit than in learning the experiment's true purpose. Contrast this situation with the one under which subjects received bogus hypotheses in the author's first study. There, subjects reported that they were suspicious from the beginning. They were probably quite interested in learning from the accomplice what the experiment's true purpose was. But in the present study, subjects, since they were probably unsuspecting, may have been caught off guard by the unanticipated deception, and so did not give the accomplice's information the consideration it deserved.
The bogus hypothesis manipulation therefore may have been effective only in arousing distrust and suspicion and not effective in communicating information about the experiment's purpose as intended.

If the above surmise is correct (and it seems to the author a more plausible account of the situation than that some subjects were negative responders), then this experiment must be considered to have inadequately tested the prediction made about the behavioral consequences of having an hypothesis. Further research is necessary.

In light of the possibility that the hypothesis manipulation only succeeded in making subjects suspicious, the Opinionnaire data may be of increased interest. The fact that subjects in both Pro and Con BH groups scored lower (p < .10) on the Opinionnaire than subjects in the NBH condition (who were for the most part not suspicious) suggests that suspiciousness of the experimenter's intent may be a research contaminant in its own right. These findings are in line with those of Stricker, Messick & Jackson (1967), who in a conformity study found that suspicious subjects conformed less. Further investigation of this "suspicion effect" appears by these data to be recommended.

Interview findings

The fact that only 14% of the subjects interviewed in the NBH condition saw reason to doubt the alleged purpose of the experiment may be an important finding despite its unexpectedness. It is at odds with the findings of the author's preliminary study and at odds with the findings of most of the research on demand awareness (e.g., Page, 1968, 1969; Silverman & Regula, 1968), in which a greater percentage of subjects sceptical of an experiment's alleged purpose and aware of an experiment's true contingencies have been found. Possible causes of this discrepancy merit consideration.
On the surface, it appears that the relative absence of suspicion and speculation in the NBH condition of this experiment may be attributed to the subjects' lack of sophistication in experimental and psychological matters. Only two subjects had participated in a psychology experiment prior to this one, and all were enrolled in introductory psychology courses in which experimental psychology was apparently not heavily emphasized. It is possible that had subjects in this experiment been more sophisticated, they might have been more prone to speculate about its purpose. Evidence from this and other experiments, however, suggests that subject sophistication may not in fact be of much importance as a factor in determining the amount of suspicion and speculation.

First, suspicion has been found in subjects no less naive than subjects in this study. Subjects in the author's preliminary study were just as naive on the average as subjects in this study, and naivete did not prevent their being suspicious. Widespread suspicion has even been encountered in a sample of high school students (Stricker, Messick & Jackson, 1967).

Secondly, sophistication and experimental experience have not been shown to lead to increased suspicion and speculation. Fillenbaum (1966) exposed subjects to mild deception but found no increase in suspiciousness in a subsequent experiment. Page (1968, 1969) found that subject sophistication was associated with awareness of demand characteristics, but concluded that sophistication was not necessary for awareness because many unsophisticated subjects were aware in his studies. Holmes (1967) found that prior experimental experience increased the probability of a subject's becoming aware, but also found that more experienced subjects were less inclined to engage in speculation about the purpose of an experiment.

The relationship between sophistication and suspicion/speculation is, then, not entirely clear.
It appears that a sophisticated subject has a better chance of success in his speculations, but it does not appear that he is any more motivated to speculate than is a naive subject. Naive subjects have not been found to be any less suspicious than sophisticated subjects. Some other factor appears to underlie suspicion and speculation.

One difference between this study and demand awareness studies is that the experimental paradigms that are chosen for investigation in demand awareness research are chosen because they are believed to contain demand characteristics; the paradigm used in this study was not chosen for that reason. Thus it may be the case that demand characteristics were present in all the paradigms investigated for demand awareness, but were not present in the paradigm investigated in this study. Perhaps this difference accounts for the discrepancy in findings regarding subject speculation. The presence of demand characteristics in an experiment may determine whether or not subjects will speculate about that experiment. The only function that demand characteristics are supposed to have is to convey to subjects the experiment's purpose, but perhaps they have an arousal function as well: that is, they cause previously unsuspecting subjects to doubt the experimenter's intentions.

If demand characteristics do function to arouse subject scepticism as well as to convey information about an experiment's purpose, the implications concerning the type of artifact being dealt with by investigators in this area are quite important. It has been assumed (Kelman, 1967; Orne, 1962; Riecken, 1962; Schultz, 1969) that the source of the artifact in part resided in subjects' keen interest in ascertaining the true purposes of research. This, however, may not be the case. What we may have is not a population of overly suspicious subjects, but an indeterminate number of poorly disguised experiments.
Subjects may speculate about an experiment only when some procedural irregularity arouses their suspicions. If so, the artifact is controlled not by finding some contaminant-free method of conducting experiments within the context of suspiciousness, but by designing experiments in which demand characteristics and other suspicion arousing cues are eliminated. Martin Orne (1969) has in a recent article outlined procedures that could be used to eliminate such cues from experiments.

If subjects only speculate when aroused and if speculation can be eliminated by appropriate methodology, then determination of the nature and consequences of speculation - i.e., continuation of the research program of this study - may be of only minor importance. Of major importance is determining whether or not certain experimental practises can account for the speculation of subjects heretofore attributed to high motivation. Experiment II therefore was designed as a preliminary investigation of two experimental practises believed to be instrumental in arousing subject suspicion and speculation. The two practises investigated were the use of sensitization tasks and cover stories.

CHAPTER III

EXPERIMENT II

In the author's preliminary study, most subjects speculated about the experiment's purpose. In the study just reported, speculation was evidently rare. The discrepancy in findings may be accounted for by procedural differences between the two studies. Practises eliminated from Experiment I may have been significant in arousing subjects to speculate in the preliminary study. Two practises which may have been significant in this regard are as follows. In the preliminary study, subjects were not provided a cover story about the purpose of the experiment and had their opinions measured twice. The absence of a cover story and the taking of repeated opinion measures may, either singly or in combination, cause subjects to speculate about an experiment's purpose.
These two experimental practises were investigated in this study.

The significance of repeated attitude measures as a factor in arousing subject suspicion and speculation lies in the fact that when a performance measure is administered twice in an experiment, subjects may become sensitized to the possibility that their responses on the two measures will be compared. Thus investigators often make an effort to disguise such measures (or resort to the use of a single measure). If the disguise is somehow inadequate, or if two measures having in fact no relationship are similar-appearing, subjects may become suspicious. Thus any unexplained, similar-appearing measurement devices in an experiment are likely to have an arousal function.

That failure to provide a rationale or cover story for an experiment can result in subject speculation seems plausible. A silent experimenter may be regarded by subjects as one who has "something to hide." Simple curiosity will cause subjects to become interested in finding out what the experimenter is not telling. Even if the experimenter explains his inability to describe the experiment's purpose, subjects will have difficulty functioning without the framework that a rationale would provide. Thus they may find it necessary to make some sense of the course of experimental events.

As sensitization tasks and experiments without rationales seem likely candidates for arousing subject suspicion and speculation, they were investigated in this study. The variables were manipulated orthogonally to provide four treatment conditions (rationale vs. no rationale and sensitization vs. no sensitization). The personality impressions/persuasion experiment format used in the first experiment was also used in this study. The dependent measure was a brief post-experimental questionnaire designed to measure subjects' level of suspicion, apprehension, and speculation.⁷
It was predicted that subjects provided with no rationale for the experiment would report greater suspicion, apprehension, and speculation than subjects provided with a rationale, and that subjects receiving the sensitization task would be more suspicious, apprehensive and speculative than subjects not receiving it.

In addition, several correlational analyses of the data were undertaken. Level of suspicion was correlated with degree of agreement with the communication and extremity of opinion (both as measured by an Opinionnaire) and, in both cases, the correlation was predicted to be negative. These analyses were conducted because of the arguments presented in the discussion section of the preceding experiment, that suspicion in a subject sample may be a data contaminant in its own right. The prediction that suspicious subjects will agree less with Opinionnaire statements than unsuspicious subjects arises from the results of the previous study, in which the Opinionnaire differences were interpreted as indicating a "suspicion effect." The level of suspicion-extremity of opinion prediction was based on the belief that suspicious subjects are "on guard," and, due to caution, avoid the expression of extreme viewpoints (e.g., avoid marking the endpoints of scales).

METHOD

Thirty-six undergraduate volunteers drawn from introductory psychology courses at the University of British Columbia participated in one of the four treatment conditions. Prior experimental experience of subjects averaged 1.9 experiments (range, 0-6), and, as the experiment was conducted near the end of term, subjects should be considered both experimentally and psychologically sophisticated. The sample was made up of 17 males and 19 females.

The format for this experiment was similar to that used in the first experiment.
All subjects, regardless of treatment condition, were given the following tasks to do:

a) read a 550 word persuasive communication about higher education (the same communication used in the first experiment);

b) fill out a Personality Impressions Scale (the same scale as used in the first experiment);

c) fill out an Opinionnaire consisting of five statements about the communication (this form differed from that used in the first experiment only in that the last two statements were re-phrased, as follows: "The conclusions arrived at in the article are essentially correct" and, "The author obviously had ulterior motives for writing the article" - Appendix D);

d) answer a post-experimental Questionnaire (Appendix E) consisting of four questions about their reactions to the experiment. (The questions were as follows: "Were you suspicious of the experimenter's intentions?"; "Were you apprehensive about doing any of the tasks?"; "How much time did you spend attempting to guess the experiment's purpose?"; and a fourth question which was a filler item and was omitted from the analysis.) Each question was followed by a four point scale with intervals labelled as to the amount of the quantity in question - e.g., "not at all suspicious," "slightly suspicious," "moderately suspicious," and "very suspicious."

The two treatment variables, rationale for the experiment and sensitization, were manipulated in the following manner. Subjects participating in the rationale condition received at the start of the experiment instructions providing them with a plausible account of the purposes of the study.
They were told that the investigation was being undertaken to determine "how personality impressions are formed." The experimenter told them he was interested in "what type of personality people attribute to an individual given that they have various sorts of information about him," and then went on to explain that they would be given a written article to read from which they were to attempt to obtain an impression of the author's personality. In addition to these instructions, just prior to the administration of the Opinionnaire, subjects were informed that it was for control purposes, "because [their] opinions about the article may have unintentionally influenced [their] personality ratings." Subjects participating in the no rationale condition received no such account of the experiment. They were told at the beginning simply that they would have to do "various written tasks," and the Opinionnaire was administered without explanation.

In the sensitization task condition, an attitude survey form labelled "Attitudes Toward Higher Education" (Appendix F) was administered prior to the communication. The form consisted of six statements, each statement followed by a five interval scale ranging from "definitely false" to "definitely true." Three of the statements were relevant to some point mentioned in the communication subjects would read later, and three were not relevant. The purpose of the survey form was to make subjects aware of the possibility that it was their opinions about the communication (which would subsequently be measured on the Opinionnaire), and not their impressions of the personality of the author of the communication, that were of interest to the experimenter. Subjects receiving the attitude survey in the rationale condition were told that it was given for control purposes. In the no rationale condition the purpose of the survey was not explained.
In the no sensitization condition the attitude survey was simply not administered. Following the instructions subjects began reading the communication.

A main effect for both variables, rationale and sensitization, was predicted. In addition, a negative correlation was expected to result from the level of suspicion - degree of agreement and level of suspicion - extremity of opinion analyses.

RESULTS

Questionnaire data are summarized in Table III. Note that responses to each question are tabled separately, and that the possible range of scores for each question is from one to four. Across all conditions, the scored responses to each question can be ranked from lowest to highest as follows: apprehension (x̄ = 1.44), suspicion (x̄ = 1.92), and guessing about the experiment's purpose (x̄ = 2.14). All these scores are moderately low with respect to the quantity in question, and the distributions are generally skewed. For example, the overall mean of 1.92 for the suspicion question indicates that on the average, subjects were less than "slightly suspicious" about the experimenter's intentions.

An analysis of variance was performed separately on each question (Table IV). (The data did not appear sufficiently compelling to perform a multivariate analysis of variance on the entire set of data.)

A two way analysis of variance performed on data from the suspicion question yielded no significant differences (all Fs less than 1). The predictions that the absence of a rationale and the presence of a sensitization task create more suspicion were not confirmed.

On the apprehension question, a two way analysis of variance resulted in a difference for the sensitization factor which approached conventional significance levels (F = 3.76, p < .07). This difference, however, is in the direction opposite from that predicted: subjects reported more apprehension without sensitization than with it.
This clearly does not confirm the hypothesis with respect to the arousal function of sensitization.

TABLE III
MEANS AND STANDARD DEVIATIONS FOR SUSPICION, APPREHENSION, AND PURPOSE QUESTIONS (N = 9 per cell)

                     RATIONALE                                     NO RATIONALE
GROUP                Suspicion     Apprehension  Purpose           Suspicion     Apprehension  Purpose
Sensitization        2.11 (1.45)   1.11 (.33)    2.44 (.88)        1.89 (1.17)   1.22 (.44)    1.78 (1.09)
No Sensitization     1.78 (.83)    1.56 (1.01)   2.33 (.87)        1.89 (1.05)   1.89 (1.27)   2.00 (1.12)

Note: Standard deviations are in parentheses.

TABLE IV
SUMMARY OF ANALYSIS OF VARIANCE: EXPERIMENT II

1. Suspicion

Source               df        MS        F
Total                35
Sensitization (A)     1        .25       <1
Rationale (B)         1        .03       <1
A x B                 1        .25       <1
Error                32        1.32

2. Apprehension

Source               df        MS        F
Total                35
Sensitization (A)     1        2.78      3.76*
Rationale (B)         1        .16       <1
A x B                 1        .10       <1
Error                32        .74

* p < .07

3. Purpose

Source               df        MS        F
Total                35
Sensitization (A)     1        .03       <1
Rationale (B)         1        2.25      2.27
A x B                 1        .25       <1
Error                32        .99

For the purpose question, the two way analysis of variance did not yield any significant differences. The only F greater than one was for the main effect of the rationale variable (F = 2.27, ns), but again, the ordering of means for this variable was opposite to that predicted (mean reported time spent guessing about the experimenter's purpose was 2.39 for the rationale condition, and only 1.89 for the no rationale condition). The predictions that the absence of a rationale and the presence of a sensitization task cause subjects to spend more time guessing about the experiment's purpose were not confirmed.

In sum, the factors of rationale for experiments and sensitization were not demonstrated to have the predicted effect upon subject suspicion, apprehension, or time spent attempting to guess an experiment's purpose. In fact, the only finding that approached significance was that subjects tended to be more apprehensive without sensitization. This finding is difficult to interpret.
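The arithmetic behind the reported F ratios and marginal means can be checked with a short sketch. This is only an illustration, not part of the original analysis: it assumes each F is the effect mean square divided by the error mean square, and that marginal means are unweighted averages of the equal-sized cells (N = 9). The values are transcribed from Tables III and IV; all names in the code are hypothetical.

```python
# Illustrative check of the Experiment II summary statistics.
# Values are transcribed from Tables III and IV; names are hypothetical.

# Error mean squares for each questionnaire item (Table IV).
MS_ERROR = {"suspicion": 1.32, "apprehension": 0.74, "purpose": 0.99}

# Effect mean squares for the two F ratios reported in the text.
MS_EFFECT = {
    ("apprehension", "sensitization"): 2.78,
    ("purpose", "rationale"): 2.25,
}


def f_ratio(ms_effect, ms_error):
    """F is the ratio of the effect mean square to the error mean square."""
    return ms_effect / ms_error


f_values = {
    key: round(f_ratio(ms, MS_ERROR[key[0]]), 2)
    for key, ms in MS_EFFECT.items()
}

# Cell means for the purpose question (Table III),
# keyed by (sensitization level, rationale level).
PURPOSE_CELL_MEANS = {
    ("sensitization", "rationale"): 2.44,
    ("sensitization", "no rationale"): 1.78,
    ("no sensitization", "rationale"): 2.33,
    ("no sensitization", "no rationale"): 2.00,
}


def marginal_mean(cell_means, level, position):
    """Average the cells that share one factor level (equal cell sizes assumed)."""
    values = [m for key, m in cell_means.items() if key[position] == level]
    return sum(values) / len(values)


if __name__ == "__main__":
    for key, value in f_values.items():
        print(key, value)
    for level in ("rationale", "no rationale"):
        print(level, marginal_mean(PURPOSE_CELL_MEANS, level, 1))
```

Because the cell means in Table III are rounded to two decimals, marginal means recomputed this way can differ from the reported values in the last digit.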
The second aspect of the analysis, correlating Questionnaire responses with degree of agreement and extremity of opinion on the Opinionnaire, was conducted in the following manner. Subjects were divided into two groups - suspicious and unsuspicious - based on their responses to the suspicion question on the Questionnaire. Subjects who indicated that they were "slightly," "moderately," or "very" suspicious were placed in one category, and subjects who indicated they were not suspicious were placed in the other category. There were 18 suspicious and 18 unsuspicious subjects.

Suspicion level was first correlated with degree of agreement with the communication. Responses to the first four statements of the Opinionnaire were summed (item #3 being reversed) to obtain an "agreement" score for each subject. Mean agreement for unsuspicious subjects was 18.00 and for suspicious subjects was 17.72 (the higher the score the more agreement). The point-biserial correlation for level of suspicion and degree of agreement was virtually nil (rpb = -.03); the predicted negative correlation was not obtained.

The fifth statement on the Opinionnaire was analyzed separately as it allowed a partial check on the validity of the Questionnaire. It was expected that a subject might generalize the "ulterior motives" pronouncement made by the statement so as to be applicable to the experimenter. Since a suspicious subject by definition suspects an experimenter's motives, suspicion ought to correlate positively with agreement with this statement. However, these two factors were not correlated (rpb = +.09). Evidence to support the validity of the Questionnaire as a measuring device was not obtained.

In order to test the level of suspicion-extremity of opinion hypothesis, Opinionnaire responses were rescored.
A response to either of the endpoint intervals of the seven-interval Opinionnaire scale was given the highest score, and a response to the "neutral" position was given the lowest score:

disagree | 3 | 2 | 1 | 0 | 1 | 2 | 3 | agree

Thus the higher the score the more extreme the opinion expressed (be it agreement or disagreement). Subjects' responses were summed across all five statements for the analysis. The predicted negative correlation between suspicion level and extremity of opinion was not, however, obtained (rpb = +.04).

In summary, no performance differences were found between suspicious and unsuspicious subjects.

DISCUSSION

This study was based on the principle that the stimulus to subject suspicion and speculation is to be found in certain common experimental practises rather than, for example, in a subject's level of experimental and psychological sophistication. The effect that two such practises - sensitization task and no rationale - have on arousing suspicion and speculation could not be determined in this study. Such results, although discouraging, do not necessarily invalidate the principle, and research into the causes of suspicion and speculation, because of the important implications, ought to continue.

The results of this study can perhaps be accounted for by the lack of divergence between the values of the variables investigated. First, the rationale-no rationale conditions may not have sufficiently differed due to a procedural requirement. It was necessary to tell subjects in the no rationale condition, as part of the instructions for filling out the Personality Impressions Scale, whose personality they were rating - otherwise they would have been unable to do the task. From these instructions they may have deduced that the purpose of the experiment had to do with personality impressions, and so in effect have been left with the same information as subjects in the rationale condition.
Thus the two conditions may have had no difference in fact.

Secondly, the sensitization manipulation may not have been a particularly strong manipulation. The attitude survey-Opinionnaire relationship may have been made too subtle for subjects to comprehend, although this seems unlikely as inspection of the materials will reveal. In retrospect, it can only be said that it behooves an investigator when studying a new area to choose unsubtle, divergent values for his variables.

The honesty of subjects' Questionnaire reports is also, of course, debatable. The use of extensive post-experimental interviews and questionnaires, as recommended by Levin (1961) and Orne (1962), is probably necessary to prevent responses which are too casual. Subject error may have obscured any treatment effects in the data as well as obviated the necessity of the correlational analyses.

The two studies reported in this paper, although predominantly unsuccessful in providing new knowledge in this area, will hopefully be instrumental in leading to new avenues of research. The study of demand awareness, in many ways meritorious, nonetheless may be restricted in generalizability. By studying all the hypotheses that subjects entertain rather than just the accurate ones, it may be possible to account for more instances of data contamination. It is of major importance, however, that future research in this area be directed towards determining the source of subjects' motivation to speculate and, if that source is found to be certain experimental practises, towards enumeration of those practises which arouse subjects to speculate. If investigators adopt this course of action the artifactual problem can perhaps be solved. Speculation-arousing stimuli can be controlled or preferably eliminated from research designs entirely.
One strategy for determining the source of subjects' motivation to speculate might entail replication of some of the demand awareness studies with demand characteristics present and with demand characteristics absent. If removal of demand characteristics results not only in reduction of subject awareness but also in a reduction in subject suspicion and speculation, it can be concluded that motivation to speculate is not a function of extra-experimental factors such as knowledge of experimental deceptive practices, but a function of stimuli or events occurring within particular experiments, and specific to those experiments.

Some experimental stimuli which may be instrumental in arousing subject speculation and which therefore may be of interest for further research are as follows:

1) Stress. An experiment that is stressful for subjects may arouse them to speculate about its purpose. It is difficult to undergo stress without sufficient cause, and because a subject participating in a stressful experiment may experience misgivings about his having volunteered to participate, he may attempt to counteract these feelings by attributing a purpose to the experiment that would justify his participation. Stressful experiments therefore may increase subjects' motivation to have hypotheses.

2) The unexpected. Subjects may become aroused to speculate about an experiment in which they have been surprised, especially when the surprise is an unpleasant one. For example, any large discrepancy between an experiment and information given subjects about that experiment at the time of recruitment may cause speculation, because of subjects' desire to anticipate future experimental surprises. Unannounced tasks may have a similar effect on subjects. Experimental practices which catch subjects off guard may put them on guard for the remainder of the experiment.

3) Inconsistency.
Any inconsistencies in the totality of stimuli subjects receive about an experiment may cause them to generate hypotheses about that experiment. Tasks that appear to be unrelated to the experiment's stated purpose, contradictory utterances on the part of the experimenter, and procedures which conflict with subjects' knowledge of "accepted experimental practice," may induce subjects to speculate.

Although there are other possibilities, investigation of the speculation arousing effects of these three classes of variables would seem particularly worthwhile. Coupled with investigation of the source of subjects' desire to figure out an experiment's purpose, this type of research may lead, if not to solution of this artifactual problem, at least to its further clarification.

FOOTNOTES

1 Orne asserts that demand characteristics are present in all experiments, but his argument does not appear to support his claim. In making this assertion, he argues:

    One of the basic characteristics of the human being is that he will ascribe purpose and meaning even in the absence of purpose and meaning. In an experiment where he knows some purpose exists, it is inconceivable for him not to form some hypothesis as to the purpose, based on some cues, no matter how meagre; this then will determine the demand characteristics which will be perceived by and operate for a particular subject. (1962, p. 780)

These cues upon which "some hypothesis" is based are not necessarily the same cues which function to convey the experimenter's "demands" to the subject: according to Orne's formal definition, demand characteristics cannot convey just any hypothesis; they can only convey the experimental hypothesis. In making this claim, Orne appears to have inadvertently extended his definition of demand characteristics to include any stimuli used by a subject as a basis for an hypothesis.
Although it is certainly the case that all experiments contain stimuli which a subject might use as a basis for an hypothesis, these stimuli are not necessarily demand characteristics.

2 The hypotheses of subjects do not necessarily "randomize out." The demand awareness literature has shown that a considerable proportion of subjects are capable of coming up with the same idea about an experiment's purpose. A considerable proportion of subjects in an experiment free of demand characteristics may come up with the same "wrong" idea about its purpose.

3 Unpublished manuscript entitled "The subject's hypothesis," 1968.

4 Originally two communications were to be used in this experiment. The second communication argued that the United States would contribute very little that is new to the exploration of outer space. As U.S. astronauts were making their first landing on the moon at the time this experiment was being conducted, this communication condition was eliminated.

5 Independent judges were not used to classify subjects with respect to hypotheses. Page and Lumia (1968) report interjudge reliability to be very high when scoring for demand awareness, and their scoring procedure differs little from the one used in this study.

6 Although this subject's performance on the Personality Impressions Scale might have been affected by his hypothesis, it is unlikely, because he "didn't see how it could be an intelligence test." The performance of a subject whose attentions are divided may be affected, but this study is not addressed to that problem.

7 The apprehension question was included because of its possible relationship to suspicion and speculation.

BIBLIOGRAPHY

Fillenbaum, S. Prior deception and subsequent experimental performance: the "faithful" subject. Journal of Personality and Social Psychology, 1966, 4, 532-537.

Holmes, D. Amount of experience in experiments as a determinant of performance in later experiments.
Journal of Personality and Social Psychology, 1967, 7, 403-407.

Kelman, H. C. Human use of human subjects: the problem of deception in social psychological experiments. Psychological Bulletin, 1967, 67, 1-11.

Levin, S. M. The effects of awareness on verbal conditioning. Journal of Experimental Psychology, 1961, 61, 67-75.

Masling, J. Role related behavior of the subject and psychologist and its effects upon psychological data. Nebraska Symposium on Motivation, 1966, 14, 67-103.

Orne, M. T. The nature of hypnosis: artifact and essence. Journal of Abnormal and Social Psychology, 1959, 58, 277-299. (a)

Orne, M. T. On the social psychology of the psychological experiment: with particular reference to demand characteristics and their implications. American Psychologist, 1962, 17, 776-783.

Orne, M. T. Demand characteristics and the concept of quasi-controls. In Rosenthal, R. and Rosnow, R. L. (Eds.) Artifact in Behavioral Research. New York: Academic Press, 1969. Pp. 143-179.

Page, M. M. Modification of figure-ground perception as a function of awareness of demand characteristics. Journal of Personality and Social Psychology, 1968, 9, 59-66.

Page, M. M. Social psychology of a classical conditioning of attitudes experiment. Journal of Personality and Social Psychology, 1969, 11, 177-186.

Page, M. M. Role of demand awareness in the communicator credibility effect. Journal of Social Psychology, in press.

Page, M. M. and Lumia, A. R. Co-operation with demand characteristics and the bimodal distribution of verbal conditioning data. Psychonomic Science, 1968, 12, 243-244.

Papageorgis, D. Effects of disguised and persuasion contexts on beliefs. Journal of Social Psychology, 1970, 80, 43-48.

Riecken, H. W. A program for research on experiments in social psychology. In N. F. Washburn (Ed.) Decisions, Values and Groups. Vol. 2. New York: Pergamon Press, 1962. Pp. 25-41.

Rosenberg, M. J.
When dissonance fails: on eliminating evaluation apprehension from attitude measurement. Journal of Personality and Social Psychology, 1965, 1, 28-42.

Schultz, D. P. The human subject in psychological research. Psychological Bulletin, 1969, 72, 214-228.

Sherman, S. R. Demand characteristics in an experiment on attitude change. Sociometry, 1967, 30, 246-261.

Silverman, I. Role-related behavior of subjects in laboratory studies of attitude change. Journal of Personality and Social Psychology, 1968, 8, 343-348.

Silverman, I. and Regula, R. C. Evaluation apprehension, demand characteristics, and the effects of distraction on persuasibility. Journal of Social Psychology, 1968, 75, 273-281.

Stricker, L. J., Messick, S., & Jackson, D. N. Suspicion of deception: implications for conformity research. Journal of Personality and Social Psychology, 1967, 5, 379-389.

APPENDICES

A. Communication
B. Personality Impressions Scale
C. Opinionnaire
D. Opinionnaire - Experiment II
E. Questionnaire
F. Attitude Survey

APPENDIX A

Recently, there has been considerable discussion and debate about higher education. Here, we wish to focus on a single misapprehension that seems to have gained some acceptance by the public. This is the false conclusion that in the near future college graduates will need to undertake advanced graduate study before they are considered eligible for many jobs or positions open to them today. On the face of things, this contention may appear plausible; closer examination, however, reveals that it is incorrect, and that there is no reason to believe that the college degree is about to lose its status.
Since beliefs about the status of college degrees are of considerable importance in determining educational policies, we should briefly review some evidence that shows that undergraduate college degrees will remain a sufficient qualification for employment.

To begin with, the erroneous belief that advanced graduate degrees will be essential for employment in the future has developed as a result of misapplication of a familiar analogy: after all, was not the high school diploma sufficient some time ago, and is it not now necessary to have a college degree in addition to the diploma? In many ways, the diploma/college degree situation is indeed true. On the other hand, it provides no justification for drawing a similar analogy between the college degree and advanced graduate degrees. Many jobs today do require skills that are not taught in high school and are acquired in college. These same jobs, however, do not require graduate-level skills, and hence graduate work cannot become a realistic requirement for them.

A second, and equally important reason behind the insistence on college attendance that we find today has nothing whatsoever to do with education and skills, but is simply a result of the inability of the labor market to absorb too many people at any one time. College attendance increases the age at which a person enters the labor market, and this increase in minimum age is necessary to prevent the unemployment and low wages that would result from too many young people seeking employment at once. A very similar phenomenon is found in the trend toward an earlier retirement age. This too shrinks the size of the available labor force, and, again, reduces pressures that the economy would have been unable to bear.
Finally, let us note that additional pressures to shrink the labor force will appear in the future. These will be met by further reductions in the retirement age and not by raising the age of entry into the labor market. This second alternative is impossible because it would tend to eliminate the most productive, healthy, and recently trained segment of skilled workers and professionals.

Thus, it can be seen that the belief that college graduates will soon need advanced graduate training before they are considered eligible for many positions open to them today is completely unfounded. College attendance and graduation will continue to be meaningful and important employment criteria. Advanced degrees will continue to function as they do now: they will be required, as they are now, only of a few professionals engaging in highly specialized skills; as for example, physicians and top-level scientists.

APPENDIX B

Number ______

PERSONALITY IMPRESSION SCALE

Data on Rater

1. Age ______
2. Sex ______
3. Psychology course(s) in which presently enrolled (course # and section) ______
4. Have you ever participated in a psychology experiment before? ______
5. Did you know anything about this experiment before you came? ______

PERSONALITY IMPRESSION SCALE CONT'D

Instructions

On the next page there are a number of scales. Adjectives descriptive of personality traits appear at the endpoints of each scale. Each scale is marked into seven intervals. Place an "X" inside the interval that best reflects your impression of the author's personality. To familiarize you with the use of the scale, here is an example.
Suppose you were asked to give your impression of the author's height.

short |___|___|___|___|___|___|___| tall

If your impression is that the author is tall, place an "X" at the end of the scale closest to the word "tall", as follows:

short |___|___|___|___|___|___|_X_| tall

If your impression is that the author is short, place an "X" at the end of the scale closest to the word "short", as follows:

short |_X_|___|___|___|___|___|___| tall

If your impression is that the author is neither tall nor short, but of average height, place an "X" at the midpoint of the scale, as follows:

short |___|___|___|_X_|___|___|___| tall

In this fashion, place an "X" inside an interval in every one of the scales provided on the following page. Don't restrict yourself to marking only those intervals covered in the instructions - place an "X" in any one of the seven intervals you wish. Work fairly rapidly. Usually your first impression is the best guess. After you have finished marking your impressions, but not before, answer the question that appears on the last page of this form.

PERSONALITY IMPRESSION SCALE CONT'D

III. SCALE

[Twelve numbered seven-interval scales; the grid and several endpoint labels are illegible in the source. Legible left-hand endpoints: 1. Assertive; 4. Stern; 6. Eccentric; Sociable; Mature; Rigid. Legible right-hand endpoints: Submissive; Responsible; Boorish; Kindly; Excitable; Conventional; Unintelligent; Cautious; Reclusive; Adaptable.]

PERSONALITY IMPRESSION SCALE CONT'D

IV. STRENGTH OF IMPRESSION

How strong was your impression of the personality of the author of this article?

quite weak |___|___|___|___|___|___|___| quite strong

APPENDIX C

Number ______

OPINIONNAIRE

INSTRUCTIONS

On the next page are statements, and each statement is followed by a seven interval scale. The left hand side of the scale is marked "disagree" and the right hand side marked "agree".
The scale looks like this:

disagree |___|___|___|___|___|___|___| agree

Place an "X" inside the interval that best reflects how much you agree or disagree with the statement that precedes it. To familiarize you with the use of the scale, here is an example:

Brunettes have more fun.

disagree |___|___|___|___|___|___|___| agree

If you are in strong agreement with the statement "Brunettes have more fun", place an "X" at the extreme right of the scale, as follows:

disagree |___|___|___|___|___|___|_X_| agree

If you are in moderate disagreement with the statement, place an "X" in the next to last interval on the left side of the scale, as follows:

disagree |___|_X_|___|___|___|___|___| agree

If you are in mild agreement with the statement, place an "X" in the interval just to the right of center, as follows:

disagree |___|___|___|___|_X_|___|___| agree

Mark each scale on the following page in the same fashion, depending on how much you agree or disagree with the statement that precedes it. Please mark all items.

OPINIONNAIRE

The article, on the whole, was a good one.
disagree |___|___|___|___|___|___|___| agree

The evidence presented by the author was objective and factual.
disagree |___|___|___|___|___|___|___| agree

The arguments used by the author were designed to play on the reader's emotions.
disagree |___|___|___|___|___|___|___| agree

"The belief that advanced graduate study will soon be essential for employment is false".
disagree |___|___|___|___|___|___|___| agree

The author of the article was sincere.
disagree |___|___|___|___|___|___|___| agree

APPENDIX D

OPINIONNAIRE

1. The article, on the whole, was a good one.
disagree |___|___|___|___|___|___|___| agree

2. The evidence presented by the author was objective and factual.
disagree |___|___|___|___|___|___|___| agree

3. The arguments used by the author were designed to play on the reader's emotions.
disagree |___|___|___|___|___|___|___| agree

4. The conclusions arrived at in the article are essentially correct.
disagree |___|___|___|___|___|___|___| agree

5. The author obviously had ulterior motives for writing the article.
disagree |___|___|___|___|___|___|___| agree

APPENDIX E

Number ______

QUESTIONNAIRE

Please answer each of the following questions by placing an "X" in the appropriate interval. When necessary, explain your answer in the space provided below each question.

1. How careful were you in following instructions in this experiment?

|______________|______________|______________|______________|
 not at all     a little bit   moderately     very
 careful        careful        careful        careful

2. How suspicious were you of the experimenter's intentions?

|______________|______________|______________|______________|
 not at all     slightly       moderately     very
 suspicious     suspicious     suspicious     suspicious

3. How apprehensive were you about doing any of the tasks in this experiment?

|______________|______________|______________|______________|
 not at all     slightly       moderately     very
 apprehensive   apprehensive   apprehensive   apprehensive

4. How much time did you spend attempting to guess the experiment's purpose?

|______________|______________|______________|______________|
 none at all    a small amount a moderate     a lot
                               amount

APPENDIX F

Number ______

ATTITUDES TOWARD HIGHER EDUCATION

Instructions: Each of the following statements is followed by a five interval scale ranging from "definitely false" to "definitely true". Place an "X" in the interval that most closely agrees with your opinion about the statement that precedes it. Please answer all items.

1. The increasing demand for higher education is a direct result of the inability of young people to find work.

|____________|____________|____________|____________|____________|
 definitely   probably     uncertain    probably     definitely
 false        false                     true         true

2. The purpose of education should be to teach people how to live rather than how to earn a living.
|____________|____________|____________|____________|____________|
 definitely   probably     uncertain    probably     definitely
 false        false                     true         true

3. Most students, had they any other choice, would not attend university.

|____________|____________|____________|____________|____________|
 definitely   probably     uncertain    probably     definitely
 false        false                     true         true

4. Universities will soon devote increased effort to students in their graduate programs.

|____________|____________|____________|____________|____________|
 definitely   probably     uncertain    probably     definitely
 false        false                     true         true

5. University education should be more concerned with the pressing issues of the day.

|____________|____________|____________|____________|____________|
 definitely   probably     uncertain    probably     definitely
 false        false                     true         true

6. A high school diploma is no longer sufficient to obtain a well-paying job.

|____________|____________|____________|____________|____________|
 definitely   probably     uncertain    probably     definitely
 false        false                     true         true


