SIECUS Publications

Back to But Does It Work?

But Does It Work?: Improving Evaluations of Sexuality Education

Symposium Participants;   Life Behaviors Of A Sexually Healthy Adult;   Recommended Resources;   Symposium Highlights;   Methods of Measurement

(Published in the SIECUS Report, Volume 25, Number 6 - August/September 1997)

But Does It Work in Adobe Acrobat But Does It Work?

With support from the California Wellness Foundation, SIECUS convened a symposium of 15 of the nation's most prominent researchers in sexuality education and teenage pregnancy prevention in October 1996 to help improve evaluations of sexuality education programs.

They addressed such questions as:

Unfortunately, evaluations of comprehensive sexuality education have tended to focus primarily on whether the programs have helped young people delay sexual activity and prevent unwanted pregnancy and disease. Evaluators have often defined program effectiveness as helping young people either postpone sexual intercourse or increase the use of contraception and condoms.

Other program goals of comprehensive sexuality education--such as helping young people develop an appreciation of their bodies or communicating effectively with peers and partners--are often overlooked in evaluations of sexuality education programs.

Comprehensive sexuality education has four primary goals:

The National Guidelines Task Force, convened by SIECUS to develop the Guidelines for Comprehensive Sexuality Education: Kindergarten through 12th Grade, identified 36 life behaviors of a sexually healthy adult that are the desired results of a K-12 sexuality education program.

They include interacting with both genders in respectful and appropriate ways, viewing family as a valuable source of support, practicing effective decision-making, and expressing one's sexuality while respecting the rights of others. Important outcomes include using contraception, preventing sexual abuse, avoiding sexually transmitted diseases, and practicing health-promoting behaviors, but they are not the only outcomes of a comprehensive program.

RECENT EVALUATION REVIEWS REFLECT BROAD GOALS

Recent reviews of evaluations of effective sexuality education, teenage pregnancy prevention, and HIV prevention programs have found that quality sexuality education programs:

These reviews also describe common characteristics of effective programs. Specifically, they:

FEW QUALITY EVALUATIONS

The symposium participants convened by SIECUS agreed that there are only a few well-designed, well-implemented, and well-funded evaluations of sexuality education. They also agreed that most published evaluations primarily determine the impact of programs based on three goals:

These goals are extremely difficult to attain given the limited nature (five to 12 sessions) of most sexuality education programs.

There are few published evaluations concerning many of the desired outcomes of sexuality education such as appreciation of one's body; identifying and living according to one's values; developing and maintaining meaningful relationships; avoiding exploitative or manipulative behaviors; and engaging only in consensual relationships.

Generally, evaluators have not assessed the effect of such programs on helping young people achieve the life behaviors of a sexually healthy adult, and they have not developed instruments to measure these behaviors.

In addition, evaluations have usually only examined short-term effects. Most include immediate post-tests that do not allow for passage of time. Only a few examine the impact of the program beyond a year and a half. But, as the sexuality educators at the symposium articulated, the young people may not experience results from the program until they start dating, leave home, or become adults.

Further, few program evaluations have been replicated. In those cases when replication has occurred, the results in another setting are often not as positive. As one symposium participant pointed out, "Successful programs may have more to do with individual teachers and their charisma, than with the actual program components." Conversely, one participant warned about the challenge of replicating a program with fidelity: "Is there really integrity in a replication from one site to another? My hunch is that 90 percent of the time, the only thing the programs have in common is the booklet the kids receive."

As a result of many of these issues, comprehensive, methodologically rigorous evaluations are extremely costly and time consuming, and are, therefore, inaccessible to most programs. Participants felt it might be valuable to conduct a longitudinal 20-year study comparing young people from a community who had attended a comprehensive sexuality education program with young people from a similar community who have received minimal education. They recognized, however, that such an evaluation was unlikely to occur.

UNIQUE CHALLENGES

Symposium participants agreed that comprehensive sexuality education presents unique challenges for program evaluations.

First, they acknowledged that changing human behavior is difficult and that simple educational efforts themselves have often met with limited success. They pointed out that most school-based courses are considered successful if they increase a student's knowledge and performance on standardized tests. Only health education efforts are held to a standard of behavior change outside of the classroom.

Second, they acknowledged that, unlike other academic programs, young people often learn about sexuality from a wide range of sources outside of school (families, television, movies, advertising, peers, magazines, partners, church, and youth organizations). Yet evaluations tend to focus only on the impact of the school program without considering the other influences: "Studies typically measured only the incremental effect of the intervention and not the cumulative effect of that intervention plus whatever sexuality education or reproductive health service the youth had previously or subsequently received. If the cumulative effect of all reproductive health education and services were measured, results would probably be stronger." 4

Both youth and adults approach sexuality education with established attitudes and beliefs. People come into programs with their own previous knowledge, their own family values, their own cultural values, their own experiences, and their own fears. The reality is that students in any class have very different needs for sexuality information. Some need basic information about their bodies; some need to know how to handle a sexually abusive stepparent; some need to look at their personal values; and some need to know how to set sexual limits with their partner. In addition, some are not dating; some are abstinent from all sexual behaviors; some are experimenting sexually; and some are having sexual intercourse. Most are heterosexual although some are bisexual, gay, lesbian or questioning their orientation.

Several participants commented on the fact that sexuality education affects people in different ways. In observing highly competent sexuality educators, a person senses that the young people in the class are excited, challenged, and learning. Yet, formal evaluations of these same programs have not always demonstrated positive results. The reason? As one participant noted, "For one person, one thing happens; for another person, something else happens. But if you measure only one behavior, and it was significant for only a few kids, it's lost and we don't capture it."

Participants said that evaluators need to better assess students' knowledge and attitudes at the baseline. What issues do they have coming into the program? Are they ready to learn what the program has to offer? How are they changed as a result of the program?

In addition, previous exposure to sexuality information complicates evaluation methodology. This may make it impossible to have true random experimental designs with random assignments to programs. Because people are exposed to both formal and informal "sexuality education" throughout their lives, there is probably no such thing as a true control group that has not been exposed to any information.

Third, classroom behaviors are often only proximal to behaviors in real life. For example, "if someone can negotiate condom use in a skill-building exercise in a classroom, do we really know whether they can do it in real life?" It is easier to assess whether a child can read or add a column of numbers both in school and outside of school than it is to know if they will practice effective decision-making, or sexual limit-setting outside the classroom.

Fourth, sexuality education is one of the only school-based programs that measures behaviors outside of the classroom. Most school-based programs are simply evaluated by how young people score on tests. Although studies show that sexuality education increases knowledge, it is sometimes seen as not effective because it does not change sexual behaviors. In the words of one participant, "We don't measure the success of the math curriculum on whether people can and do balance their checkbooks."

Fifth, sexual behaviors involve more than the classroom participant. In most cases, only one member of the relationship has participated in the program; yet, that person is also expected to influence the behavior of their partner.

Sixth, asking young people about their sexual behaviors raises particular methodological concerns. Research that relies on self-report about sexual behaviors must take into account self-report bias and the problem of "social desirability" (giving answers that they believe the researcher wants to hear). At pre-test and post-test, participants may under-report their involvement in risky or unhealthy behaviors, and, thus, reduce the measured impact of the program.

Seventh, there are often policy barriers to high quality evaluations of sexuality education programs. Evaluations of sexuality education may be too controversial in some communities to enable a thorough study. Legislation within many states, as well as the proposed Family Privacy Act at the federal level, limit the types of data that can be collected on sexual behavior. In some states, researchers must obtain parental consent before they can ask a young person about their sexual attitudes and behaviors.

Eighth, the group cautioned about treating sexuality education as synonymous with teenage pregnancy or HIV-prevention efforts. Teenage childbearing is affected by many social and economic factors such as poverty, racism, sexism, job opportunities, past history of sexual abuse, family stability, school failure, and risk-taking behaviors. Sexuality education is a necessary component of teenage pregnancy prevention efforts, but it cannot solve the problem alone.

One participant noted, "I say to communities, do you think it is possible to prevent pregnancies if our young people don't know where babies come from?...We ought to be asking communities, 'Do you have the sexuality education in place so that you're sure all young people are sexually literate?' But communities need to understand that the goals of teenage pregnancy prevention are very specific and not solved by short-term school-based interventions."

Another said that "simply addressing sexual beliefs, attitudes, and skills--and even improving access to contraception--will not address many of the factors leading to teenage childbearing, may not significantly change young people's motivation to avoid childbearing, and are unlikely to significantly reduce long-term sexual risk-taking."

EVALUATION METHODOLOGY

Rigorous evaluations share common criteria. Quality evaluations share similar characteristics. They

It is not easy or always possible, however, to incorporate all of these criteria into every study design because of the significant time and money investments.

Symposium participants cautioned that not all programs require this level of evaluation. They therefore mad some recommendations for both evaluators and program personel that follow.

RESEARCH METHODS

Symposium participants did not extensively address basic research methods. The "Recommended Resources" listed in this article will, however, provide extensive references for people new to program evaluation. In addition, "Methods of Measurement" provides a comparison of some of the most frequently used measurements that evaluators might consider in developing designs.

Evaluation designs can be divided into two categories: quantitative and qualitative. By using a quantitative design, the evaluator can answer questions that have to do with "how well" a program works, "to what extent" a program achieves its goals and objectives, "how much" learning and/or behavior change takes place, and "to what degree" to attribute measured outcomes to the evaluated program.

Quantitative evaluations are often very powerful and well-controlled. They cannot, however, answer questions about "why" a program is having some effect or "how" the program has the impact that it does. If a program is found effective (or ineffective) in reaching its objectives, is it because of one specific aspect such as the teacher, the setting, the participants, the environment, or some combination of these factors? For these types of questions, qualitative evaluation models may prove more useful.

Participants at the symposium urged that evaluations of comprehensive sexuality education programs combine both quantitative and qualitative approaches. Although these designs are often seen as opposites, evaluations are, in fact, often made much stronger by combining methods from both approaches--a strategy known as triangulation.

Qualitative methods are often extremely useful in helping to form the questions for an evaluation, in describing processes in particular programs, and in elucidating findings from traditional quantitative methodologies. In many cases, the emphasis of qualitative evaluation is on producing information that is useful for the further development of programs. Such evaluations typically do not lead to conclusions about whether a program was successful or unsuccessful, but, instead, produce detailed descriptions of how participants experience a program and its operation. From these descriptions, program directors can make decisions about future directions. Data collected from these evaluations are often given to program staff as the process is ongoing rather than after the data is collected. 6

There are three usual objectives of qualitative evaluations.

LIFE BEHAVIORS OF A SEXUALLY HEALTHY ADULT

Participants at the symposium spent considerable time on how to best evaluate progress toward achieving the desired life behaviors of a sexually healthy adult. The discussion included using established instruments and data collection techniques to measure variables specific to sexuality education. Participants also discussed the challenge for researchers to develop new instruments to measure variables not yet studied.

Discussions about measuring the impact of programs yielded the following general suggestions:

The group also spent time thinking about possibilities for measuring specific life behaviors, including adapting current instruments and developing new measurement approaches. The following ideas relate to some of the behaviors that might be measured. Many of the suggestions are, however, applicable to other variables.

Appreciation of one's own body. There are a number of well-tested scales available to measure attitudes about one's self and one's body image. Most of them focus on post-pubertal body image. 8 Because body image is an important variable that is related to other aspects of self-image, programs may want to focus on body image issues earlier in young people's lives. The development of scales that measure pre-pubertal body image (in males and females) is an important area for instrument development.

Communicating effectively with parents, peers and partners. There are many instruments to measure general communication skills with children, with adults, and across age groups. What is largely missing, however, are instruments that measure specific communication skills about sexuality issues. While some communication skills are applicable to many different types of situations, there may be specific skills related to communicating around sexual behaviors (for example, communicating about condom use or establishing sexual limits). Specific curriculum content can target them and distinct instruments can measure them. In creating these instruments, two considerations are paramount: First, how does one measure the quality and content, as well as the quantity of communication? Second, when measuring communication skills it is important to assess the communication from the perspectives of all of the people involved in the interaction.

Practicing effective decision-making. A suggested technique for measuring this skill is to provide scenarios to students that require them to decide what they would do in a certain situation, how they would respond, and what alternatives they would consider.

Identifying and living according to one's values. One suggestion for measuring this behavior is to have young people identify their three most important values around sexuality and then to ask them, "How does this influence your life?" Evaluators could use semantic differential scales that present two polar opposites along a continuum in relationship to some value to discover students' values. For example, a scale could have "sexually ethical" at one end and "sexually unethical" at the other. A student would choose a place along the continuum that best represented his or her values. The evaluator could then follow up with a question such as "How certain are you that this is your value?"

Interacting with both genders in respectful and appropriate ways. One problem with measuring this and certain others is in defining the related concepts in a measurable way, an action known as operationalization. Meeting participants acknowledged that, in some cases, it is easier to measure the absence of certain negative behaviors. For example, an individual could observe interactions of participants and define "respectful and appropriate" as the absence of physical violence, verbal abuse, exploitation, bullying, lying, and tricking. While these are easier to define, there are potential problems in defining a concept by what it is not.

One way of measuring positive indicators of relationships among people is to use a variety of "social distance" scales that examine the degree to which people are comfortable being close to other groups of people. 9 They could, for example, measure a person's comfort with people of the other gender as well as people of different sexual orientations, people with different sexual values, and people living with AIDS.

Participants warned, however, that some of the life behaviors do not easily lend themselves to evaluation. They also cautioned that some life behaviors are no easier to achieve than preventing teenage pregnancy. In the words of one participant, "Which is harder, preventing pregnancy or promoting gender equality?" One asked, "Should we really hold a 10-hour program to the standard of seeing pre-/post-test change in gender relationships?"

RECOMMENDATIONS FOR IMPROVING EVALUATIONS

Program directors, funding agencies, and evaluation experts are interested in evaluations for different reasons. These are the words of symposium participants:

An Evaluation Expert

"The scientist in me wants to know, 'Does a program work?' And if so, 'Why?' And if not, 'Why not?'"

A Program Director

"Program directors are interested in evaluations to improve the performance of their programs, to see whether they are really reaching the people they want to reach and whether the services they want to deliver are delivered to the right numbers of the right people effectively. . . . [I]n other words, 'Is my program doing a good job and is it doing what I'm hoping it's doing?'"

A Foundation Executive

"Is our support of a program really making a difference in the lives of the people we care about?"

The symposium participants offered suggestions for improving evaluations of comprehensive sexuality education. The recommendations are targeted at three main groups: program directors, evaluators and funders. Many apply to all three.

RECOMMENDATIONS FOR PROGRAM DIRECTORS

Recommendations for program directors include:

RECOMMENDATIONS FOR EVALUATORS

Recommendations for evaluators include:

RECOMMENDATIONS FOR FUNDERS

Recommendations for funders include:

CONCLUSION

SIECUS urges program directors, evaluation experts, and foundations to assess how they can better evaluate the impact of sexuality education programs.

Such evaluations must move beyond simpistic "did you like it?" post-test questionnaires as well as more sophisticated designs that measure only the impact of the incidence of intercourse and contraceptive use.

Practitioners have a critical need to understand what makes a program effective (and for which students) and how to improve their education programs.

The symposium and this report offer an important first step in this effort.

Editor's Note: SIECUS sincerely thanks the California Wellness Foundation for its grant which funded this symposium and report.

References

  1. National Guidelines Task Force (1996), Guidelines for Comprehensive Sexuality Education: Kindergarten--12th Grade, 2nd Edition, Sexuality Information and Education Council of the United States.

  2. A. Grunseit and S. Kippax (1993), "Effects of Sex Education on Young People's Sexual Behaviour," unpublished review commissioned by the Global Programme on AIDS, World Health Organization, July 1993; K. A. Moore, et al (1995), Adolescent Pregnancy Prevention Programs: Interventions and Evaluations, Child Trends, Inc., Washington, DC; J. J. Frost and J. D. Forrest (1995), "Understanding the Impact of Effective Teenage Pregnancy Prevention Programs," Family Planning Perspectives25(5): 188-96; and D. Kirby et al, "School-Based Programs to Reduce Sexual Risk Behaviors: A Review of Effectiveness," Public Health Reports, 109(3), 1994, pp. 339-60.

  3. D. Kirby et al, "School-Based Programs to Reduce Sexual Risk Behaviors."

  4. D. Kirby, No Easy Answers: Research Findings on Programs to Reduce Teen Pregnancy (Washington, DC: The National Campaign to Prevent Teen Pregnancy, 1997).

  5. Ibid, p. 3.

  6. J. R. Schuerman, Research and Evaluation in the Human Services (New York: Macmillan, 1983), pp. 113-14.

  7. Ibid., p. 114.

  8. M. Story, Body Attitudes Questionnaire, in C. Davis, W. Yarber, and S. Davis, editors, Sexuality-Related Measures: A Compendium (Syracuse, NY: Syracuse University, 1987), p. 40.

  9. E. R. Babbie, Survey Research Methods (Belmont, CA: Wadsworth Publishing Co., 1973).


Debra W. Haffner, M.P.H.
SIECUS President & CEO, and
Eva S. Goldfarb, Ph.D.
Assistant Professor, Montclair State University,
Upper Montclair, NJ

page divider

Home | Publications | Support SIECUS | Links | About SIECUS | Site Navigation | Search | Donate
Policy & Advocacy | Media | International | Library | Youth Development | School Health Education Clearinghouse

copyright © 1996-2004, SIECUS
Web Master: siecus@siecus.org

Back to SIECUS home page