|
|
|
 |
|
 |
 |
  |
Approaches to Value-Added Assessment |
Value-added assessment focuses on the impact of higher education on
student learning. Unlike most quality and accountability measures, it speaks directly to the most important product of undergraduate education, the development
of student knowledge and skills. Set in proper context, value-added assessment allows true comparisons of the difference college makes to students across
institutions and institutional types, instead of simply reflecting institutional resources and/or reputation.
There are three general approaches to estimating the institutional “value-added” to student learning. Each analyzes a slightly different part
of the picture, and they are complementary, not perfectly correlated. Each has strengths as well as challenges and limitations. |
 |
 |
 |
 |
|
Direct value-added assessment |
This method estimates institutional effect on student learning by measuring and comparing what students know and can do at two points in time—for
example, at the beginning and end of college. The difference between the two measures represents the learning gain and serves as an estimate of the institutional
contribution to student learning that can be compared across similar institutions. If comparisons are to be made across different institutional types, more
complex models are needed that take into account student academic abilities.
In the absence of measures at two points in time, it is possible to derive measures that enable comparisons of the institutional value-added. That is, one
could estimate institutional effect on student learning by comparing actual learning outcomes at the end of college to learning outcomes that would be predicted
on the basis of student characteristics. The difference between actual and expected outcomes serves as an estimate of the degree to which the institution
over- or under-performs in developing the abilities of its students. Examples:
The best example of direct value-added assessment is the Collegiate Learning Assessment (CLA), an outgrowth of RAND’s Value Added Assessment Initiative
(VAAI) that has been available to colleges and universities since spring 2004. The test goes beyond a multiple-choice format and poses real-world performance
tasks that require students to analyze complex material and provide written responses (such as preparing a memo or policy recommendation). Other instruments
for direct assessment include ACT’s Collegiate Assessment of Academic Proficiency (CAAP), the Educational Testing Services’s Academic Profile
and its successor, the Measure of Academic Proficiency and Progress (MAPP), introduced in January 2006. Around for more than a decade, these assessments
offer tools for estimating student general education skills.
Both Alexander Astin of the Higher Education Research Institute at the University of California at Los Angeles and The Education Trust have developed methodologies
for deriving measures of institutional effect. Focusing on another student outcome measure—the graduation rate, they controlled for such factors as
median ACT/SAT scores and percentage of students receiving Pell Grants. This allowed them to predict expected graduation rate outcomes and compare this
to actual results. Though there are additional challenges, researchers could explore ways to do the same for student learning measures. Similar controls
could be used to derive “value-added” estimates for student learning, even in the absence of “before” and “after” measures.
When two data points are available, this rich contextual data on student and institutional characteristics could be used to develop models and benchmarks
for comparing results from different institutional types.
Strengths:
This approach offers a direct measure of college-level learning, since it occurs throughout the undergraduate experience. It serves multiple stakeholders
and purposes, including accountability, state policy development, and institutional improvement.
It takes into account differences in student input and lends itself to the development of models and benchmarks for diverse institutional types.
Because it is explicit about collegiate-level skills, it facilitates academic alignment with the K-12 sector.
Challenges/Limitations:
Historically, there has been a great deal of autonomy concerning assessment of student learning at the classroom, department, and institutional levels.
Although learning assessment is increasingly a part of the accreditation process, many faculty and administrators remain resistant to state-level “interference”
in academic matters such as assessment of learning.
More research is needed on how to develop benchmarks and models for different institutional types.
Unlike the indirect approach, the direct approach does not point to specific directions for institutional improvement.
Unlike the applied approach, the direct approach does not measure how college learning relates to real-world performance. It cannot capture the full impact
of the college experience that continues to unfold as graduates gain maturity and experience. |
 |
 |
 |
 |
|
Indirect measures of value-added |
Rather than directly examining student learning, this approach measures the student behaviors and institutional actions that are known to correlate with
student learning and success in college. These measures of “good practice” are treated as proxies for student learning—to the extent that
they are in place, it is expected that greater student learning will occur. Examples:
This approach has come to be nearly synonymous with the National Survey of Student Engagement (NSSE). Begun in 2000, NSSE has developed benchmarks and instruments
that capture the dimensions of student engagement that correlate with student learning and success: level of academic challenge, active and collaborative
learning, student-faculty interaction, enriching educational experiences, and supportive campus environment. By collecting student self-reports on 42 aspects
of their undergraduate experiences, institutions can see how well they are doing and compare the results to those of their peers.
Strengths:
This method provides a useful proxy for direct learning assessment.
It yields useful information about specific institutional strengths and weaknesses and lends itself especially well to institutional improvement efforts.
Through Project DEEP (Documenting Effective Educational Practice), the NSSE Institute for Effective Educational Practice has examined the workings of
20 successful institutions and is sharing findings in order to help institutions identify strategies for using NSSE data to increase student success.
The approach is useful for diverse institutional types and peer information is available.
Challenges/Limitations:
Indirect assessment does not—and never will—measure actual student learning. In order to identify specific learning strengths and gaps, other
types of data would be needed.
There are always questions about the reliability of self-reports. NSSE developers have addressed this issue by identifying five conditions under which
self-reports are valid:
(1) when the information is known to the respondent;
(2) when the questions are clear and unambiguous;
(3) when the questions refer to recent activities;
(4) when respondents think the questions merit a serious and thoughtful response; and
(5) when answers do not threaten, embarrass, or violate the privacy of respondents.
|
 |
 |
 |
 |
|
Applied value-added models |
Instead of examining what happens during the college years, this approach gauges the impact of higher education in an applied setting, after-the-fact. For
example, alumni would be interviewed about the extent to which their education prepared them for jobs and employers would be interviewed about the extent
to which their employees have the necessary knowledge and skills for the job. Examples:
The National Center for Postsecondary Improvement designed the Collegiate Results Survey, a tool that interviews alumni to assess how postsecondary education
affected academic achievement and employment outcomes. First administered in 1999, the survey asks college graduates six to nine years out of college to
report on their occupations and the skills used in the workplace. It also calls for respondents to evaluate their ability to perform a variety of real-life
tasks. Resulting data have been used to establish unique institutional profiles to help consumers make better choices, now available through college guidebook
publisher Peterson’s . Institutions can work with Peterson’s for self-study purposes.
Strengths:
This method measures outcomes several years out of college, when institutional effects have had time to more fully develop. It reflects implications for
the real world, as assessed by employers and alumni in the workforce.
There is potential for development of benchmarks and models for different institutional types, based on existing data.
Challenges/Limitations:
Given the passage of time and intervention of other factors, it may be difficult to tease out institutional effects on students.
There are questions about the reliability of self-reports; given the passage of time, alumni may not be able to report accurately about college experiences. |
 |
 |
 |
 |
|
Analysis |
In addition to weighing the strengths and limitations of the three approaches, policymakers and higher education leaders must answer two sets of basic questions.
One is about the why of value-added assessment—how will these assessments contribute to the fulfillment of the state’s human capital goals and
priorities? The other pertains to the how of value-added assessment—what resources must be tapped and what obstacles overcome for the program to be
relevant and credible? This sets the stage for a policy conversation focused around two primary issues: purpose and implementation. Purpose:
The first—and most obvious—questions surrounding the development of a value-added assessment system relate to intent. How will the information
generated by such a system be used? How will it fit within the state’s overall education policy framework? How will it be linked with other primary
elements of higher education policy? Clarity and consensus in this area are essential to effective program design and execution, particularly on the following
points:
a. Institutional improvement vs. public accountability
A value-added assessment program can—and arguably should—satisfy both objectives, but the relative emphasis will vary across states and systems.
For example, states or systems wishing to focus on campus learning environments may concentrate more on indirect measures, while those more concerned
with workforce readiness may give more weight to applied metrics. The point is that there should be a “fit” between the mix of approaches
selected and the policy priorities of a particular state or system.
b. Relationship to other components of the educational pipeline
Value-added learning assessment cannot exist in a vacuum, and thus, must be structured so that it complements other quality assurance mechanisms in the
educational pipeline. How would a value-added program relate to high school exit or college admissions and placement exams? In many states, such questions
call for the engagement of P–16 entities (provided they are active and influential) in developing, testing, and implementing a comprehensive value-added
system.
c. Linkages across higher education policy
The vitality and success of a value-added program also will depend on its connection to key areas of policy, including:
Accountability—What weight should be given to value-added metrics, particularly in relation to existing outcome measures (e.g. persistence,
graduation, and post-baccalaureate placement rates)?
Finance—Will (or should) the data gleaned from such a program play a role in funding allocation (either base or supplemental)? If so, to what
extent?
Access and Inclusion—In what ways can value-added assessment data be used to assess and recalibrate state and system policies designed to promote
college participation and success for historically underrepresented and disadvantaged groups?
Economic/Workforce Development—How can assessment findings be used to create a feedback loop with state economic development organizations and
the private sector regarding the fit between what colleges and universities are producing and what the state needs or will need?
Implementation
Committing to a comprehensive value-added system requires significant and sustained investment of resources, as well as an awareness of potential
roadblocks. When considering the implementation of assessment, it is important for elected officials and higher education leaders to bear in mind that fiscal
and practical considerations have historically stood as the most prevalent stumbling blocks to fuller exploration of a systematic approach to value-added
assessment.
a. Resources
The experiences of the National Forum on College-Level Learning and similar initiatives reinforce that while effective learning assessment requires significant
and sustained financial investment, securing consistent policymaker support is even more critical—and often difficult. States and systems should
think broadly in terms of securing needed leadership and logistical support. This may include the reallocation of existing resources from obsolete or
lower priority accountability functions.
b. Participation
As with any new initiative, building a critical mass of interest and substantive involvement can be a challenge. What level of student/stakeholder participation
is necessary for the selected assessment measures to be credible? How can that level of participation be garnered and maintained? How can the quality
of stakeholder participation be assured, particularly if it is voluntary or not linked to academic advancement? Can sufficient statistical samples be
developed and maintained across different groups (e.g. first generation, low income, racial/ethnic minorities) to accurately gauge differences in perception
and performance? The National Forum on College-Level Learning has highlighted this as a key issue and states and systems must develop a participation
strategy as part of its implementation process.
c. Application
How the value-added program is applied across a wide range of campuses in a system will greatly affect its utility and relevance. For example, will the
program establish goals or benchmarks for institutions, either for individual measures or for a composite of measures? Will those goals/benchmarks account
for differences in institutional mission and admissions selectivity? How will the resulting data be presented and communicated, particularly to ensure
that they are understood by and useful to a broad array of internal and external stakeholders? |
 |
 |
 |
|
 |