Item Response Theory

description7,777 papers

group27,214 followers

lightbulbAbout this topic

Item Response Theory (IRT) is a statistical framework used in psychometrics to model the relationship between individuals' latent traits and their item responses on assessments. It focuses on understanding how specific characteristics of test items influence the probability of a correct response, allowing for the evaluation of both item and test-taker abilities.

lightbulbAbout this topic

Key research themes

1. How can Item Response Tree models capture complex response processes beyond traditional IRT outcomes?

This research theme explores advances in Item Response Theory (IRT) that model the internal cognitive or psychological decision processes influencing item response selection. Beyond assessing terminal item responses, item response tree models characterize sequential, nested, and multidimensional decision-making pathways. This detailed modeling offers nuanced insights into psychological assessments, response omissions, and the structure of Likert-type scale responses, addressing limitations of classical IRT models that treat responses as flat outcome categories.

A generalized item response tree model for psychological assessments

by Paul De Boeck

2023, Behavior research methods

Key finding: Jeon et al. (2015) introduce a generalized item response tree (IRT) model that flexibly incorporates node-specific parametric forms, dimensionality, and covariates, allowing models to capture complex decision processes in... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

2. What advantages does Item Response Theory offer over Classical Test Theory in psychological test development and measurement precision?

This research area investigates the methodological and practical benefits of Item Response Theory (IRT) compared to Classical Test Theory (CTT) in psychological and educational assessments. It focuses on how IRT models provide invariant item and person parameters, permit precise measurement precision quantification at varying trait levels, and support refined test development practices. These advantages are crucial for improving test validity, reliability, and interpretability, especially in scales with graded responses.

An application of item response theory to psychological test development

by Claudio Hutz

2022, Psicologia: Reflexão e Crítica

Key finding: This study applies the graded response model (GRM) of IRT to positive and negative affect scales, demonstrating that IRT estimates person abilities invariantly across different test forms and reveals item discrimination... Read more

articleView Paper downloadDownload

Tackling measurement problems with Item Response Theory: Principles, characteristics, and assessment, with an illustrative example

by Jagdip Singh

2017

Key finding: This paper articulates fundamental measurement issues in psychology and marketing research that IRT can address more effectively than CTT, such as balancing reliability and construct validity and handling item wording... Read more

articleView Paper downloadDownload

Application of Item Response Theory as a Modern Statistical Tool to Test Item Development and Analysis

by Solomon C H U K W U Ohiri

2023, International Journal of Advanced Research in Science, Communication and Technology

Key finding: This paper outlines the advantages of IRT in educational and psychological test development, emphasizing its probabilistic modeling of item responses as functions of latent traits and its ability to address inherent... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

3. How can item-fit and model-data fit be accurately assessed in IRT to identify aberrant items and improve measurement validity?

This theme focuses on the development and evaluation of statistical methods for assessing the fit of IRT models at the item level, crucial for ensuring accurate parameter estimation and valid test scores. It compares chi-square and entropy-based techniques, investigates challenges with traditional fit statistics due to model dependency and sample-specific grouping, and explores computational innovations to provide more precise diagnostics of item misfit, enabling enhanced item selection and test calibration.

An Investigation of Chi-Square and Entropy Based Methods of Item-Fit Using Item Level Contamination in Item Response Theory

by brandi weiss

2023, Journal of Modern Applied Statistical Methods

Key finding: This study compares several item-fit statistics including EMRj, traditional chi-square (X2), likelihood ratio (G2), S-X2, and PV-Q1 through Monte Carlo simulations mimicking item-level misfit scenarios under a 2PL IRT model.... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

4. What computational methods and software can enhance IRT parameter estimation for complex models and simulation studies?

This area addresses the challenges of flexible IRT model estimation using Bayesian methods and computational resources. It covers implementation strategies using BUGS-language software for various common and extended IRT models, enabling customization for longitudinal or multi-level data structures. It also examines automation with R scripting to conduct large-scale simulation studies using stand-alone software packages, streamlining iterative model fitting and fit metric extraction critical for psychometric research.

BUGS Code for Item Response Theory

by Luis Pineda

2018

Key finding: The paper presents detailed Bayesian modeling code in BUGS language for fitting common IRT models including the 2PL, 3PL, graded response, generalized partial credit, testlet, and generalized testlet models. It highlights the... Read more

articleView Paper downloadDownload

Automating Simulation Research for Item Response Theory using R

by Youn-Jeng Choi

2018

Key finding: Lee demonstrates methods to automate complex and large-scale IRT simulation studies by leveraging R's scripting capabilities to generate datasets, prepare software inputs, invoke stand-alone IRT estimation tools (like... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

5. How can IRT be extended and applied to continuous and polytomous response data while accounting for measurement constraints?

This theme investigates the modeling of non-dichotomous responses in IRT, including continuous measurements such as response times and Likert-scale data. It explores latent trait models that incorporate distributional restrictions (e.g., response boundedness), extend traditional discrete IRT to continuous domains, and develop threshold models suited for polytomous items. Addressing response scale properties improves model appropriateness, measurement validity, and the treatment of response patterns in diverse assessment contexts.

Latent Trait Item Response Models for Continuous Responses

by Gerhard Tutz

2025, Journal of Educational and Behavioral Statistics

Key finding: The article establishes a general IRT framework for continuous response variables that models responses as functions of latent traits while explicitly accommodating restrictions such as bounded or positive supports. It shows... Read more

articleView Paper downloadDownload

Application of nonparametric item response theory in determining the one-dimensionality and adaptability of TOEFL iBT listening test

by Hamed Ghaemi

2024, Language Testing in Asia

Key finding: This study applies Nonparametric Item Response Theory (NIRT) methods, specifically the Mokken and Dominance Models, to examine the one-dimensionality and invariant item ordering assumptions of the TOEFL iBT listening test.... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

All papers in Item Response Theory

Diagnosing Student Node Mastery: Impact of Varying Item Response Modeling Approaches

by Susan Embretson

2025, Frontiers in Education

An important feature of learning maps, such as Dynamic Learning Maps and Enhanced Learning Maps, is their ability to accommodate nation-wide specifications of standards, such as the Common Core State Standards, within the map nodes along... more

descriptionView Paper arrow_downwardDownload

Effects of Reducing the Cognitive Load of Mathematics Test Items on Student Performance

by Susan Embretson

2025, Numeracy

This study explores a new item-writing framework for improving the validity of math assessment items. The authors transfer insights from Cognitive Load Theory (CLT), traditionally used in instructional design, to educational measurement.... more

descriptionView Paper arrow_downwardDownload

Psychometric Approaches to Understanding and Measuring Intelligence

by Susan Embretson

2025, Handbook of Intelligence

1~11f<! h'~1 \' ~~, , 1'11;1 ".11 ~~.'UI ~~ ., I''''~; I., .. "" "'1100"", tt lIu ,,~, 'lii0i , ' t,IIw ' "I '~""I ..... ~ "I" *"11111 ""I liN !I~ .. "" ""111\ ~"I ~ "' 1" q .... ,' "., t'l'l •• Ufll lit", I '11'1 t III "I '"'''' 1-.... more

descriptionView Paper arrow_downwardDownload

Additive Multilevel Item Structure Models with Random Residuals: Item Modeling for Explanation and Item Generation

by Susan Embretson

2025, Psychometrika

An additive multilevel item structure (AMIS) model with random residuals is proposed. The model includes multilevel latent regressions of item discrimination and item difficulty parameters on covariates at both item and item category... more

descriptionView Paper arrow_downwardDownload

Designing Cognitive Complexity in Mathematical Problem-Solving Items

by Susan Embretson

2025, Applied Psychological Measurement

Cognitive complexity level is important for measuring both aptitude and achievement in large-scale testing. Tests for standards-based assessment of mathematics, for example, often include cognitive complexity level in the test blueprint.... more

descriptionView Paper arrow_downwardDownload

Can Gender and Age Impact on Response Pattern of Depressive Symptoms Among College Students? A Differential Item Functioning Analysis

by Antonio Reis de Sá Junior

2025, Frontiers in Psychiatry

Background: Self-reported depressive complaints among college students might indicate different degrees of severity of depressive states. Through the framework of item response theory, we aim to describe the pattern of responses to items of the Beck Depression Inventory-II (BDI-II), in terms of endorsement probability and discrimination along the continuum of depression. Potential differential item functioning of the scale items of the BDI-II is investigated, by gender and age, to compare across sub-groups of students. The 21-item BDI-II was cross-sectionally administered to a representative sample of 12,677 Brazilian college students. Reliability was evaluated based on Cronbach's alpha coefficient. Severity (b i ) and discrimination (a) parameters of each BDI-II items were calculated through the graded response model. The influence of gender and age were tested for differential item functioning (DIF) within the item response theory-based approach. Results: The BDI-II presented good reliability (α = 0.91). Women and younger students significantly presented a higher likelihood of depression (cut-off > 13) than men and older counterparts. In general, participants endorsed more easily cognitive-somatic items than affective items of the scale. "Guilty feelings," "suicidal thoughts," and "loss of interest in sex" were the items that most likely indicated depression severity (b ≥ 3.60). However, all BDI-II items showed moderate-to-high discrimination (a ≥ 1.32) for depressive state. While two items were flagged for DIF, "crying" and "loss of interest in sex," respectively for gender and age, the global weight of these items on the total score was negligible. Conclusions: Although respondents' gender and age might present influence on response pattern of depressive symptoms, the measures of self-reported symptoms have not inflated severity scores. These findings provide further support to the validity of using BDI-II for assessing depression in academic contexts and highlight the value of considering gender-and age-related common symptoms of depression.

descriptionView Paper arrow_downwardDownload

Response pattern of depressive symptoms among college students: What lies behind items of the Beck Depression Inventory-II?

by Antonio Reis de Sá Junior

2025, Journal of affective disorders

This study examines the response pattern of depressive symptoms in a nationwide student sample, through item analyses of a rating scale by both classical test theory (CTT) and item response theory (IRT). The 21-item Beck Depression... more

descriptionView Paper arrow_downwardDownload

A Mixture Rasch Model–Based Computerized Adaptive Test for Latent Class Identification

by Hong Jiao

2025, Applied Psychological Measurement

This study explored a computerized adaptive test delivery algorithm for latent class identification based on the mixture Rasch model. Four item selection methods based on the Kullback–Leibler (KL) information were proposed and compared... more

descriptionView Paper arrow_downwardDownload

Validación para la utilización en Colombia de la escala EORTC QLQ-STO22 para la evaluación de la calidad de vida de pacientes con cáncer de estómago

by Claudia Ibáñez-Antequera

2025, Avances en Psicología Latinoamericana

su disposición y apoyo al dirigir este trabajo de grado. Al Dr. Ricardo Oliveros por su valiosa contribución con la evaluación clínica de los pacientes.

descriptionView Paper arrow_downwardDownload

Validation of the Brief Version of the Recovery Self-Assessment (RSA-B) Using Rasch Measurement Theory

by Larry Davidson

2025, Psychiatric rehabilitation journal

In psychiatry, the recovery paradigm is increasingly identified as the overarching framework for service provision. Currently, the Recovery Self-Assessment (RSA), a 36-item rating scale, is commonly used to assess the uptake of a recovery... more

descriptionView Paper arrow_downwardDownload

How Item Banks and Their Application Can Influence Measurement Practice in Rehabilitation Medicine: A PROMIS Fatigue Item Bank Example

by David Cella

2025, Archives of Physical Medicine and Rehabilitation

Objective-To illustrate how measurement practices can be advanced using as an example the fatigue item bank (FIB) and its applications (short-forms and computerized adaptive test) that were developed via the NIH Patient Reported Outcomes... more

descriptionView Paper arrow_downwardDownload

Item response theory analyses of the Delis-Kaplan Executive Function System card sorting subtest

by Sun-Joo Cho

2025, Child Neuropsychology

In the current study, we examined the dimensionality of the 16-item Card Sorting subtest of the Delis-Kaplan Executive Functioning System assessment in a sample of 264 native Englishspeaking children between the ages of 9 and 15 years. We... more

descriptionView Paper arrow_downwardDownload

Measuring Intervention Effectiveness: The Benefits of an Item Response Theory Approach

by Sun-Joo Cho

2025, Society for Research on Educational Effectiveness

Assessing the effectiveness of educational interventions relies on quantifying differences between interventions groups over time in a between-within design 1 . Binary outcome variables (e.g., correct responses versus incorrect responses)... more

descriptionView Paper arrow_downwardDownload

Item response theory analyses of the Delis-Kaplan Executive Function System card sorting subtest

by Sun-Joo Cho

2025, Child neuropsychology : a journal on normal and abnormal development in childhood and adolescence

In the current study, we examined the dimensionality of the 16-item Card Sorting subtest of the Delis-Kaplan Executive Functioning System assessment in a sample of 264 native English-speaking children between the ages of 9 and 15 years.... more

descriptionView Paper arrow_downwardDownload

Rheumatoid arthritis, item response theory, Blom transformation, and mixed models

by Ping An

2025, BMC Proceedings

descriptionView Paper arrow_downwardDownload

Rheumatoid arthritis, item response theory, Blom transformation, and mixed models

by Ping An

2025, BMC Proceedings

We studied rheumatoid arthritis (RA) in the North American Rheumatoid Arthritis Consortium (NARAC) data (1499 subjects; 757 families). Identical methods were applied for studying RA in the Genetic Analysis Workshop 15 (GAW15) simulated... more

descriptionView Paper arrow_downwardDownload

The Effects of Differential Item Functioning on Predictive Bias

by Damon Bryant

2025

The purpose of this research was to investigate the relation between measurement bias at the item level (differential item functioning, DIF) and predictive bias at the test score level. DIF was defined as a difference in the probability... more

descriptionView Paper arrow_downwardDownload

Comment on “Modified nonequilibrium molecular dynamics for fluid flows with energy conservation” [J. Chem. Phys. 106, 5615 (1997)]

by Carol Hoover

2025, The Journal of Chemical Physics

In their recent paper and the associated Response to this Comment, Tuckerman et al. dispute the form of the Liouville equation, as proposed by Liouville in 1838. They go on to introduce a definition of the entropy which is at variance... more

descriptionView Paper arrow_downwardDownload

Leading birds by their beaks: the response of flocks to external perturbations

by Nikos Kyriakopoulos

2025, New Journal of Physics

We study the asymptotic response of polar ordered active fluids ("flocks") to small external aligning fields h. The longitudinal susceptibility χ diverges, in the thermodynamic limit, like h -ν as h → 0. In finite systems of linear size... more

descriptionView Paper arrow_downwardDownload

Sample Size and Test Length Minima for DIMTEST with Conditional Covariance -Based Subtest Selection

by Derek Fay

2025

The existing minima for sample size and test length recommendations for DIMTEST (750 examinees and 25 items) are tied to features of the procedure that are no longer in use. The current version of DIMTEST uses a bootstrapping procedure to... more

descriptionView Paper arrow_downwardDownload

Sample Size and Test Length Minima for DIMTEST with Conditional Covariance -Based Subtest Selection

by Derek Fay

2025

descriptionView Paper arrow_downwardDownload

A Bayesian Approach for Item Response Theory in Assessing the Progress Test in Medical Students

by Zeynep ÖZTÜRK

2025

The progress test is used to provide useful summative and formative judgments about medical students' knowledge without distorting learning. The test samples the complete knowledge domain expected of new graduates on completion of... more

descriptionView Paper arrow_downwardDownload

Habilidades Psicológicas Deportivas y estados de ánimo en jugadores peruanos de Quadball (Quidditch)

by Revista de Psicología Aplicada al Deporte y al Ejercicio Físico

2025, Revista de Psicología Aplicada al Deporte y al Ejercicio Físico

RESUMEN: El objetivo del presente trabajo fue describir, comparar según género y relacionar las habilidades psicológicas deportivas y el estado de ánimo en deportistas peruanos de Quadball (Quiddicht). La muestra estuvo conformada por 43... more

descriptionView Paper arrow_downwardDownload

Application of the multidimensional 4-parameter logistic model in the estimation of the psychometric qualities of the West African Senior School Certificate chemistry examination

by Uduak Utibe and

2025, Uduak James Utibe

In the study reported on here we assessed the dimensionalities and trends in psychometric qualities of the West African Senior School Certificate chemistry examination (WASSCCE) by applying a multidimensional 4-parameter logistic model of... more

descriptionView Paper arrow_downwardDownload

Rasch Analysis of the University Student Depression Inventory (USDI) Using the Polytomous Partial Credit Model

by Sherwin E Balbuena

2025, The Philippine journal of science

Artykuł przedstawia w szerokiej perspektywie poglądy na samobójstwo, które jest traktowane jako apogeum nieprzyjaznej postawy żywionej wobec siebie, a tym samym staje się przykładem skrajnego niebezpieczeństwa egzystencjalnego. Tłem... more

descriptionView Paper arrow_downwardDownload

Assessment of Gender-related Differential Item Functioning of Teacher-Made Chemistry test

by Simeon Ariyo

2025, The African Journal of Behavioural and Scale Development Research

A good item that will measure the intended domain is expected to be free of biases. But several studies have confirmed that some items in a test reveal biases due to a group of testees.. A generally acceptable analytical technique that... more

descriptionView Paper arrow_downwardDownload

Forgetting curves: implications for connectionist models

by Sverker Sikström

2025, Cognitive Psychology

Forgetting in long-term memory, as measured in a recall or a recognition test, is faster for items encoded more recently than for items encoded earlier. Data on forgetting curves fit a power function well. In contrast, many connectionist... more

descriptionView Paper arrow_downwardDownload

EuleApp©: a computerized adaptive assessment tool for early literacy skills

by Haug Leuschner

2025, Frontiers in Psychology

Introduction: Ample evidence indicates that assessing children's early literacy skills is crucial for later academic success. This assessment enables the provision of necessary support and materials while engaging them in the culture of... more

descriptionView Paper arrow_downwardDownload

Enhancing Large-Scale Mathematical Assessments: Integrating Cognitive Diagnostic Models with Hierarchical Attribute Structures for Improved Student Diagnostics

by Farshad Effatpanah

2025, Frontier Research in Educational Measurement (FREMO), Centre for Educational Measurement (CEMO), University of Oslo, Olso, Norway

Cognitive diagnostic models (CDMs) provide a fine-grained analysis of students' cognitive abilities by determining their mastery or non-mastery of specific attributes. CDMs have been retrofitted to existing non-diagnostic (inter)national large-scale standardized mathematics assessments. However, in the absence of a cognitive development framework for constructing test items in retrofitting studies, an inferred substantive model and a Q-matrix are typically constructed to define the attributes measured in the test. This indirect approach can result in information loss, less precise modeling of cognitive attributes, and inaccurate student classifications. This study uses a hierarchical cognitive diagnostic model (HCDM) to explore the feasibility of incorporating cognitive models into the development of large scale assessment items that inherently generate Q-matrices for CDM analysis. It examines whether integrating theoretical assumptions about hierarchical relationships between mathematical attributes can improve the accuracy and effectiveness of the model. In contrast to previous CDM studies, an eight-attribute Q-matrix was systematically designed. Items were constructed based on the Q-matrix, which was derived from curriculum, cognitive models, and specified attribute hierarchies. The test was administered to 5,336 third-grade students in Luxembourg. The HCDM was evaluated against the G-DINA model. The results showed that: (1) the mastery rates derived from the HCDM align more closely with the developmental progression of mathematical ability, supporting the notion that theoretically grounded Q-matrix design yields more interpretable latent classes; (2) incorporating hierarchical attribute relationships enhances the effectiveness of diagnostic assessments by generating more meaningful and accountable classifications of student proficiency; and (3) the HCDM aligns more closely with didactic theories of mathematical development, because it better captures the structured progression of learning, where mastery of foundational skills is necessary for acquiring more complex competencies. This study underscores the importance of integrating cognitive models into assessment design and highlights the advantages of using HCDMs to improve large-scale educational diagnostics.

descriptionView Paper arrow_downwardDownload

Advances in Rasch Modeling: New Applications and Directions: Guest Editorial

by Mark Moulton

2025, Psychological test and assessment modeling

In 1960 Georg Rasch helped open the field of Item Response Theory by the model that bears his name, distinguished by the use of a single parameter to model the relationship between item difficulty and person ability. Various extensions of... more

descriptionView Paper arrow_downwardDownload

Developing and measuring IS scales using item response theory

by Paul Benjamin Lowry

2025

Information Systems (IS) research frequently uses survey data to measure the interplay between technological systems and human beings. Researchers have developed sophisticated procedures to build and validate multi-item scales that... more

descriptionView Paper arrow_downwardDownload

Computational Strategies and Estimation Performance With Bayesian Semiparametric Item Response Theory Models

by Abel Rodríguez

2025, Journal of Educational and Behavioral Statistics

Her research focuses on the development of Bayesian nonparametric models for single density estimation and regression modeling on compact spaces, time dynamic point processes, and diagnostic test validation. She is also the main developer... more

descriptionView Paper arrow_downwardDownload

An Evaluation of the Online Social Learning Environment Instrument (OSLEI) Using Rasch Model Analysis

by Noor Hidayah Che Lah

2025

One of the questionnaires that will be used to evaluate social learning environments such as Facebook is the Online Social Learning Environment Instrument (OSLEI). The aim of this study was to evaluate the OSLEI using alternative method... more

descriptionView Paper arrow_downwardDownload

Examining the 'hawk-dove effects' in portfolio assessment using the multi-facet Rasch model

by International Journal of Evaluation and Research in Education (IJERE)

2025, International Journal of Evaluation and Research in Education (IJERE)

Concerns among students have increased due to the use of test scores in decision-making, leading them to question whether their results accurately reflect their abilities, especially when they perceive subjectivity in rater scoring. This... more

descriptionView Paper arrow_downwardDownload

Development and validation of the principals’ digital leadership instrument using Rasch measurement model

by International Journal of Evaluation and Research in Education (IJERE)

2025, International Journal of Evaluation and Research in Education (IJERE)

This study addresses the critical need for robust measurement tools in digital leadership (DL) within educational settings—a topic of increasing relevance but limited research. Using the Rasch model measurement analysis, the study aims to... more

descriptionView Paper arrow_downwardDownload

Parameterization of Teacher-made Physics Achievement Test Using Deterministic-Input-Noisy-and-Gate (DINA) Model

by Fidelis O B I Nnadi

2025, Journal of Education and Practice

Traditional methods of test parameterization have been found defective in terms of assuming one score and not providing information on skills mastery profile of the examinees, in addition to non-estimation of the fourth parameter-slipping... more

descriptionView Paper arrow_downwardDownload

Evaluation of Differential Distractor Functioning of Physics Achievement Battery for Quality Assurance Using Multinomial Log-linear Model

by Fidelis O B I Nnadi

2025, International Journal of Modern Management Sciences

School examinations including Physics have been fraught with biased questions. Equality in the nature of examination questions is not attained between focal and reference test-takers. This makes assessment of the learners' knowledge of... more

descriptionView Paper arrow_downwardDownload

Vers une évaluation analytique des interfaces homme machine développées dans le contexte des habitats intelligents

by Belkacem Chikhaoui

2025

The author has granted a nonexclusive license allowing Library and Archives Canada to reproduce, publish, archive, preserve, conserve, communicate to the public by telecommunication or on the Internet, loan, distribute and sell theses... more

descriptionView Paper arrow_downwardDownload

EVALUATION OF WEST AFRICAN EXAMINATIONS COUNCIL'S PHYSICS ESSAY TEST USING PARTIAL CREDIT ITEM RESPONSE THEORY MODEL

by Fidelis O B I Nnadi

2025, Journal of Science Education, ESUT

The poor state of secondary school students' achievement relative to policy expectation in Physics triggered the study. At the level of instrument quality, the scoring pattern in West African Senior School Certificate Examination's... more

descriptionView Paper arrow_downwardDownload

COMPARATIVE EFFECT OF NUMBER RIGHT AND CORRECTED METHODS OF SCORING MULTIPLE CHOICES IN SOCIAL STUDIES TEST BY

by VALADA ALEX

2025, Valada Alex

This study compared the effectiveness of Number Right method and the Corrected Method of scoring multiple choice items in social studies. The purpose of the study was to determine students' performance, gender differences and compare the... more

descriptionView Paper arrow_downwardDownload

Application of the Mantel-Haenszel Procedure to Complex Samples of Items

by John Donoghue

2025, ETS Research Report Series

This Monte Carlo study examined the effect of complex sampling of items on the measurement of differential item functioning (DIF) using the Mantel-Haenszel procedure. Data were generated using a three-parameter logistic item response... more

descriptionView Paper arrow_downwardDownload

Estimating Parameters in the Generalized Graded Unfolding Model: Sensitivity to the Prior Distribution Assumption and the Number of Quadrature Points Used

by John Donoghue

2025

The generalized graded unfolding model (J. Roberts, J. Donoghue, and J. Laughlin, 1998, 1999) is an item response theory model designed to unfold polytomous responses. The model is based on a proximity relation that postulates higher... more

descriptionView Paper arrow_downwardDownload

Estimability of Parameters in the Generalized Graded Unfolding Model

by John Donoghue

2025

The generalized graded unfolding model (GGUM) (J. Roberts, J. Donoghue, and J. is an item response theory model designed to analyze binary or graded responses that are based on a proximity relation. The purpose of this study was to assess... more

descriptionView Paper arrow_downwardDownload

An Investigation of Ordinal True Score Test Theory

by John Donoghue

2025, Applied Psychological Measurement

The validity of the assumptions underlying Cliff's (1989) ordinal true score theory (OTST) were investigated in a three-stage study. OTST makes only ordinal assumptions about the data, and provides a means of converting ordinal item... more

descriptionView Paper arrow_downwardDownload

On the Relative Value of Multiple-Choice, Constructed Response, and Examinee-Selected Items on Two Achievement Tests. Program Statistics Research Technical Report No. 93-28

by Robert Lukhele

2025

Analyses based on fitting item response models to data from the College Board's Advanced Placement exams in Chemistry and United States History indicated that the constructed-response portion of the tests yielded little information over... more

descriptionView Paper arrow_downwardDownload

On the Relative Value of Multiple‐Choice, Constructed Response, and Examinee‐Selected Items on Two Achievement Tests

by Robert Lukhele

2025, Journal of Educational Measurement

Using analyses based on fitting item response models to data from the College Board's Advanced Placement exams in chemistry and United States history, we found that the constructed response portion of the tests yielded little... more

descriptionView Paper arrow_downwardDownload

Evaluation of quality-of-life measures for use in palliative care: a systematic review

by Mecheline van der Linden

2025, Palliative Medicine

Purpose: In this literature review we evaluated the feasibility and clinimetric quality of quality-of-life (QoL) measurement instruments suitable for use in palliative care. Methods: We conducted a systematic literature review to identify... more

descriptionView Paper arrow_downwardDownload

Multidimensionalitas pada tes potensi akademik

by Ali Ridho

2025

The aim of this research study was to find out characteristics of items and subtests of Tes Potensi akademik (TP) College Admissions (ujian masuk, UM) UGM 2006 approached by unidimensional and multidimensional item response theory with 3... more

descriptionView Paper arrow_downwardDownload

Suicide attempts among men and women with partner violence according to borderline personality status

by Kenneth Elliott

2025, Innovations in clinical neuroscience

descriptionView Paper arrow_downwardDownload

National Standards: Yesterday, Today, and Tomorrow

by Beth Kania-Gosche

2025, Critical Questions in Education

Educators in the United States continue to struggle with the disparity in academic achievement of their students and with the ever-increasing emphasis on meeting Adequate Yearly Progress, for No Child Left Behind. Looking at data from the... more

descriptionView Paper arrow_downwardDownload

Item Response Theory

Key research themes

1. How can Item Response Tree models capture complex response processes beyond traditional IRT outcomes?

2. What advantages does Item Response Theory offer over Classical Test Theory in psychological test development and measurement precision?

3. How can item-fit and model-data fit be accurately assessed in IRT to identify aberrant items and improve measurement validity?

4. What computational methods and software can enhance IRT parameter estimation for complex models and simulation studies?

5. How can IRT be extended and applied to continuous and polytomous response data while accounting for measurement constraints?

Related Topics

All papers in Item Response Theory