心理发展与教育 ›› 2012, Vol. 28 ›› Issue (3): 329-336.

• 论文 • 上一篇    

多维Rasch模型在维度分数报告中的应用—对带宽-保真度困境的解决

曾平飞1, 余娜1, 辛涛1, 王烨晖2   

  1. 1. 北京师范大学发展心理研究所, 北京 100875;
    2. 北京师范大学认知神经科学与学习国家重点实验室, 北京 100875
  • 出版日期:2012-05-15 发布日期:2012-05-15
  • 通讯作者: 辛涛,E-mail:xintao@bnu.edu.cn E-mail:xintao@bnu.edu.cn

An M-Rasch Approach to Domain Score Reporting: Solution to Bandwidth-fidelity Dilemma

ZENG Ping-fei1, YU Na1, XIN Tao1, WANG Ye-hui2   

  1. 1. Institute of Developmental Psychology, Beijing Normal University, Beijing 100875;
    2. State Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing 100875
  • Online:2012-05-15 Published:2012-05-15

摘要: 分别采用四维度和十五维度Rasch模型分析包含项目内多维度结构的科学测验数据,估计两种维度结构下维度分数的信度.结果表明,对比相应的单维模型而言,四维度与十五维度Rasch模型均能够极大提高各内容维度上分数估计的信度.四维度与十五维度Rasch模型拟合结果的比较表明,对于总长度固定的测验,维度数目的增加能够补偿子维度长度减少引起的信度损失.但是这一作用必须以维度间较高的相关性为前提.

关键词: 大规模教育测量, 带宽-保真度困境, 多维Rasch模型, 项目内多维度

Abstract: Due to broad content coverage and limited testing time,the large scale assessment is challenged by the bandwidth-fidelity dilemma.This study is to explore how the multi-dimensional Rasch model would improve reliability in the within-item multi-dimensionality data.The results demonstrate that both 4-dimensional and 15-dimensional Rasch models fit data,which supports the construct validity of the test.The uni-dimensional model underestimates the correlation between domains due to measurement error.The multi-dimensional Rasch analysis yields a higher level of measurement precision and a more appropriate estimate for the correlation between domains as compared to uni-dimensional approach.The comparison between 4-dimension and 15-dimension analysis shows that the increase on the number of dimensions can compensate the effect of scale length reduction to a certain extent,as long as the correlations between the specific domain and the others are relatively high enough.In conclusion,the multi-dimensional Rasch analysis yields more reliable domain score than uni-dimensional Rasch model in the within-item multidimenionality context.For the test with fixed length,reliable scores can be reported on the more specified content domains,as long as there are high correlations between domains.

Key words: large scale assessment, bandwidth-fidelity dilemma, multi-dimensional Rasch model, within item dimensionality

中图分类号: 

  • B841
[1] Adams,R.J.,Wilson,M.,& Wang,W.C.(1997).The multidimensional random coefficients multinomial logit model.AppliedPsychological Measurement,21,1-23.
[2] Bock,R.D.,& Mislevy,R.J.(1981).An item response curve model for matrix-sampling data:The California grade-three assessment.New Directions for Testing and Measurement,10,65-90.
[3] Bock,R.D.,Thissen,D.,& Zimowski,M.F.(1997).IRT estimation of domain scores.Journal of educational measurement,34(3),197-211.
[4] Cheng,Y.Y.,Wang,W.C.,& Ho,Y.-H.(2009).
[5] Multidimensional Rasch analysis of a psychological test with multiple subtests:A statistical solution for the bandwidth——fidelity dilemma.Educational and Psychological Measurement,69(3),369-388.
[6] Cronbach,L.J.,& Gleser,G.C.(1965).Psychological tests and personnel decisions.Urbana:University of Illinois Press Urbana.Kahraman,N.,& Kamata,A.(2004).Increasing the precision of subscale scores by using out-of-scale information.Applied Psychological Measurement,28(6),407-426.
[7] Murphy,K.R.(1993).Honesty in the workplace.Pacific Grove:Brooks/Cole Pub Co.
[8] Ones,D.S.,& Viswesvaran,C.(1996).Bandwidth-fidelity dilemma in personality measurement for personnel selection.Journal of Organizational Behavior,17(6),609-626.
[9] Organization for Economic Co-operation and Development.(2005).The PISA 2003 technical report.Paris:Author.
[10] Pommerich,M.,Nicewander,W.A.,& Hanson,B.A.(1999).Estimating average domain scores.Journal of Educational Measurement,36(3),199-216.
[11] Sheng,Y.,& Wikle,C.K.(2008).Bayesian multidimensional IRT models with a hierarchical structure.Educational and Psychological Measurement,68(3),413-430.
[12] Wainer,H.,Sheehan,K.M.,& Wang,X.(2000).Some paths toward making praxis scores more useful.Journal of Educational Measurement,37(2),113-140.
[13] Wang,W.C.,Chen,P.H.,& Cheng,Y.Y.(2004).Improving measurement precision of test batteries using multidimensional item response models.Psychological Methods,9(1),116-135.
[14] Wright,B.D.,Linacre,J.M.,Gustafson,J.E.,& Martin-Lof,P.(1994).Reasonable mean-square fit values.Rasch Measurement Transactions,8(3),370.
[15] Wright,B.D.,& Mok,M.(2000).Understanding Rasch measurement:Rasch models overview.Journal of Applied
[16] Measurement,1(1),83-106.
[17] Wu,M.L.,Adams,R.J.,& Wilson,M.R.(1998).ConQuest [Computer software and manual].Camberwell,Victoria,Australia:Australian Council for Educational Research.
[18] Yao,L.,& Boughton,K.A.(2007).A multidimensional item response modeling approach for improving subscale proficiency estimation and classification.Applied Psychological Measurement,31(2),83-105.
[19] Yen,W.M.(1987).A Bayesian/IRT index of objective performance.Paper presented at the annual meeting of the Psychometric Society,Montreal,Quebec,Canada.
[20] 李凌艳,辛涛,董奇.(2007).矩阵取样技术在大尺度教育测评中的运用.北京师范大学学报(社会科学版),(6),19-25.
[1] 唐文清, 方杰, 蒋香梅, 张敏强. 追踪研究方法在国内心理研究中的应用述评[J]. 心理发展与教育, 2014, 30(2): 216-224.
[2] 张学新. 用公评审稿促进中国科技期刊的快速发展[J]. 心理发展与教育, 2013, 29(1): 109-111.
[3] 黎光明, 张敏强. 概化理论方差分量估计的跨分布分析[J]. 心理发展与教育, 2012, 28(6): 665-672.
[4] 康廷虎, 白学军. 任务类型和性别属性对人际信任的影响[J]. 心理发展与教育, 2012, 28(5): 456-462.
[5] 马文超, 边玉芳, 骆方. 网络成瘾的潜在结构:连续的还是分类的?[J]. 心理发展与教育, 2012, 28(5): 554-560.
[6] 辛涛, 谢敏. 群体水平领域分数及其估计方法[J]. 心理发展与教育, 2010, 26(4): 416-422.
[7] 贾宁, 白学军, 沈德立. 学习判断准确性的研究方法[J]. 心理发展与教育, 2006, 22(3): 103-109.
[8] 王登峰, 崔红, 胡军生, 陈侠. 中国青少年人格量表(QZPS-Q)的编制[J]. 心理发展与教育, 2006, 22(3): 110-115.
[9] 郝春东, 刘晓燕. 延迟满足的研究方法、理论及现状[J]. 心理发展与教育, 2006, 22(3): 120-124,128.
[10] 焦丽亚, 辛涛. 基于CTT的锚测验非等组设计中四种等值方法的比较研究[J]. 心理发展与教育, 2006, 22(1): 97-102.
[11] 张玲玲, 张文新, 纪林芹, Jari-Erik Nurmi. 青少年未来取向问卷中文版的测量学分析[J]. 心理发展与教育, 2006, 22(1): 103-108.
[12] 王永丽, 林崇德, 俞国良. 儿童社会生活适应量表的编制与应用[J]. 心理发展与教育, 2005, 21(1): 109-114.
[13] 李虹, 梅锦荣. 测量大学生的心理问题:GHQ-20的结构及其信度和效度[J]. 心理发展与教育, 2002, 18(1): 75-79.
[14] 罗清旭, 杨鑫辉. 《加利福尼亚批判性思维倾向问卷》中文版的初步修订[J]. 心理发展与教育, 2001, 17(3): 47-51.
[15] 高琨, 邹泓. 处境不利儿童的友谊关系研究[J]. 心理发展与教育, 2001, 17(3): 52-55.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!