Psychological Development and Education, 2010, Vol. 26, Issue (4): 416-422.

• Article

Group-level Domain Score and Its Estimation Methods

Xin Tao, Xie Min

  1. Institute of Developmental Psychology, School of Psychology, Beijing Normal University, Beijing 100875
  • Published: 2010-07-15  Online: 2010-07-15
  • Corresponding author: Xin Tao, E-mail: xintao@bnu.edu.cn
  • Funding:
    Program for New Century Excellent Talents in University, Ministry of Education (Grant No. NCET-07-0097); National Education Sciences Planning, Special Project for Research on Educational Examination Science

Abstract: The demand for talent has led governments and international organizations to attach great importance to education, and many have carried out large-scale educational assessments at the national and regional levels. In such assessments, how to report student performance to governments, administrators, and the public is an unavoidable and important question. Student performance can be reported in several ways. The domain score is one of the score-reporting tools that administrators and the public find easiest to understand and accept; it has drawn increasing attention from researchers and practitioners in recent years and has become a natural choice for large-scale educational assessment programs. This paper introduces the origin and definition of the group-level domain score, reviews its estimation methods and the related research in detail, and closes with directions for future research.

Key words: large-scale educational assessment, group-level domain score, estimation method, IRT method

CLC number: B841