Development of the data literacy scale in social sciences: A validity and reliability study

Authors

https://doi.org/10.29329/pedper.2026.169

The present study will provide important information for future educational strategies and intervention programs by revealing the current status of undergraduate students in data literacy. The current research is a scale development study. The study group consisted of undergraduate students from 20 universities in Türkiye. Data validity and construct analysis were performed using exploratory and confirmatory factor analyses. Whereas the Kaiser–Meyer–Olkin (KMO) value of 0.967 indicated that the sample was perfect, Bartlett’s test confirmed that the correlations between the items were adequate. Cronbach’s alpha value of 0.973 indicated a very high internal consistency. Furthermore, high reliability was provided with the inter-form correlation of 0.853, the Spearman-Brown coefficient of 0.920, and the Guttman split-half coefficient of 0.919. The exploratory factor analysis revealed that the scale consisted of three sub-dimensions and explained 64.056% of the total variance. The items showed factor loadings above 0.40. The CFA results confirmed that the model represented the three sub-dimensions of data literacy well, and the RMSEA, CFI, IFI, and RFI fit indices were high. Compared with the available scales in the literature, this study makes a significant contribution by presenting a customized, comprehensive measurement tool in the context of the social sciences.

Ackoff, R. L. (1989). From data to wisdom. Journal of Applied Systems Analysis, 16, 3–9. https://faculty.ung.edu/kmelton/documents/datawisdom.pdf

Amidon, D. M. (1997). Innovation strategy for the knowledge economy: The ken awakening. Butterworth-Heinemann. https://doi.org/10.4324/9780080508795

Braeken, J., & van Assen, M. A. L. M. (2017). An empirical Kaiser criterion. Psychological Methods, 22(3), 450–466. https://doi.org/10.1037/met0000074

Brown, T. A. (2015). Confirmatory factor analysis for applied research (2nd ed.). Guilford Press.

Çalışkan, H. (2012). Development of the measurement and evaluation self-efficacy perception scale and the examination of the status of social studies teachers. Energy Education Science and Technology Part B: Social and Educational Studies, 4(1), 1003–1008.

Calzada Prado, J., & Marzal, M. Á. (2013). Incorporating data literacy into information literacy programs: Core competencies and contents. Libri, 63(2), 123–134. https://doi.org/10.1515/libri-2013-0010

Carlson, J., Fosmire, M., Miller, C. C., & Nelson, M. S. (2011). Determining data information literacy needs: A study of students and research faculty. Portal: Libraries and the Academy, 11(2), 629–657. https://doi.org/10.1353/pla.2011.0022

Chen, B., Chang, Y. H., Ouyang, F., & Zhou, W. (2018). Fostering student engagement in online discussion through social learning analytics. The Internet and Higher Education, 37, 21–30. https://doi.org/10.1016/j.iheduc.2017.12.002

Cohen, J., & Cohen, P. (1983). Applied multiple regression/correlation analysis for the behavioral sciences. L. Erlbaum.

D'Ignazio, C., & Bhargava, R. (2015, September). Approaches to building big data literacy. In Proceedings of the Bloomberg Data for Good Exchange Conference (Vol. 6).

Davenport, T. H., & Prusak, L. (2000). Working knowledge: How organizations manage what they know. Harvard Business School Press. https://doi.org/10.1145/348772.348775

Deahl, E. (2014). Better the data you know: Developing youth data literacy in schools and informal learning environments [Master's thesis, Massachusetts Institute of Technology]. https://doi.org/10.2139/ssrn.2445621

De Winter, J. C. F., Dodou, D., & Wieringa, P. A. (2009). Exploratory factor analysis with small sample sizes. Multivariate Behavioral Research, 44, 147–181. https://doi.org/10.1080/00273170902794206

Demirtaş, Ç. (2022). A model proposal for knowledge literacy in social studies education [Doctoral dissertation, Bolu Abant Izzet Baysal University]. YÖK National Thesis Center. https://tez.yok.gov.tr

Demirtaş, Ç. (2024). Data literacy and education: A science mapping study. Participatory Educational Research, 11(3), 220–243. https://doi.org/10.17275/per.24.43.11.3

DeVellis, R. F. (2012). Scale development: Theory and applications (3rd ed.). Sage.

Field, A. (2009). Discovering statistics using SPSS (3rd ed.). Sage.

Floyd, F. J., & Widaman, K. F. (1995). Factor analysis in the development and refinement of clinical assessment instruments. Psychological Assessment, 7(3), 286–299. https://doi.org/10.1037/1040-3590.7.3.286

Fontichiaro, K., & Oehrli, J. A. (2016). Why data literacy matters. Knowledge Quest, 44(5), 21–27.

Gebre, E. H. (2018). Young adults' understanding and use of data: Insights for fostering secondary school students' data literacy. Canadian Journal of Science, Mathematics and Technology Education, 18(4), 330–341. https://doi.org/10.1007/s42330-018-0034-z

Gencer, R., & Altun, A. (2021). Dijitalleşme, bilgi hiyerarşisini değiştirdi mi? (VEBB: veri, enformasyon, bilgi ve bilgelik) [Did digitalization change the knowledge hierarchy? (DIKW: data, information, knowledge and wisdom)]. Diyalektolog – Uluslararası Sosyal Bilimler Dergisi, (27). https://doi.org/10.29228/diyalektolog.52392

Goodwin, L. D. (1999). The role of factor analysis in the estimation of construct validity. Measurement in Physical Education and Exercise Science, 3(2), 85–100. https://doi.org/10.1207/s15327841mpee0302_2

Hair, J. F., Black, W. C., Babin, B. J., & Anderson, R. E. (2014). Multivariate data analysis (7th ed.). Prentice Hall.

Hoogland, I., Schildkamp, K., Van der Kleij, F., Heitink, M., Kippers, W., Veldkamp, B., & Dijkstra, A. M. (2016). Prerequisites for data-based decision making in the classroom: Research evidence and practical illustrations. Teaching and Teacher Education, 60, 377–386. https://doi.org/10.1016/j.tate.2016.07.012

Howard, M. C. (2016). A review of exploratory factor analysis decisions and overview of current practices: What we are doing and how can we improve? International Journal of Human–Computer Interaction, 32(1), 51–62. https://doi.org/10.1080/10447318.2015.1087664

Hu, L. T., & Bentler, P. M. (1998). Fit indices in covariance structure modeling: Sensitivity to underparameterized model misspecification. Psychological Methods, 3(4), 424–453. https://doi.org/10.1037/1082-989X.3.4.424

Hu, L. T., & Bentler, P. M. (1999). Cut-off criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Structural Equation Modeling, 6, 1–55. https://doi.org/10.1080/10705519909540118

İnan, S. (2021). Siyaset okuryazarlığı "yöneten birey olmak ve okullarda siyaset eğitimi mümkün mü?" [Political literacy "Is it possible to be a governing individual and to give political education in schools?"]. Yeni İnsan Yayınevi.

Jeffery, K. (2014, April). Data is the new oil [Conference presentation]. Best Practices for Data Management & Sharing, Joint Research Centre (JRC), Ispra, Italy.

Jöreskog, K. G. (1971). Statistical analysis of sets of congeneric tests. Psychometrika, 36(2), 109–133. https://doi.org/10.1007/BF02291393

Karaman, M. (2023). Keşfedici ve doğrulayıcı faktör analizi: Kavramsal bir çalışma [Exploratory and confirmatory factor analysis: A conceptual study]. Uluslararası İktisadi ve İdari Bilimler Dergisi, 9(1), 47–63. https://doi.org/10.29131/uiibd.1279602

Kelley, J. (2002). Knowledge nirvana: Achieving the competitive advantage through enterprise content management and optimizing team collaboration. Xulon Press.

Kim, J., Hong, L., Evans, S., Oyler-Rice, E., & Ali, I. (2023). Development and validation of a data literacy assessment scale. Proceedings of the Association for Information Science and Technology, 60(1), 620–624. https://doi.org/10.1002/pra2.827

Kippers, W. B., Poortman, C. L., Schildkamp, K., & Visscher, A. J. (2018). Data literacy: What do educators learn and struggle with during a data use intervention? Studies in Educational Evaluation, 56, 21–31. https://doi.org/10.1016/j.stueduc.2017.11.001

Kline, R. B. (2016). Principle and practice of structural equation modelling (4th ed.). Guilford Press.

Koltay, T. (2015). Data literacy: In search of a name and identity. Journal of Documentation, 71(2), 401–415. https://doi.org/10.1108/JD-02-2014-0026

Koyuncu, I., & Kılıç, A. (2019). The use of exploratory and confirmatory factor analyses: A document analysis. Education and Science, 44(198). https://doi.org/10.15390/EB.2019.7665

Lawshe, C. H. (1975). A quantitative approach to content validity. Personnel Psychology, 28(4), 563–575.

Liew, A. (2007). Understanding data, information, knowledge and their inter-relationships. Journal of Knowledge Management Practice, 8(2), 1–16.

López-Meneses, E., Sirignano, F. M., Vázquez-Cano, E., & Ramírez-Hurtado, J. M. (2020). University students' digital competence in three areas of the DigCom 2.1 model: A comparative study at three European universities. Australasian Journal of Educational Technology, 36(3), 69–88. https://doi.org/10.14742/ajet.5583

MacCallum, R. C., Browne, M. W., & Sugawara, H. M. (1996). Power analysis and determination of sample size for covariance structure modeling. Psychological Methods, 1(2), 130–149.

Mahmud, M. M., & Wong, S. F. (2022). Digital age: The importance of 21st century skills among the undergraduates. Frontiers in Education, 7, 950553. https://doi.org/10.3389/feduc.2022.950553

Mandinach, E. B., & Gummer, E. S. (2013). Building educators' data literacy: Differing perspectives. The Journal of Educational Research & Policy Studies, 13(2), 1–5.

Mandinach, E. B., & Gummer, E. S. (2016). What does it mean for teachers to be data literate: Laying out the skills, knowledge, and dispositions. Teaching and Teacher Education, 60, 366–376. https://doi.org/10.1016/j.tate.2016.07.011

Mandinach, E. B., & Schildkamp, K. (2021). Misconceptions about data-based decision making in education: An exploration of the literature. Studies in Educational Evaluation, 69, 100842. https://doi.org/10.1016/j.stueduc.2020.100842

Merriam-Webster. (2024). Data. In Merriam-Webster's collegiate dictionary (12th ed.). https://www.merriam-webster.com/dictionary/data

Nikkhah, M., Heravi-Karimooi, M., Montazeri, A., Rejeh, N., & Sharif Nia, H. (2018). Psychometric properties the Iranian version of older people's quality of life questionnaire (OPQOL). Health and Quality of Life Outcomes, 16, 1–10. https://doi.org/10.1186/s12955-018-0910-y

Öz, S., & Özdemir, A. (2022). Validity and reliability study on the development of data literacy scale for educators. International Journal of Contemporary Educational Research, 9(3), 649–661. https://doi.org/10.33200/ijcer.1079774

Qin, J., & D'Ignazio, J. (2010). Lessons learned from a two-year experience in science data literacy education. In Proceedings of the 31st Annual IATUL Conference. IATUL.

Reisoğlu, İ., & Çebi, A. (2020). How can the digital competences of pre-service teachers be developed? Examining a case study through the lens of DigComp and DigCompEdu. Computers & Education, 156, 103940. https://doi.org/10.1016/j.compedu.2020.103940

Rowe, S., Riggio, M., De Amicis, R., & Rowe, S. R. (2020). Teacher perceptions of training and pedagogical value of cross-reality and sensor data from smart buildings. Education Sciences, 10(9), 234. https://doi.org/10.3390/educsci10090234

Sarmento, R. P., & Costa, V. (2017). Comparative approaches to using R and Python for statistical data analysis. IGI Global. https://doi.org/10.4018/978-1-68318-016-6

Sarmento, R. P., & Costa, V. (2019). Confirmatory factor analysis – A case study [Preprint]. arXiv. https://doi.org/10.48550/arXiv.1905.05598

Schield, M. (2004). Information literacy, statistical literacy and data literacy. IASSIST Quarterly, 28(2/3), 6–11.

Schildkamp, K. (2019). Data-based decision-making for school improvement: Research insights and gaps. Educational Research, 61(3), 257–273. https://doi.org/10.1080/00131881.2019.1625716

Shreiner, T. L., & Guzdial, M. (2022). The information won't just sink in: Helping teachers provide technology-assisted data literacy instruction in social studies. British Journal of Educational Technology, 53(5), 1134–1158. https://doi.org/10.1111/bjet.13255

Small, H. (1973). Co-citation in the scientific literature: A new measure of the relationship between two documents. Journal of the American Society for Information Science, 24(4), 265–269. https://doi.org/10.1002/asi.4630240406

Steiger, J. H. (2007). Understanding the limitations of global fit assessment in structural equation modeling. Personality and Individual Differences, 42(5), 893–898. https://doi.org/10.1016/j.paid.2006.09.017

The CERN Council. (2024a). Who we are: Our mission. https://www.home.cern/about/who-we-are/our-mission

The CERN Council. (2024b). Accelerators. https://home.cern/science/accelerators

Trantham, P. S., Sikorski, J., de Ayala, R. J., & Doll, B. (2021). An item response theory and Rasch analysis of the NUDKS: A data literacy scale. Educational Assessment, Evaluation and Accountability, 1–23. https://doi.org/10.1007/s11092-021-09372-w

Uyumaz, G., & Sırgancı, G. (2020). Doğrulayıcı faktör analizi için gerekli örneklem büyüklüğü kaç kişidir? [What is the required sample size for confirmatory factor analysis?]. OPUS International Journal of Society Researches, 16(32), 5302–5340. https://doi.org/10.26466/opus.826895

Vahey, P., Rafanan, K., Patton, C., Swan, K., van't Hooft, M., Kratcoski, A., & Stanford, T. (2012). A cross-disciplinary approach to teaching data literacy and proportionality. Educational Studies in Mathematics, 81, 179–205. https://doi.org/10.1007/s10649-012-9392-z

Vahey, P., Yarnall, L., Patton, C., Zalles, D., & Swan, K. (2006, April). Mathematizing middle school: Results from a cross-disciplinary study of data literacy [Paper presentation]. Annual Meeting of the American Educational Research Association, San Francisco, CA.

Veneziano, L., & Hooper, J. (1997). A method for quantifying content validity of health-related questionnaires. American Journal of Health Behavior, 21(1), 67–70.

Worthington, R. L., & Whittaker, T. A. (2006). Scale development research: A content analysis and recommendations for best practices. The Counseling Psychologist, 34(6), 806–838. https://doi.org/10.1177/0011000006288127

There are 66 references in total.
Demirtaş, Çağrı . (2026). Development of the data literacy scale in social sciences: A validity and reliability study. Pedagogical Perspective, 5(1), 237-256. https://doi.org/10.29329/pedper.2026.169

Downloads

Article Information

  • File Downloads 5
  • Abstract Views 8
  • Altmetrics
  • Share
Download data is not yet available.

You may also start an advanced similarity search for this article.