Improving the quality, accessibility and endurability of national school data architecture.

About AERO's LLANIA dataset

The Australian Education Research Organisation (AERO)’s new Longitudinal Literacy and Numeracy in Australia (LLANIA) dataset is our first project aimed at maximising the value of educational data.

During 2022 and 2023, AERO undertook a national data linkage project, successfully linking up to 4 rounds of National Assessment Program – Literacy and Numeracy (NAPLAN) results for every school student in Australia, corresponding to their performance in Year 3, Year 5, Year 7 and Year 9. This resulted in LLANIA, a fully de-identified longitudinal NAPLAN dataset that has the potential to make great contributions to Australia’s education system.

The LLANIA dataset comprises:

  • data from 6,270,515 students enrolled in the Australian education system from 2008 to 2021
  • fully linked data from Year 3 to Year 9 for 25% of these students (N = 1,594,261).

LLANIA can be used to investigate a range of educational questions such as:

  • understanding student learning growth across different educational domains
  • insights about the performance of specific student groups, including the effects of disadvantages, adverse events or interventions aimed at improving student learning.

Longitudinal Literacy and Numeracy in Australia Dataset: Technical Report

AERO's Longitudinal Literacy and Numeracy in Australia (LLANIA) Dataset: Technical Report describes the creation of the LLANIA dataset, including the linkage process and quality assurance methods.


Keywords: educational datasets, student progress, learning outcomes, student performance, longitudinal data