About Linkage
What is data linkage?
As we utilise different health care services, information about our health is recorded and stored in databases created by hospitals, health departments and other organisations that provide health care. Typically this information is stored independently and managed by the data custodians in each organization. With data linkage we can connect these different pieces of information that relate to the same individual while also maintaining the privacy and integrity of the data sources. The data can be drawn from multiple databases and across several points in time. For example, a researcher might want to examine the relationship between cancer in childhood and recurrence in adulthood and could use linked data for their study.
By linking information from the same individual across time, it is possible to create a ‘lifebook’ of events rather than just examine one chapter at a time. A researcher can thus analyse the outcomes for a large sample of individuals across time and draw conclusions based on the average experience of the cohort. This sort of linkage opens up a range of research opportunities. It enables us to look at health outcomes as dynamic processes rather than as static events. While studying health outcomes, the relevant information is rarely found in one place. Data linkage allows us to use a wide range of applicable information that we might not have otherwise been able to include if we were only utilizing one source.
Why is it useful?
Data linkage allows us to test research hypotheses on existing data that is readily available. It is also quicker and more cost-effective to use these existing data collections rather than having to collect data for each new research project. Furthermore, if two projects use similar data, we do not have to collect the same information twice thereby reducing cost. Ordinarily, acquiring data for the whole population can be cost-prohibitive. However by using data linkage, we are able to utilize data for entire populations rather than having to use only a population sample to draw conclusions.
What data is currently available in Queensland?
There are currently 5 routinely linked datasets available for use in Queensland. These ‘core’ datasets include:
-
Queensland Hospital Admitted Patient Data Collection
-
Queensland Perinatal Data Collection
-
Registrar General – Birth notification
-
Registrar General – Birth registration
-
Registrar General – Deaths
Over the next few years, the core datasets will be expanded to include additional health data from Queensland.
Can other non-core databases be linked to core databases?
Yes, this kind of linkage is also possible. Please contact us for additional information since the possibilities will change depending on the type and ownership of data.
Who can use this data?
This data can be made available to researchers who have applied for and received clearance from the appropriate research ethics committees. Data is provided in accordance with protocols that have been designed to maintain confidentiality and security.
