GDPR: what are personal data?
The GDPR applies to the use of personal data. The definition of personal data in the context of this law is therefore of great importance.
Personal data are any information about an identified or identifiable natural person. A natural person is considered to be identifiable if he or she can be identified directly or indirectly. Information that at first sight doesn't appear to be traceable to a person can therefore be personal data according to the definition of the GDPR. This may be the case if the information can be traced back to the person in question, e.g. by combining the data with other or additional data.
- Some examples of 'regular' personal data include: name, address, e-mail address, photo, ID number, IP address, employee number, private or professional telephone number (who’s who), login data, identification cookies, account number, CV, log data (including cafeteria, parking use, web use, surfing), camera images, personnel files, wage data, professional expenses, etc.
- Please note: data that at first sight doesn't directly lead to (identification of) a person can still be personal data. For example, data about a person's reaction times to a task, brain activity (e.g. EEG), blood sugar level, personality, skin conductance and heart rate are also (sensitive) 'personal data' when there's also information that links this data (whether pseudonymised or not) to the natural person (see below under 'Pseudonymized data' and 'Anonymized data').
Data concerning deceased persons or organisations aren't personal data according to the GDPR and therefore fall outside the scope of the GDPR. Other laws and regulations may, however, apply to these data.
Special categories
Some personal data belong to 'special categories'. These are personal data from which you can derive certain sensitive information. Processing such data therefore entails a higher risk and potentially bigger impact on the rights and freedoms of the data subject(s). Special categories of personal data (sensitive personal data) are those that may reveal the following:
- racial or ethnic origin,
- political views,
- religious or philosophical beliefs,
- membership of a trade union,
- genetic data,
- biometric data,
- health data, or
- someone's sexual behavior or sexual orientation.
Pseudonymised personal data
Pseudonymised personal data is personal data that's been processed in such a way that it can no longer be linked to a specific data subject without additional data being used (this was referred to as 'coding' in previous Privacy Legislation). It's important to store this additional data separately and to take the necessary technical and organizational measures to ensure that the personal data can't be linked to an identified or identifiable natural person.
Pseudonymised personal data are still personal data protected by the GDPR. In this case, the criterion isn't whether the pseudonymised dataset reveals who's who, but whether the data - with or without additional information sources - can be traced back to an identifiable natural person. In research, pseudonymisation is typically achieved by 'obscuring' all identifiable elements (removing, replacing, generalizing, ...) and where possible separating them from the research data in a separate key file that's securely stored in a separate place. If necessary, both the key file and the research data contain a unique but arbitrary code so that the link can be made between the data subjects and the pseudonymised research data.
Anonymised personal data
With anonymised personal data, the possibilities for identification have been 'irreversibly' removed by means of a processing technique.
It's important to evaluate whether the data can be related to an identified, or identifiable, person. Data that can't be traced back to the original individuals with reasonable effort are anonymous data. However, data that can be traced back to the original individuals with reasonable effort aren't anonymous - after all, it's traceable to an identifiable person. They therefore remain 'personal data' according tot the definition of the GDPR, so the legislation still applies.
For this reason, it's difficult to truly anonymise many types of research data (for example: qualitative data, large data sets with a wide range of personal data, etc.).
Please note that if you anonymise personal data yourself, you do of course work with identifiable personal data at the start of your research and during anonymisation - at which time the GDPR will apply. This means that you must meet the requirements of the GDPR, starting with registering your processing activity.
Anonymous data
Anonymous data is data that doesn't relate to an identified or identifiable natural person or to personal data that's been made anonymous in such a way that the data subject is not or no longer identifiable (by any individual in any way).
Anonymous data aren't personal data and don't fall under the scope of the GDPR.
Please note: even if you only process anonymised data, it's still important to evaluate the ethical aspects of collecting or processing those data.
More information
More tips
- GDPR: What to keep in mind when developing or deploying apps for research? (Research integrity & ethics)
- GDPR: Can I share research data with personal data with other researchers or institutions when my research project has ended? (Research integrity & ethics)
- GDPR: how can I ensure that the processing of personal data is lawful? (Research integrity & ethics)
- GDPR: how do I protect my data correctly? (Research integrity & ethics)
- GDPR: how do I register personal data processing activities? (Research integrity & ethics)
- GDPR: how long may I store research data containing personal data? (Research integrity & ethics)
- GDPR: how to be transparent to data subjects in my research? (Research integrity & ethics)
- GDPR: Pseudonymisation of personal data (Research integrity & ethics)
- GDPR: what are some things to consider when processing personal data from minors? (Research integrity & ethics)
- GDPR: what are the basic principles? (Research integrity & ethics)
- GDPR: what are the different roles and responsibilities according to the GDPR? (Research integrity & ethics)
- GDPR: What do I need to think about when transferring personal data to third countries or international organisations? (Research integrity & ethics)
- GDPR: what do I need to think about when using a mailing list in the context of my research? (Research integrity & ethics)
- GDPR: what has changed with regard to the previous privacy legislation? (Research integrity & ethics)
- GDPR: what information should I include in an informed consent form when the processing of personal data is based on the consent of the data subjects? (Research integrity & ethics)
- GDPR: what is the General Data Protection Regulation? (Research integrity & ethics)
- GDPR: what rights do data subjects have, how do I respect them and what exceptions may apply to research? (Research integrity & ethics)
- GDPR: what should I do in case of a data breach? (Research integrity & ethics)
- GDPR: what should I do in the event of further/secondary processing of personal data? (Research integrity & ethics)
- GDPR: what should I keep in mind when designing my research? (Research integrity & ethics)
- GDPR: what should I keep in mind when processing special categories of personal data? (Research integrity & ethics)
- GDPR: What should I think about when I collaborate with others or share my data? (Research integrity & ethics)
- GDPR: When am I processing high-risk personal data and when do I need to conduct a DPIA? (Research integrity & ethics)
- GDPR: when does it apply to my research? (Research integrity & ethics)
- GDPR: who are considered to be vulnerable persons? (Research integrity & ethics)
- GDPR: why is it important to comply with this legislation? (Research integrity & ethics)
- Qualtrics: how do I use this survey tool? (Research integrity & ethics)
Translated tip
Last modified Oct. 21, 2022, 11:02 a.m.