Depersonalization
Depersonalization
Context: Sally authorizes access to a "depersonalized" version of her Road Warrior travel data.
For example, the data given to London Transport
Unclassified, available data
- Space-time travel data, speed, time
- Compliance with speeding / traffic restrictions
- Hours/day
Should have:
- Average distance traveled
- Average # of stops/starts
- Vehicle classification for congestion charge (permit/fee status)
- Segment by segment travel
Should not have:
- Obvious PII
- Name
- Driver's License #
- Addresses
- home
- work
- d. end-to-end travel
Open Questions
- Difference between Sally and the vehicle?
- How are other drivers accounted for?
Types of Anonymization (as a verb)
- Scrubbing (removing PII)
- Aggregate to ambiguity (increase the # of people that could be confused with Sally)
- number of replaceable entities
- number of queries
Depersonalization verses Anonymization
- Process. Steps taken.
- Not an endpoint. (Anonymization seems like an endpoint.)