Depersonalization
Depersonalization
Context: Sally authorizes access to a "depersonalized" version of her Road Warrior travel data.
For example, the data given to London Transport
Unclassified, available data
Space-time travel data, speed, time
Compliance with speeding / traffic restrictions
Hours/day
Should have:
Average distance traveled
Average # of stops/starts
Vehicle classification for congestion charge (permit/fee status)
Segment by segment travel
Should not have:
Obvious PII
Name
Driver's License #
Addresses
home
work
d. end-to-end travel
Open Questions
Difference between Sally and the vehicle?
How are other drivers accounted for?
Types of Anonymization (as a verb)
Scrubbing (removing PII)
Aggregate to ambiguity (increase the # of people that could be confused with Sally)
number of replaceable entities
number of queries
Depersonalization verses Anonymization
Process. Steps taken.
Not an endpoint. (Anonymization seems like an endpoint.)