<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Science Progress &#187; Peter Swire</title>
	<atom:link href="http://scienceprogress.org/author/pswire/feed/" rel="self" type="application/rss+xml" />
	<link>http://scienceprogress.org</link>
	<description></description>
	<lastBuildDate>Fri, 25 May 2012 14:25:09 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Consumer Data Privacy Codes of Conduct</title>
		<link>http://scienceprogress.org/2012/04/consumer-data-privacy-codes-of-conduct/</link>
		<comments>http://scienceprogress.org/2012/04/consumer-data-privacy-codes-of-conduct/#comments</comments>
		<pubDate>Wed, 11 Apr 2012 13:54:09 +0000</pubDate>
		<dc:creator>Peter Swire</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Home Page]]></category>
		<category><![CDATA[Science & Society]]></category>

		<guid isPermaLink="false">http://scienceprogress.org/?p=26013</guid>
		<description><![CDATA[Peter Swire discusses the balance of privacy and commercial usefulness of de-identified data in testimony before the Department of Commerce National Telecommunications and Information Administration. ]]></description>
			<content:encoded><![CDATA[<p><!--authorbio--><em>This is an excerpt from <a href="http://www.americanprogressaction.org/issues/2012/04/pdf/swire_testimony.pdf">the full comments</a> (pdf).</em></p>
<p>The National Telecommunications and Information Administration, or NTIA, has asked for comments on what issues should be addressed through a privacy multistakeholder process. Based on my experience in privacy law and policy, I believe an early and prominent candidate should be the definition of what counts as “de-identified” information. As discussed below this topic has multiple advantages, including heightened protection for consumers, positive effects on innovation and the broader economy, and likelihood of concrete, enforceable success for the process itself.</p>
<p>These comments provide background for the discussion and then explain the importance of the topic of de-identified data. The comments explain how the recent Federal Trade Commission privacy report provides a new and useful set of proposals for how to handle de-identified data, and concludes with an analysis of why the topic of de-identified data is a good candidate for early consideration in a multistakeholder process.</p>
<h2>Background</h2>
<p>As background for these comments, I am the C. William O’Neill Professor of Law at the Moritz College of Law of the Ohio State University, and Senior Fellow at the Center for American Progress Action Fund and the Future of Privacy Forum. Under President Bill Clinton I served as chief counselor for privacy in the U.S. Office of Management and Budget. Under President Barack Obama I was special assistant to the president for economic policy in 2009 and 2010. Further information is available at <a href="http://www.peterswire.net/">www.peterswire.net</a>.</p>
<p>This February the administration issued its white paper, “Consumer Data Privacy in a Networked World: A Framework for Protecting Privacy and Promoting Innovation in the Global Digital Economy.” This privacy framework defined a Consumer Privacy Bill of Rights. To implement this bill of rights, the framework called on the Department of Commerce to foster the development of enforceable codes of conduct for consumer privacy. These codes of conduct will be developed through multistakeholder processes, so that the range of relevant stakeholders can convene and develop codes of conduct even in the absence of binding legislation or regulation. Consumer privacy legislation has been difficult to enact in the United States, so consumer protection will advance more quickly through initiatives, such as the multistakeholder process, that do not depend on passage of such legislation.</p>
<p>Along with the administration’s framework, the Federal Trade Commission, or FTC, has continued its vital role in U.S. privacy policy and enforcement. On March 26, 2012, the FTC issued “Protecting Consumer Privacy in an Era of Rapid Change: Recommendations for Businesses and Policymakers.” This report reflected intensive FTC efforts on a wide range of privacy topics. The comments here, building on a short previous statement, focus on the FTC’s recommendations about how to approach the important issue of de-identified data.</p>
<h2>The importance of de-identified data</h2>
<p>The title of the administration’s white paper reflects two principal goals for policy concerning the data of individual consumers: “A Framework for Protecting Privacy and Promoting Innovation.” This title reflects the risks to individuals if privacy is not protected effectively. It also reflects the importance of creating good information rules in order to foster innovation and growth in our information economy.</p>
<p>The issue of de-identified data creates a vital opportunity to meet both goals—use data for innovation and growth while also protecting privacy. At least in theory, de-identified data allows us to have our cake and eat it, too. With de-identified data, we strip out the name and other information that reveals identity, but we nonetheless can process the data, do research, discover patterns, and innovate in how we respond to the information.</p>
<p>In any statute or other legal obligation, such as a company’s enforceable promise to protect privacy, the most important definition is what counts as covered by the law or obligation. Defining what counts as “de-identified” is crucial because it draws the line between what data is covered by privacy protections (still “identified”) and what data is not (“de-identified”).</p>
<p>In U.S. law de-identified data was first defined as part of the Health Insurance Portability and Accountability Act, or HIPAA, medical privacy rule drafted in the late 1990s. I was very involved in drafting the proposed and final HIPAA rule and paid particular attention to defining what counted as “de-identified.” In HIPAA “identified” data is considered personal health information, subject to the full range of privacy protections. If the data is scrubbed hard enough, however, then it becomes de-identified data and no longer subject to the regulatory requirements.</p>
<p>The final HIPAA medical privacy rule provided two ways to show that data was de-identified. First, the holder of the data could remove a list of at least 17 data fields that could identify a person, such as name, address, or Social Security number. Second, a statistical expert could certify that the risk is very small that the information could be used, alone or in combination with other reasonable available information, to re-identify the individual. Since HIPAA went into effect nearly a decade ago, health care entities have been able to publicly release health data if it has been scrubbed well enough to meet the regulatory requirements for de-identification.</p>
<h2>Finding a Goldilocks solution for de-identified data</h2>
<p>Since the HIPAA de-identification provisions were proposed in 1999, we have learned a lot about when and how it is possible to “re-identify” data—to link a person’s name with the supposedly de-identified data. Two big trends have made it harder to keep information de-identified. First, search on the Web has gotten much better. Google was not incorporated until 1998, and today’s search engines let anyone link together tidbits from previously hard-to-link data sources. Second, the amount of information on the Web about a typical person has grown astronomically, including all of the personal details on a person’s blog or Facebook page.</p>
<p>The combination of efficient search tools and lots of data means that there is a higher likelihood today that a person’s medical or other records can be re-identified even if the name and other traditional identifiers are deleted. For instance, the de-identified medical record might state that a person in Ohio had minor hand surgery on April 3. In the past, it would have been difficult or impossible for an outsider to figure out the name. Today, online search might turn up a social network thread about the hand surgery—there are multiple such surgeries in Ohio each day, but not that many. A bit of follow-up research, using the rest of the supposedly de-identified information, might easily pinpoint the person who had the surgery.</p>
<p>As academics have analyzed these facts about re-identification, some have concluded that the entire effort to de-identify data has failed, because of the risk of linking information back to the individual. Others have emphasized the limited actual success of re-identification efforts in practice, and found that the benefits to research and innovation are so great that they outweigh the privacy risks.</p>
<p>The preliminary FTC report, issued in 2010, received strong criticisms from both of these perspectives. The earlier report would have applied privacy protections to “consumer data that can be reasonably linked to a specific consumer, computer, or other device.” The debate centered on what the FTC meant by “reasonably linked.” Consumer groups correctly emphasized that it is easier now to search on the Web and re-identify data, at risk to privacy. Researchers and other users of data focused on the problems that come with an over-broad definition of “reasonably linked,” which could extend privacy rules to an almost unlimited range of data processing, if enough effort is put into tracking down and re-identifying data.</p>
<p>Responding to these critiques, the FTC looked at the technical de-identification issues, and found what I believe is a Goldilocks solution for the problem of de-identified data. The FTC provides what amounts to a safe harbor where: “(1) a given data set is not reasonably identifiable; (2) the company publicly commits not to re-identify it, and (3) the company requires any downstream users of the data to keep it in de-identified form.”</p>
<p>The FTC approach responds to the technical experts who correctly say that it is easier today to find data on the Web that helps us re-identify data. To address the privacy concerns the FTC approach first requires a company to make a data set reasonably de-identified. We can think of this as “good but not foolproof de-identification.” Then, in addition, the FTC requires administrative protections. The company has to commit publicly that it won’t re-identify the data. The company also has to get similar promises from anybody downstream who receives the data. These promises are enforceable because Section 5 of the FTC Act prohibits deceptive practices, such as broken privacy promises. Privacy is protected through the combination of technical measures, having reasonably de-identified data, and backup administrative measures, so that the only people who receive the data have made binding promises not to re-identify.</p>
<p>The FTC approach also responds to those who want to study data for research, innovation, and related purposes. Data must be scrubbed pretty hard but not incredibly hard—the dataset need merely not be “reasonably identifiable.” That data should still often be detailed enough to be useful for a variety of purposes, protected by the enforceable promises not to re-identify.</p>
<p>I have long believed that technical controls alone are not enough to protect consumers against possible re-identification, as shown in a 2009 report by the Center for Democracy and Technology and my December talk on de-identified data. The best path is to have reasonably strong technical protections, supplemented by the sorts of enforceable promises that the FTC report supports.</p>
<p><em>You can read more about why defining de-identified data is a good fit for the multistakeholder process in the <a href="http://www.americanprogressaction.org/issues/2012/04/swire_comments.html">full comments at the Center for American Progress Action Fund</a>. Peter Swire is the C. William O’Neill Professor of Law at Moritz College of Law at the Ohio State University, and Senior Fellow at the Center for American Progress Action Fund.</em></p>
<h2><strong>Further reading</strong><strong></strong></h2>
<div>
<hr align="left" size="1" width="33%" />
<div>
<p title="">The White House, “Consumer Data Privacy in a Networked World: A Framework for Protecting Privacy and Promoting Innovation in the Global Digital Economy” (2012), available at http:// www.whitehouse.gov/sites/default/files/privacy- final.pdf.</p>
</div>
<div>
<p title="">Federal Trade Commission, “Protecting Privacy in an Era of Rapid Change: Recommendations for Businesses and Policymakers” (2012), available at http://www.ftc.gov/opa/2012/03/privacyframework.shtm.</p>
</div>
<div>
<p title="">Peter Swire, “FTC Deserves Praise for Its De-Identification Safe Harbor,” Future of Privacy, March 26, 2012, available at http://www.futureofprivacy.org/2012/03/26/fpf-senior-fellow-peter-swire-ftc-deserves-praise-for-its-de-identification-safe-harbor/.</p>
</div>
<div>
<p title="">Paul Ohm, “Broken Promises of Privacy: Responding to the Surprising Failure of Anonymization” <em>UCLA Law Review</em> 57 (1701) (2010), available at <a href="http://ssrn.com/abstract=1450006">http://ssrn.com/abstract=1450006</a>.</p>
</div>
<div>
<p title="">Jane Yakowitz, “Tragedy of the Data Commons,” <em>Harvard Journal of Law and Technology </em>25 (2011), available at <a href="http://ssrn.com/abstract=1789749">http://ssrn.com/abstract=1789749</a>.</p>
</div>
<div>
<p title="">Ed Felton, chief technology officer of the FTC, listed de-identification as the top issue of “special interest to techies” in the FTC report. Ed Felton, “Tech Highlights of the FTC Privacy Report” (Washington: Federal Trade Commission, 2012), available at http://techatftc.wordpress.com/2012/03/26/tech-highlights-of-the-ftc-privacy-report/.</p>
</div>
<div>
<p title="">Center for Democracy and Technology, “Encouraging the Use of, and Rethinking Protections for De-Identified (and “Anonymized”) Health Data” (2009), available at https://www.cdt.org/healthprivacy/20090625_deidentify.pdf.</p>
</div>
<div>
<p title="">Peter Swire, “Keynote – Setting the State: How De-Identification Came into U.S. Law and Why the Debate Matters Today,” Future of Privacy Forum, Conference on De-Identification, 2011, available at http://www.peterswire.net/psspeeches2011.htm.</p>
</div>
<div>
<p title="">Peter Swire, “Peeping,” <em>Berkeley Technology Law Journal</em> (2009), available at http://ssrn.com/abstract=1418091.</p>
</div>
<div>
<p title="">Peter Swire, “Markets, Self-Regulation, and Government Enforcement in the Protection of Personal Information,” in U.S. Department of Commerce, “Privacy and  Self-Regulation in the Information Age” (1997), available at http://ssrn.com/abstract=11472.</p>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://scienceprogress.org/2012/04/consumer-data-privacy-codes-of-conduct/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>We Are the Web</title>
		<link>http://scienceprogress.org/2007/12/we-are-the-web/</link>
		<comments>http://scienceprogress.org/2007/12/we-are-the-web/#comments</comments>
		<pubDate>Wed, 05 Dec 2007 16:35:16 +0000</pubDate>
		<dc:creator>Peter Swire</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Home Page]]></category>
		<category><![CDATA[Innovation]]></category>
		<category><![CDATA[Internet]]></category>

		<guid isPermaLink="false">http://www.scienceprogress.org/2007/12/we-are-the-web/</guid>
		<description><![CDATA[Policymakers need to give consumers the choice to protect their privacy or allow e-commerce companies to profile their web travels.]]></description>
			<content:encoded><![CDATA[<p>Online advertising has exploded in recent months, both in business terms and in concerns about the effect of new ad technologies on privacy. There has been a wave of proposed or actual mergers led by Google Inc., the biggest online search and text advertising company, which is seeking to merge with DoubleClick Inc., the biggest display ad company.</p>
<p>To try to learn about these emerging technologies, the Federal Trade Commission recently held a <a href="http://www.ftc.gov/bcp/workshops/ehavioral/index.shtml">Town Hall event</a>. The event produced little consensus. Privacy advocates criticized the new types of online profiling. Industry mostly explained why profiling is good because people will see only the ads for things they are interested in.</p>
<p>At the Town Hall, along with my written <a href="http://www.americanprogress.org/issues/2007/10/privacy.html">testimony</a>, I proposed a common-sense test for what should happen—individuals should have a realistic way to choose not to be profiled when they go online.</p>
<p class="pullquote">Repeated polling shows that millions of Americans don’t want to have detailed and permanent profiling of their surfing habits.</p>
<p>Current practices on the Web often don’t meet this simple test. For instance, the <a href="http://www.networkadvertising.org/managing/opt_out.asp">Network Advertising Initiative</a> operates a web site that lets you stop DoubleClick and similar cookies from following you across the Net. Although the NAI opt-out tool has worked for me, I spoke with an FTC attorney at the Town Hall who had spent nearly an hour online the night before. She had tried without success to get the NAI opt-out to work for her. If it’s too hard for smart attorneys trying to opt out as part of their job, then it’s not a good enough system.</p>
<p>I learned the common-sense test when I worked on privacy in the White House under President Clinton. It was a huge help when we could say that individuals had a choice. One example was the <a href="http://www.ftc.gov/privacy/glbact/glbsub1.htm">financial privacy law</a> in 1999. Under the law, individuals gained the choice not to have their personal information sent to outside marketers.</p>
<p>Another example was in 2000, when there was a cookie problem on the White House website. We created a <a href="http://www.whitehouse.gov/omb/memoranda/m00-13.html">policy</a> that tracking cookies would be set only with the choice of the surfer.</p>
<p>On a political level, these measures worked—the administration could explain that individuals had a realistic choice about how their data would be handled. These measures also basically worked at the policy level. The reason: People who cared a lot about their privacy now had a way to say no to certain practices. (We later pushed for <a href="http://papers.ssrn.com/sol3/papers.cfm?abstract_id=347402">stronger financial privacy protections</a>, but at least the 1999 law contained some important privacy measures.)</p>
<p>People in the advertising industry, understandably, don’t want to face burdensome regulations on their new business models. They believe targeted ads, based on detailed profiling of surfing habits, will be more profitable for companies. They say that consumers will benefit by seeing more relevant ads. They argue that profiling is benign because it only determines which ads get served to the desktop.</p>
<p>There are two major problems with that position, however. First, in our post-Patriot Act world, records can quite easily end up in government hands. The government can now use National Security Letters, so called Section 215 orders, and other tools to see the surfing records, often without any notice to the individual concerned.</p>
<p>Second, repeated polling shows that millions of Americans don’t want to have detailed and permanent profiling of their surfing habits. Some people are spooked by the risk that the government might be looking over their shoulders. Others simply want to surf without leaving detailed records in the hands of multiple company databases.</p>
<p>In short, there are good reasons for leaders in technology and E-commerce to find realistic ways for consumers to have a choice. I personally think, for instance, that search engines should have an option not to link my current search with previous searches. <a href="http://gizmodo.com/gadgets/internet/askcom-brings-anonymous-searching-with-askeraser-280756.php">Ask.com</a> has announced plans for this option, but the practices of other search engines <a href="http://www.news.com/In-their-own-words-Search-engines-on-privacy/2100-1029_3-6202047.html">vary widely.</a></p>
<p>Industry leaders should listen to the privacy concerns and come up with workable ways for consumers to get what they prefer when it comes to online profiling. Some consumers will love personalization and want online companies to know everything about them. Some consumers won’t care one way or the other. But some consumers don’t like the idea of their search requests and surfing habits being stored out of their control.</p>
<p>In the long run, the press and the political system are likely to push back whenever detailed profiling becomes a mandatory part of surfing the Web. It is common sense to build user-friendly systems from the start.</p>
<p><em><a href="http://www.americanprogress.org/aboutus/staff/SwirePeter.html">Peter Swire</a> is a Senior Fellow at the Center for American Progress and the C. William O’Neil Professor at the Moritz College of Law of the Ohio State University. From 1999 to early 2001, he served as Chief Counselor for Privacy in the U.S. Office of Management &amp; Budget.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://scienceprogress.org/2007/12/we-are-the-web/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

