Congrats to Google Takeout for adding Blogger, G+ Page exports

The Data Liberation Front advanced Google’s data portability practices another two notches, adding export for Blogger blogs and for Google+ Pages to Google Takeout. Kudos.

Data Liberation Front

A few questions:

How complete are these exports?

  • Do they include comments left on blogs? 
  • Trackbacks? 
  • Site settings? 
  • Media and files? 
  • Blogrolls? 
  • Themes and templates? 
How do they deal with people? 
  • List of contributors (including owners, guests authors, editors, commenters)
  • Their profiles, including bios and avatars
  • Their roles, including permissions

Are the exported files adjusted so they will work elsewhere? For example, so links that worked on will work if imported to 

Is a rendered view of content backed up as well as the raw content? Blogger posts are exported as Atom syndication files. Are there html versions of each page too? 

Are events properly preserved? A common fault in blog exports is the rewriting of time-stamps during export, import or migration. 

Export is good. But don’t stop there. Check portability projects for the quality and completeness of their service. commits to data portability as a brand promise, the we’re-supported-by-our-users-not-by-advertisers twitter alternative, promises data portability to its uses. Let’s look at their core values.

  • We are selling our product, NOT our users.

    We will never sell your personal data, content, feed, interests, clicks, or anything else to advertisers. We promise.

On the one hand, no selling your explicit, inferred or behavioral data to advertisers. Excellent clarity.

On the other, this promise is limited to sales to advertisers. Others will gladly pay for your data: aggregators who assemble and resell rich personal profiles from diverse sources; insurance companies to better characterize their risks; employers scanning prospective and current employee attitudes; law enforcement and intelligence agencies analyzing large bodies of text. Is promising not to sell to all third-parties or just to advertisers?

On the other hand (the third hand, I know) what if a user wants to sell their data? Are you ruling out being an infomediary, acting on behalf of users willing to license their history and activity to trusted third-parties? If you just want to be a custodian of user data, will you forbid API partners from selling user data originating on if users consent?

  • You own your content. members will always have full control of their data. Members have the fundamental right to easily back-up, export, and delete ALL of their data, whenever they want.

The human language is “ownership,” but the legally binding one is “control.”

So far is modeling an early export of your data.

When they hit $500K in revenue, about 10K paying users, Dalton Caldwell blogged:

When you are logged into the alpha, we provide a button which will email you a .zip file of all of your content in a structured format. If you are an alpha tester, go ahead and try it out. It works.

Export to a file is a great start. I’m eager to see docs showing the data structure and a list of other services that readily parse that structure.

I’m also eager to see more useful data portability controls:

  • Consume my data from other sources with open arms, both one shot and updates. There are thousands of services supporting feeds, activity streams, directories, etc.
  • Flipping that around, should also push my my life updates to other services.
  • Disclosure on where in the physical world my data is kept and flows. Maybe even choice of where. This matters when servers are subject to national court, law enforcement, and intelligence orders and to government censorship regimes.
  • Effective due process regarding choosing to disable or kill my account. It’s your business and you have the power to shut down any user or developer. How will you assure eviction is fair and seen as fair?
  • Controlling how my identity is presented to different publics within Is it “one account, one persona” or can I use one account to interact with my work, family, geek, cosplay, or political publics in ways I can tailor for each? We use faceted identity in the real world and need you to model it.
  • Discussion of who “owns” what in co-created content. Will you permit deletions to leave holes in a threaded conversation? Will you let me delete a photo I post and then delete it from other’s accounts even if others reshared it?
  • We will align our financial incentives with members & developers.

    In this paid model, the more people that value our service highly enough to pay for it, the more money we make. Our financial incentives are entirely tied to successfully delivering a service you can depend on, not on holding our ecosystem hostage.

Shoplifting is driven by value, opportunity, and perceived risk. So is having a service abuse user trust. Aligning financial incentives is a strong step to backing up data portability promises.

  • employees spend 100% of their time improving our services for you, not advertisers.

    Rather than waste most of our engineering time coming up with new and exciting ways to sell your personal data to advertisers, 100% of our engineering and product team will be focused on building the most innovative and reliable service we can.

This is about return on membership fees more than data portability. Still nice, though.

  • We will operate a sustainable, predictable business. will always have a clear business model. We know that depending on services that could go away or desperately squeeze users for more and more money is a toxic cycle.
    We want our ecosystem to rest easy that is built on a financially solid foundation.

Oceans of personal data are lost as companies kill products, turn off servers, or shut down their businesses. Business viability directly supports user-centric data portability. Other things can go wrong but at least there should be cash to pay for a graceful exit if things go badly.

  • We respect and value our developer community.

    We believe that developers building on our platform are increasing the value and attractiveness of our service to paying members, and thus our financial interests are fundamentally aligned. We hope developers build large, robust businesses on top of our platform. Even if it means that we will likely forgo some huge future revenue streams, we will NEVER screw developers acting in good faith.

This is a path of potential conflict:’s promises to users vs. developers.

Will build its personal data portability pledges into developer API terms of service? How strongly will those commitments be communicated? How will they be validated and abuses reported? What sanctions are set for abusers? What forms of redress will enable for users? It’s easier to have these conversations in alpha.

  • Our most valuable asset is your trust.

    Many people have become so cynical about user-hostile, privacy-violating social services that they refuse to participate at all. We can understand why. Earning your trust is the most important thing we can do. It won’t be easy, and we will make some mistakes, but we will do our best to be honest and transparent.

Trust, but verify.

These are bold values to hold. Eager to see what they look like in your user and developer Terms of Service. Congrats on your launch.

Data Gravity, or the theory that Data has Mass.

There’s an interesting article on TechCrunch today entitled “How Authoritarianism Will Lead To The Rise Of The Data Smuggler.”

While the article itself is interesting, and discusses a level of data portability far beyond what most people think about today, it also makes reference to the concept of “Data Gravity” posited by Dave McCrory. Dave’s rather unique concept is wrapped around the following statement (excerpted from the TC article by Alex Williams):

“Data has its own mass. When data gets stored it becomes harder to move. The more data stored, the greater the mass.”

Intriguing – and very cool. Check out the Data Gravity website for lots of details:

Filed under: thinking outside the box with Data Portability.

–Steve Repetti

Tim Berners-Lee Demands Data Portability

Wow, crazy good day for data portability capped by Tim Berners-Lee’s thoughts on the subject. From CNET: “The father of the World Wide Web says having all that data could have “tremendous potential to help humanity.”

Full article here:

The State of Data Portability in Social Media, Part I – A Closer Look at Facebook

[The following is not a commentary on data portability policies at Facebook --- that will be a follow-up to this series. Instead, this article attempts to document the current state of data portability within social media, and in this case, Facebook in particular]

Every day, Facebook consumes billions of snippets of people’s lives in . . . → Read More: The State of Data Portability in Social Media, Part I – A Closer Look at Facebook

Facebook: More access to your data

Rebecca MacKinnon's Consent of the Networked book cover

Facebook continues to improve on data portability issues. On The Media spoke last week to an Austrian student sent a thousand-page data dump when he asked Facebook for his personal data under EU law. Today, when you ask, Facebook sends you a form email (text below). You’re still not getting all of your data, but your self-service options are more complete and easier. Data you can download includes your profile and much of your activity, including off-Facebook “likes.” How useful and reusable is the downloaded data? I don’t know; if you try, please ping me and I’ll update the post. Meanwhile, Facebook’s Data Access Request Team writes “We expect to have a new tool with additional categories of data to download available in the next few months.”  Here’s hoping the new downloads come with specs so third-parties can put your archive to work.

Continue reading Facebook: More access to your data

Data Portability Wars : Google and Facebook vs. YOU

Well, here we go again.

The big companies love to embrace data portability and the freedom it provides its users, not to mention the press and goodwill that comes with it, as long as it doesn’t conflict with their corporate agenda.

Let’s call it what it is: Facebook and Google both support “convenient” data . . . → Read More: Data Portability Wars : Google and Facebook vs. YOU

Merc: Battle brewing over control of personal data online

Mike Swift writes up the personal data space as a contest between individuals and large corporations. Swift interviewed Kaliya Hamlin of the Personal Data Ecosystem Consortium and PDEC members, Personal, and Singly. The Consortium doesn’t approach the challenge as a direct conflict. They see a realignment of behavior by people and enterprises producing . . . → Read More: Merc: Battle brewing over control of personal data online

Google Unleashes New Data Portability Initiative: Google Takeout

Google today unveiled a new service that provides advanced Data Portability across its diverse platform.  Google Takeout ( makes it easy to extract your data from a variety of Google Services including: Buzz, Contact and Circles, Picasa Web Albums, and Profile. The information is provided in a variety of formats, including vCard and JSON . . . → Read More: Google Unleashes New Data Portability Initiative: Google Takeout

Data Portability Applauds US CIO, Mourns Departure

Today, friends of Data Portability lost an ally in their cause when the Federal CIO, Vivek Kundra, announced he will be leaving his post in August. Mr. Kundra was the first-ever Chief Information Officer of the United States. During his tenure, Mr. Kundra championed the use of open standards, cloud computing, accessibility, and data . . . → Read More: Data Portability Applauds US CIO, Mourns Departure