I’d argue that open data today is exactly where open source was some two decades ago, and I’d love to see if we couldn’t fast forward the community a bit. Imagine if every time the government posted a dataset, rather than posting the data as a zip file or to a proprietary data portal, the agency treated the data as open source. All of a sudden data sets get a running log of known issues, and not just those known to the agency. Consumers of the data can submit proposed changed to do everything from normalizing columns to correcting errors to making the data itself more useable. Most importantly, as that data evolves over time, there’s a running log of exactly what’s changed, a critical feature in the regulatory context (e.g., what licenses were issued in the past week?).
Je commence aujourd’hui à travailler pour Etalab (toujours via scopyleft) afin d’améliorer la plateforme data.gouv.fr. J’espère pouvoir apporter mes super-pouvoirs afin de rendre le projet plus accessible à la participation citoyenne, découvrir de nouvelles pratiques et de nouvelles personnes. Si vous avez des frustrations sur la plateforme, c’est le moment de les exprimer.