<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Open House Project &#187; Structured Data</title>
	<atom:link href="http://www.theopenhouseproject.com/category/structured-data/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.theopenhouseproject.com</link>
	<description>Recommendations, Resources, and Reform</description>
	<lastBuildDate>Wed, 23 Feb 2011 16:24:51 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Bulk data downloads approved in the omnibus spending bill (success!)</title>
		<link>http://www.theopenhouseproject.com/2009/03/11/bulk-data-downloads-approved-in-the-omnibus-spending-bill-success/</link>
		<comments>http://www.theopenhouseproject.com/2009/03/11/bulk-data-downloads-approved-in-the-omnibus-spending-bill-success/#comments</comments>
		<pubDate>Wed, 11 Mar 2009 14:08:13 +0000</pubDate>
		<dc:creator>Joshua Tauberer</dc:creator>
				<category><![CDATA[Congress]]></category>
		<category><![CDATA[GPO]]></category>
		<category><![CDATA[Structured Data]]></category>
		<category><![CDATA[appropriations]]></category>
		<category><![CDATA[approps]]></category>
		<category><![CDATA[clerk of the house]]></category>
		<category><![CDATA[govtrack]]></category>
		<category><![CDATA[legislation]]></category>
		<category><![CDATA[library of congress]]></category>
		<category><![CDATA[maplight]]></category>
		<category><![CDATA[openhouseproject]]></category>

		<guid isPermaLink="false">http://www.theopenhouseproject.com/?p=462</guid>
		<description><![CDATA[Two recommendations of our report have been moved forward in the FY09 omnibus appropriations bill (H.R. 1105) which cleared the Senate yesterday and the House last month. The first recommendation in our chapter on legislative databases was that the Library of Congress make its bill status database directly available to the public and that the [...]]]></description>
			<content:encoded><![CDATA[<p>Two recommendations of our report have been moved forward in the <a href="http://www.govtrack.us/congress/bill.xpd?bill=h111-1105">FY09 omnibus appropriations bill (H.R. 1105)</a> which cleared the Senate yesterday and the House last month. The first recommendation in our chapter on <a href="http://http//www.theopenhouseproject.com/the-open-house-project-report/3-legislation-database/">legislative databases</a> was that the Library of Congress make its bill status database directly available to the public and that the GPO not sell legislative documents to the public. These have been the two issues I&#8217;ve had my sights on over the last three years (probably starting <a href="http://groups.yahoo.com/group/committeewatch/message/153">here</a>). The second recommendation was about <a href="http://www.theopenhouseproject.com/the-open-house-project-report/12-coordinating-web-standards/">coordinating web standards</a> across Congress. These recommendations are addressed in two paragraphs the <a href="http://appropriations.house.gov/FY2009_consolidated.shtml">House statement accompanying the bill</a> for Division G &#8211; Legislative Branch, which is almost like being law itself.</p>
<p>The two paragraphs were added by <a href="http://honda.house.gov/">Congressman Mike Honda</a> of California, one of our champions of the use of technology to further transparency and civic engagement. John Wonderlich of Sunlight Foundation, Rob Pierson in Honda&#8217;s office, and I collaborated on this over a long period of time. Honda got involved in 2007 <a href="http://www.theopenhouseproject.com/2008/02/01/congressman-honda-on-the-open-house-cause/">asking the Library to look into this</a> and then in 2008 <a href="http://www.theopenhouseproject.com/2008/07/14/legislative-databases-recommendation-makes-it-to-house-leg-branch-appropriations-markup/">getting the paragraphs added to the bill markup</a>.</p>
<p><span id="more-462"></span></p>
<p>So here they are:</p>
<blockquote><p>
Congressional Technology Coordination.-The House of Representatives needs a strategic and coordinated plan that will prepare for the future technology needs of the institution. A 2006 report commissioned by the Chief Administrative Officer and the Committee on House Administration, entitled Strategic Technology Road Map for the Ten Year Vision of Technology in the House of Representatives, provided a suggested structure for Information Technology evaluation and decision making. The Chief Administrative Officer, the Clerk, and the Sergeant at Arms are asked to prepare a report by June 30, 2009 on their efforts or plans to develop House-wide data-sharing standards; implement standard legislative document formats; address the increasing resource challenges of Member offices; and identify disparate systems throughout the institution that prevent it from taking advantage of economies of scale.  [page 2]</p></blockquote>
<p>and</p>
<blockquote><p>Public Access to Legislative Data. There is support for enhancing public access to legislative documents, bill status, summary information, and other legislative data through more direct methods such as bulk data downloads and other means of no-charge digital access to legislative databases. The Library of Congress, Congressional Research Service, and Government Printing Office and the appropriate entities of the House of Representatives are directed to prepare a report on the feasibility of providing advanced search capabilities. This report is to be provided to the Committees on Appropriations of the House and Senate within 120 days of the release of Legislative Information System 2.0. [page 11]</p></blockquote>
<p>According to an article in <a href="http://blog.wired.com/27bstroke6/2009/03/federal-bill-wo.html">Wired</a>: â€œIn our web 2.0 world, we can empower the public by providing them with raw data that they can remix and reuse in new and innovative ways,&#8221; says Honda, who is vice chairman of the Appropriations Subcommittee on the Legislative Branch. &#8220;With these tools, the public can collaborate on projects that can help legislators to create better policies to address the pressing challenges facing our nation.â€ There&#8217;s also a good <a href="http://www.motherjones.com/politics/2009/03/congressional-data-mining-coming-soon">article at Mother Jones</a> and a nice <a href="http://radar.oreilly.com/2009/03/bulk-data-downloads-government-transparency-breakthrough.html">post by Tim O&#8217;Reilly</a>.</p>
<p>The concept of bulk data downloads hasn&#8217;t been missed by many parts of the government. The Census Bureau and the Federal Elections Commission, for instance, are fantastic at sharing with the public as much as they can. In the latter case it is electronic versions of campaign contribution filings, which is obviously very important for preventing corruption. But, there are significant gaps in other areas of the government where a little legislating is necessary.  Here we&#8217;re talking about information on bills in congress going back around two decades, and the information going forward.</p>
<p>The Library of Congress has a database of this information but they don&#8217;t share it with the public. Sharing it would mean that creating sites like GovTrack &#8212; and the various other sites that use data from GovTrack including OpenCongress and MAPLight.org &#8212; would be a little easier, but also a little more accurate. Right now GovTrack goes through a roundabout process to reverse-engineer the same information we are seeking from this database. Basically, we already have the information by scraping it off of thomas.loc.gov &#8212; we&#8217;d just rather get it directly rather than the way it is assembled now. So because I go through so much trouble to reverse-engineer the data I want, not so many things will change in an obvious way on GovTrack &#8212; it&#8217;ll just be that my life will be a little easier and the information will be a little more complete and up to date. But, you can expect to see other sites spring up doing new and interesting things with the information &#8212; ways of visualizing the congressional process that we couldn&#8217;t yet imagine. </p>
<p>The Government Printing Office is mentioned because of how they make legislative documents like the text of bills available to the public. PDFs and text-only versions are made available for free already. No problem there. But they have other files that would be useful to sites like GovTrack which they sell at ridiculously high subscription prices. Those files would make comparisons of bill text easier to produce (although GovTrack already has this feature, again by essentially going about it the hard way). If you think about it from the perspective that some bills go through Congress so fast no one has time to read them through, being able to apply technology to the process is so important, like to detect changes in the text of bills between versions to make it easier for people to get through it. This is what GPO is preventing by selling some of its files, rather than providing them to the public for free (which it is essentially mandated to do for most documents &#8212; why they exempt certain documents is not known). </p>
<p>Now, it&#8217;s not that the Library doesn&#8217;t necessarily *want* to share its database. It&#8217;s just that sharing it wasn&#8217;t a part of their mandate from Congress and they don&#8217;t want to upset Congress by stepping out of their mandate. The omnibus bill is an indication from the House to the Library that this would be something supported by Congress. (My understanding is that the Library has been seeking permission from Congress to do some of these things, probably in response to a previous push for this, but the omnibus legislation has been in the works concurrently.) </p>
]]></content:encoded>
			<wfw:commentRss>http://www.theopenhouseproject.com/2009/03/11/bulk-data-downloads-approved-in-the-omnibus-spending-bill-success/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Watch the revisions to the bail-out bill</title>
		<link>http://www.theopenhouseproject.com/2008/10/01/watch-the-revisions-to-the-bail-out-bill/</link>
		<comments>http://www.theopenhouseproject.com/2008/10/01/watch-the-revisions-to-the-bail-out-bill/#comments</comments>
		<pubDate>Wed, 01 Oct 2008 21:12:18 +0000</pubDate>
		<dc:creator>Joshua Tauberer</dc:creator>
				<category><![CDATA[Structured Data]]></category>
		<category><![CDATA[data visualization]]></category>
		<category><![CDATA[govtrack]]></category>
		<category><![CDATA[visualizations]]></category>
		<category><![CDATA[xml]]></category>

		<guid isPermaLink="false">http://www.theopenhouseproject.com/?p=425</guid>
		<description><![CDATA[Following John&#8217;s note on an OHP mail list email, I adapted the bill comparison tool I developed for GovTrack and used it to analyze the changes made between the draft PDFs that have been circulating of the economic bail-out bill that is now a large package of legislation. I found five drafts, going back to [...]]]></description>
			<content:encoded><![CDATA[<p>Following John&#8217;s note on an OHP mail list email, I adapted the bill comparison tool I developed for GovTrack and used it to analyze the changes made between the draft PDFs that have been circulating of the economic bail-out bill that is now a large package of legislation. I found five drafts, going back to Thursday, September 25 and the latest one from the Senate today. You can see the successive changes from draft to draft <a href="http://www.govtrack.us/special/econstimbill/changes.xpd">here</a>.</p>
<p>It&#8217;s not very pretty because while bill writers have been posting the PDFs, PDFs don&#8217;t make it easy to make comparisons. The bill writers <strong>are</strong> composing the bills in XML, and if they made those available we the public would have an easier time. Maybe we wouldn&#8217;t complain to our reps so much either because we could actually understand what is going on better!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.theopenhouseproject.com/2008/10/01/watch-the-revisions-to-the-bail-out-bill/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Legislative Databases recommendation makes it to House Leg Branch Appropriations markup</title>
		<link>http://www.theopenhouseproject.com/2008/07/14/legislative-databases-recommendation-makes-it-to-house-leg-branch-appropriations-markup/</link>
		<comments>http://www.theopenhouseproject.com/2008/07/14/legislative-databases-recommendation-makes-it-to-house-leg-branch-appropriations-markup/#comments</comments>
		<pubDate>Mon, 14 Jul 2008 17:42:28 +0000</pubDate>
		<dc:creator>Joshua Tauberer</dc:creator>
				<category><![CDATA[GPO]]></category>
		<category><![CDATA[House of Representatives]]></category>
		<category><![CDATA[LoC]]></category>
		<category><![CDATA[OpenHouse]]></category>
		<category><![CDATA[Structured Data]]></category>
		<category><![CDATA[appropriations]]></category>
		<category><![CDATA[cha]]></category>
		<category><![CDATA[library of congress]]></category>
		<category><![CDATA[ohp]]></category>
		<category><![CDATA[openhouseproject]]></category>

		<guid isPermaLink="false">http://www.theopenhouseproject.com/?p=375</guid>
		<description><![CDATA[I&#8217;m ecstatic. All right, so this all goes back to late 2006, a bunch of people sitting at their computers writing some emails about what Congress should do with data. I distinctly remember Dan Newman and I both thinking that the Library of Congress should make its raw legislative database (that powers THOMAS) available directly [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m ecstatic. All right, so this all goes back to late 2006, a bunch of people sitting at their computers writing some emails about what Congress <em>should</em> do with data. I distinctly remember Dan Newman and I both thinking that the Library of Congress should make its raw legislative database (that powers THOMAS) available directly to us to build applications off of, rather than the screen-scraping that I was doing. One thing leads to another, the Open House Project, <a href="http://http://www.theopenhouseproject.com/the-open-house-project-report/3-legislation-database/">the legislative databases section of the OHP report</a> in May 2007 (which I principally wrote), then later that year with the support of Rep. Mike Honda, in November <a href="http://www.govexec.com/story_page.cfm?filepath=/dailyfed/0108/012308tdpm1.htm">CHA asked the LOC to look into the issue</a> (<a href="http://www.theopenhouseproject.com/2008/02/01/congressman-honda-on-the-open-house-cause/">more</a>), and then in the last month his office submitted text for the House Legislative Branch Appropriations Report, which made it through subcommittee markup of the bill, to give this request a little more teeth (like, ehm, the force of law).</p>
<p>His office also submitted a second paragraph which I&#8217;ll get to below.<br />
<span id="more-375"></span><br />
Rob Pierson in Honda&#8217;s office writes on the OHP mail list:</p>
<blockquote><p>I&#8217;ve mentioned on the list some of the steps my boss (Congressman Honda) has been taking, with counsel from many folks on this list, to guide Congressional policies on the path towards effectively leveraging technology to open up access to the public. There are actually quite a few other staffers who also follow this list, and we&#8217;ve certainly learned quite a bit from the conversations posted here, so I wanted to throw out a quick note of appreciation to everyone who has been contributing to the discussions.</p>
<p>With guidance from the conversations on this list (and the OHP report), Congressman Honda recently submitted the following sections into the House Legislative Branch Appropriations Report. The following (or possibly very similar versions) were included in the Leg Branch Subcommittee markup of the bill:</p>
<p>*Public Access to Legislative Data (as submitted)*</p>
<p>The Committee believes that the public should have improved access to legislative information through more advanced search capabilities such as those available through the Library of Congress&#8217; Legislative<br />
Information System. The Committee also supports enhancing public access to legislative documents, bill status, summary information, and other legislative data, through more direct methods such as bulk data downloads and other means of no-charge digital access to legislative databases. The Committee requests that the Library and Government Printing Office report on the progress towards these goals within 90 days of enactment of this Act.
</p></blockquote>
<p>Note that the GPO has also been stuck in there. More more on that, <a href="http://www.theopenhouseproject.com/2007/11/14/better-late-than-never-gpo-responds-to-my-question-1-year-later/">see this post</a>.</p>
<p>The second paragraph that Honda&#8217;s office submitted John noted was parallel to the final chapter of our report, <a href="http://www.theopenhouseproject.com/the-open-house-project-report/12-coordinating-web-standards/">Coordinating Web Standards</a>. (Hmm, I principally wrote that chapter too&#8230;.)</p>
<blockquote><p>*Congressional Technology Coordination (as submitted)*</p>
<p>The Committee recognizes the need for the House of Representatives to develop a strategic and coordinated plan that will prepare for the future technology needs of the institution.  A 2006 report commissioned by the Chief Administrative Officer and the Committee on House Administration, entitled /Strategic Technology Road Map for the Ten Year Vision of Technology in the House of Representatives/ provided a suggested structure for an IT evaluation and decision-making process.<br />
No later than 90 days after the enactment of this Act, the Committee requests that the Chief Administrative Officer, the Clerk, and the Sergeant at Arms report to the Committee of their efforts to develop House-wide data-sharing standards; implement standard legislative document formats; address the increasing resource challenges of Member offices; and identify disparate systems throughout the institution, which prevent it from taking advantage of economies of scale.</p></blockquote>
<p>This is of course fantastic news for anyone that supports transparency, which is, well, everyone in their right mind, I think. So thanks to Congressman Honda for taking the initiative on this!</p>
<p>(Other links: <a href="http://www.theopenhouseproject.com/2007/06/27/house-leg-branch-appropriations-review/">last year&#8217;s leg branch appropriations blog post</a>, <a href="http://www.theopenhouseproject.com/2007/01/25/mash-ups-for-government-transparency/">my first or one of my first posts here about structured data</a>)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.theopenhouseproject.com/2008/07/14/legislative-databases-recommendation-makes-it-to-house-leg-branch-appropriations-markup/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Eating well on Independence Day</title>
		<link>http://www.theopenhouseproject.com/2008/07/04/eating-well-on-independence-day/</link>
		<comments>http://www.theopenhouseproject.com/2008/07/04/eating-well-on-independence-day/#comments</comments>
		<pubDate>Fri, 04 Jul 2008 12:54:03 +0000</pubDate>
		<dc:creator>Joshua Tauberer</dc:creator>
				<category><![CDATA[Structured Data]]></category>
		<category><![CDATA[data visualization]]></category>
		<category><![CDATA[government websites]]></category>

		<guid isPermaLink="false">http://www.theopenhouseproject.com/?p=370</guid>
		<description><![CDATA[Happy 4th of July. I thought I&#8217;d share an interesting website that has nothing to do with government transparency but is about good use of government data. The USDA maintains a big database of nutrition facts about foods. You can download the database and build applications based on it, like a menu planner. This is [...]]]></description>
			<content:encoded><![CDATA[<p>Happy 4th of July. I thought I&#8217;d share an interesting website that has nothing to do with government transparency but is about good use of government data. The USDA maintains a <a href="http://www.ars.usda.gov/main/site_main.htm?modecode=12354500">big database of nutrition facts about foods</a>. You can download the database and build applications based on it, like a menu planner. This is something I&#8217;ve been thinking about in the back of my head for a while since after getting into the whole <a href="http://www.michaelpollan.com/">Michael Pollan food mind-set</a> I&#8217;ve wondered whether one can make a healthy diet just by balancing various food groups (as I try to do with limited success), or whether (contra Pollan&#8217;s overall message, though maybe not in the details) it would be useful to start adding up the numbers of various nutrients to see how my meals match up with recommended values. How should I know, for instance, if I&#8217;ve managed to exclude an important vitamin in my particular selection of foods that I eat week after week, right?</p>
<p>The database is great itself, but the cooler website is <a href="http://www.mypyramidtracker.gov/planner/">MyPyramid Menu Planner (mypyramidtracker.gov)</a> (also out of the USDA). You can enter a typical daily roster of what you eat (with a nice sound effect) and it will tell you how it stacks up for a recommended diet for your age (or for me, how to gain weight to a recommended amount for my age). It feels a little over-simplified, but the simplicity keeps me on the site. I find, not surprisingly, that I probably eat about half of the recommended calories and clearly not enough grain or fruit. Well, I knew this in the abstract, but quantifying it helps direct me to fixing the problem.</p>
<p>I&#8217;m sure there are other websites that do similar things, but it&#8217;s nice to find a case where the government has both published a comprehensive (well structured, well documented) database and has also built a really nice interface for the data. And on a topic that is really very important to daily life, too.</p>
<p>And with that, I think I will take the rest of the weekend off from civics!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.theopenhouseproject.com/2008/07/04/eating-well-on-independence-day/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Communicating with Congress: Recommendations for Improving the Democratic Dialogue</title>
		<link>http://www.theopenhouseproject.com/2008/06/21/communicating-with-congress-recommendations-for-improving-the-democratic-dialogue/</link>
		<comments>http://www.theopenhouseproject.com/2008/06/21/communicating-with-congress-recommendations-for-improving-the-democratic-dialogue/#comments</comments>
		<pubDate>Sat, 21 Jun 2008 14:06:23 +0000</pubDate>
		<dc:creator>Joshua Tauberer</dc:creator>
				<category><![CDATA[CMF]]></category>
		<category><![CDATA[Structured Data]]></category>
		<category><![CDATA[advocacy]]></category>
		<category><![CDATA[communication]]></category>
		<category><![CDATA[openhouseproject]]></category>

		<guid isPermaLink="false">http://www.theopenhouseproject.com/?p=367</guid>
		<description><![CDATA[CMF published an interim report Communicating with Congress: Recommendations for Improving the Democratic Dialogue . I had one of those &#8220;someone got it right&#8221; moments reading the report. Following what seemed to be tireless work by Daniel Bennett and Rob Pierson (Rep. Mike Honda&#8217;s office) and CMF staff going back a long time, and a [...]]]></description>
			<content:encoded><![CDATA[<p>CMF published an interim report <a href="http://www.cmfweb.org/index.php?option=com_content&#038;task=view&#038;id=256">Communicating with Congress: Recommendations for Improving the Democratic Dialogue </a>. I had one of those &#8220;someone got it right&#8221; moments reading the report. Following what seemed to be tireless work by Daniel Bennett and Rob Pierson (<a href="http://honda.house.gov/">Rep. Mike Honda</a>&#8217;s office) and CMF staff going back a long time, and a conference in October that I really enjoyed, they recommend adding metadata to constituent communication to reliably indicate who the sender is, what the issue is, and what advocacy organization helped the sender send the message.</p>
<p>The recommendation serves to help congressional staff manage incoming communication. It&#8217;s a method of triage on the one hand, and a tool to help tally communications by position on the other. Critical as this may be, I find tallying to be incredibly superficial &#8212; and it really reveals, I think, that the world of communicating with Congress has become extremely narrow. (But I&#8217;ve written on that before.)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.theopenhouseproject.com/2008/06/21/communicating-with-congress-recommendations-for-improving-the-democratic-dialogue/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Webcontent.gov updates publishing-data recommendations</title>
		<link>http://www.theopenhouseproject.com/2008/06/12/webcontentgov-updates-publishing-data-recommendations/</link>
		<comments>http://www.theopenhouseproject.com/2008/06/12/webcontentgov-updates-publishing-data-recommendations/#comments</comments>
		<pubDate>Thu, 12 Jun 2008 17:25:30 +0000</pubDate>
		<dc:creator>Joshua Tauberer</dc:creator>
				<category><![CDATA[OMB]]></category>
		<category><![CDATA[Structured Data]]></category>
		<category><![CDATA[egov]]></category>
		<category><![CDATA[executive]]></category>
		<category><![CDATA[government websites]]></category>
		<category><![CDATA[openhouseproject]]></category>

		<guid isPermaLink="false">http://www.theopenhouseproject.com/?p=360</guid>
		<description><![CDATA[I was very lucky this week to have stumbled into the middle of an update being done to a page maintained by the U.S.&#8217;s GSA at webcontent.gov on best practices for making data available, for executive branch agencies. The site serves as a collection of best practices and uses OMB policies
as a starting point. I [...]]]></description>
			<content:encoded><![CDATA[<p>I was very lucky this week to have stumbled into the middle of an update being done to a page maintained by the U.S.&#8217;s GSA at <a href="http://webcontent.gov">webcontent.gov</a> on best practices for making data available, for executive branch agencies. The site serves as a collection of best practices and uses OMB policies<br />
as a starting point. I think it had been last updated in 2005.</p>
<p>The page updated is <a href="http://www.usa.gov/webcontent/usability/accessibility/access_to_data.shtml">here</a>.</p>
<p>The updates were a combination of suggestions from Scott Horvath and Jeremy Fee at the USGS, Kol Peterson from EPA, and me, and really big thanks go to Scott and Kol for reaching out to others for input on Monday and getting the feedback back to Bev Godwin at GSA who runs webcontent.gov who published the changes only a few days later. Scott also notes that additional suggestions could still be considered (his email address is at the bottom of that page).</p>
<p>In making my suggestions, I turned to the <a href="http://www.opengovdata.org">Open Government Data Principles</a> and tried to squeeze in as much as I could without overloading the document, and I drew from ideas that came up in the preparation of the Open House Project report. Some of the changes made were:</p>
<ul>
<li>It now provides examples of data as being documents, audio/visual recordings, and databases.</li>
<li>It now says to support &#8220;the widest practical range of public uses of<br />
the data&#8221;. It had formerly suggested supporting the &#8220;intended&#8221; use of<br />
the website by visitors.</li>
<li>It notes the benefit of providing data: &#8220;New uses of your agency&#8217;s<br />
data may become a valuable public resource that would be out of the<br />
scope of your own website, such as helping to keep the public informed<br />
about the work of your agency and supporting civic education and<br />
participation.&#8221;</li>
<li>There is a new paragraph that I might be misunderstanding but which<br />
seems to make a suggestion along the lines of the recent &#8220;Invisible<br />
Hand&#8221; paper about the agency&#8217;s website getting the data the same way the<br />
public does: &#8220;Providing a uniform method to access raw data can also be<br />
the first step in internal development, accomplishing both goals at<br />
once. When a uniform method to access data is available, developers and<br />
webÃ¢â‚¬â€œservices can focus on data presentation.&#8221;</li>
<li>It notes that the availability of bulk downloads of data is something<br />
to consider when building data access.</li>
<li>It notes some disadvantages of using proprietary formats.</li>
<li>It recommends that if a proprietary format is needed, a<br />
non-proprietary format should be used in addition.</li>
<li> It adds a benchmark to test for success: &#8220;One benchmark for<br />
determining whether data is made sufficiently available is whether the<br />
public has all of the data needed to replicate any searching, sorting,<br />
and display functionality provided on the agency&#8217;s own website.&#8221;</li>
<li>It notes that consulting the public in the development of data access<br />
seems to be entailed from OMB policy: &#8220;When choosing data formats and<br />
distribution methods, keep in mind that your agency&#8217;s visitors are the<br />
best judges of their own needs. Agencies must &#8220;establish and maintain<br />
communications with members of the public and with State and local<br />
governments to ensure your agency creates information dissemination<br />
products meeting their respective needs&#8221; (OMB Policies for Federal<br />
Public Websites #4A).&#8221;</li>
</ul>
<p>We have a real success story here.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.theopenhouseproject.com/2008/06/12/webcontentgov-updates-publishing-data-recommendations/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Government Data and the Invisible Hand</title>
		<link>http://www.theopenhouseproject.com/2008/06/06/government-data-and-the-invisible-hand/</link>
		<comments>http://www.theopenhouseproject.com/2008/06/06/government-data-and-the-invisible-hand/#comments</comments>
		<pubDate>Fri, 06 Jun 2008 10:58:44 +0000</pubDate>
		<dc:creator>Joshua Tauberer</dc:creator>
				<category><![CDATA[Structured Data]]></category>
		<category><![CDATA[government websites]]></category>

		<guid isPermaLink="false">http://www.theopenhouseproject.com/?p=352</guid>
		<description><![CDATA[The guys over at Princeton&#8217;s new Center for Information Technology Policy wrote a really great paper for the Yale Journal of Law &#038; Technology on the role data should have, compared to websites, in government. It articulates a point that I think many of us subconsciously have had in mind:
&#8220;The new administration should specify that [...]]]></description>
			<content:encoded><![CDATA[<p>The guys over at Princeton&#8217;s new Center for Information Technology Policy wrote a really great <a href="http://ssrn.com/abstract=1138083">paper</a> for the Yale Journal of Law &#038; Technology on the role data should have, compared to websites, in government. It articulates a point that I think many of us subconsciously have had in mind:</p>
<blockquote><p>&#8220;The new administration should specify that the federal governmentÃ¢â‚¬â„¢s primary objective as an online publisher is to provide data that is easy for others to reuse, rather than to help citizens use the data in one particular way or another.&#8221;</p></blockquote>
<p>And they suggest an interesting way to push that forward:</p>
<blockquote><p>
&#8220;The policy route to realizing this principle is to require that federal government websites retrieve the underlying data using the same infrastructure that they have made available to the public. Such a rule incentivizes government bodies to keep this infrastructure in good working order, and ensures that private parties will have no less an opportunity to use public data than the government itself does. The rule prevents the situation, sadly typical of government websites today, in which governmental interest in presenting data in a particular fashion distracts from, and thereby impedes, the provision of data to users for their own purposes.&#8221;</p></blockquote>
<p>I think this is a worthwhile addition to the <a href="http://www.opengovdata.org">opengovdata</a> and <a href="http://www.publicmarkup.org">publicmarkup.org</a> policy documents &#8212; if not as a direct recommendation (because I think it may be too much to ask for in a grand form) then noted as a long-term goal or (in terms of the second paragraph I quoted) as a benchmark, a concrete way to tell whether data is open.</p>
<p>The full citation is: Robinson, David, Yu, Harlan, Zeller, William P and Felten, Edward W, &#8220;<a href="http://ssrn.com/abstract=1138083">Government Data and the Invisible Hand</a>&#8221; (2008). Yale Journal of Law &#038; Technology, Vol. 11, 2008</p>
]]></content:encoded>
			<wfw:commentRss>http://www.theopenhouseproject.com/2008/06/06/government-data-and-the-invisible-hand/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>House Rules Offers Updates</title>
		<link>http://www.theopenhouseproject.com/2008/02/21/house-rules-offers-updates/</link>
		<comments>http://www.theopenhouseproject.com/2008/02/21/house-rules-offers-updates/#comments</comments>
		<pubDate>Fri, 22 Feb 2008 03:44:38 +0000</pubDate>
		<dc:creator>John Wonderlich</dc:creator>
				<category><![CDATA[OpenHouse]]></category>
		<category><![CDATA[Structured Data]]></category>
		<category><![CDATA[rules]]></category>

		<guid isPermaLink="false">http://www.theopenhouseproject.com/2008/02/21/house-rules-offers-updates/</guid>
		<description><![CDATA[

Closely watching legislation progress through the House can be difficult, especially given the discretion the Speaker and the Rules Committee have in scheduling and controlling the floor.
The House Rules Committee is making legislation a little easier to follow online, by voluntarily posting updates of special rules, as they are approved, to their website in a [...]]]></description>
			<content:encoded><![CDATA[<p><a title="rules rss.jpg by johnarthurw, on Flickr" href="http://www.flickr.com/photos/johnarthurw/2283195272/"><img width="394" height="480" alt="rules rss.jpg" src="http://farm3.static.flickr.com/2207/2283195272_10fd78b011_o.jpg" /></a><a title="rules rss.jpg by johnarthurw, on Flickr" href="http://www.flickr.com/photos/johnarthurw/2283195272/"><br />
</a></p>
<p>Closely watching legislation progress through the House can be difficult, especially given the discretion the Speaker and the Rules Committee have in scheduling and controlling the floor.</p>
<p>The <a href="http://www.rules.house.gov/">House Rules Committee</a> is making legislation a little easier to follow online, by voluntarily posting updates of special rules, as they are approved, to their website <a href="http://www.rules.house.gov/rss.aspx?GroupTypeID=2">in a structured format</a>.  This means that you can receive updates on newly approved special rules that structure floor consideration for impending bills.  They&#8217;re also offering a <a href="http://www.rules.house.gov/rss.aspx?GroupTypeID=9">feed of amendments</a> submitted to the committee.</p>
<p>The only way to get faster updates would be to attend the committee hearing.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.theopenhouseproject.com/2008/02/21/house-rules-offers-updates/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Congressman Honda on the Open House cause</title>
		<link>http://www.theopenhouseproject.com/2008/02/01/congressman-honda-on-the-open-house-cause/</link>
		<comments>http://www.theopenhouseproject.com/2008/02/01/congressman-honda-on-the-open-house-cause/#comments</comments>
		<pubDate>Fri, 01 Feb 2008 15:54:53 +0000</pubDate>
		<dc:creator>Joshua Tauberer</dc:creator>
				<category><![CDATA[OpenHouse]]></category>
		<category><![CDATA[Structured Data]]></category>
		<category><![CDATA[web 2.0]]></category>

		<guid isPermaLink="false">http://www.theopenhouseproject.com/2008/02/01/congressman-honda-on-the-open-house-cause/</guid>
		<description><![CDATA[Congressman Mike Honda (D, CA-15) is one of this project&#8217;s heroes in the House. In fact, I can&#8217;t recall any other congressman picking out a recommendation of the Open House Project and saying publicly that it&#8217;s a good idea, and referencing this project. In November, he took real action to further transparency in Congress by [...]]]></description>
			<content:encoded><![CDATA[<p>Congressman Mike Honda (D, CA-15) is one of this project&#8217;s heroes in the House. In fact, I can&#8217;t recall any other congressman picking out a recommendation of the Open House Project and saying publicly that it&#8217;s a good idea, and referencing this project. In November, he took real action to further transparency in Congress by supporting the Committee on House Administration in <a href="http://groups.google.com/group/openhouseproject/browse_thread/thread/0ea713bcd6adafc8/a4c81096a690e44e?#a4c81096a690e44e">asking the Library of Congress</a> to look into making the legislative database behind THOMAS publicly available to other websites to reuse. (This is of course the recommendation that I most care about.)</p>
<p>In <a href="http://www.govexec.com/story_page.cfm?filepath=/dailyfed/0108/012308tdpm1.htm">an article last week</a> in the National Journal&#8217;s Technology Daily, Honda compared the benefits of an open legislative database to what the world has gotten out of wikis and open source software like Linux.</p>
<p>Then on Wednesday, Honda <a href="http://mikehonda.blogspot.com/2008/01/congress-20.html">blogged</a> (a perfect medium to express the sentiment) about the issue, citing the Open House project&#8217;s report. He captures the issue really well:</p>
<blockquote><p>
I have been working on an initiative to make Congressional legislative information more accessible to the public. I believe that public information should be provided in a format that takes advantage of the innovative technologies that are revolutionizing the Internet, sometimes known as Web 2.0.</p>
<p>Making CongressÃ¢â‚¬â„¢ legislative database open to the public Ã¢â‚¬Å“would enable independent Web sites to use information in new and creative ways, including educating the public about Congress and providing citizens with customized views of its proceedings,Ã¢â‚¬? according to a report from the Open House Project, an organization supporting this proposal.</p>
<p>Offering legislative information in a way that other websites can reuse could lead to revolutionary changes in the way our government functions, eventually allowing Congress to better tap into the knowledge and wisdom of the American people.</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://www.theopenhouseproject.com/2008/02/01/congressman-honda-on-the-open-house-cause/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Procedural Uncertainty &amp; Normalization</title>
		<link>http://www.theopenhouseproject.com/2008/01/09/procedural-uncertainty-normalization/</link>
		<comments>http://www.theopenhouseproject.com/2008/01/09/procedural-uncertainty-normalization/#comments</comments>
		<pubDate>Thu, 10 Jan 2008 02:41:23 +0000</pubDate>
		<dc:creator>Joshua Tauberer</dc:creator>
				<category><![CDATA[House of Representatives]]></category>
		<category><![CDATA[OpenHouse]]></category>
		<category><![CDATA[Structured Data]]></category>
		<category><![CDATA[executive]]></category>
		<category><![CDATA[govtrack]]></category>
		<category><![CDATA[jurisdiction]]></category>
		<category><![CDATA[library of congress]]></category>
		<category><![CDATA[procedure]]></category>

		<guid isPermaLink="false">http://www.theopenhouseproject.com/2008/01/09/procedural-uncertainty-normalization/</guid>
		<description><![CDATA[I always find it interesting how although our government is run by fairly strict procedural rules that have been written out in various places, starting with the constitution and ending somewhere past the horizon, sometimes it&#8217;s just impossible to locate exactly at what point in the procedural game &#8220;reality&#8221; is. For instance, the constitution outlines [...]]]></description>
			<content:encoded><![CDATA[<p>I always find it interesting how although our government is run by fairly strict procedural rules that have been written out in various places, starting with the constitution and ending somewhere past the horizon, sometimes it&#8217;s just impossible to locate exactly at what point in the procedural game &#8220;reality&#8221; is. For instance, the constitution outlines how a bill can become a law. But, at what point is a bill considered vetoed? If the president is signing the veto signature but misspells &#8220;veto&#8221; (or whatever he writes in this case, I have no idea), or is taken to the hospital before he writes the &#8220;o&#8221;, is the bill vetoed, or is it still awaiting a signature?</p>
<p>The reason this is interesting to me is that we like to capture reality in data. The Library of Congress and GovTrack both systematize (or in computer jargon &#8220;normalize&#8221;) the bill-becomes-a-law process. At every point in the game, a bill, in our data formats, is either in-progress, enacted, dead, etc. It must be in one of these states. After all, the constitution outlines exactly what states a bill can be in, so any bill *must* be in one of these states.</p>
<p><span id="more-231"></span></p>
<p>But if we&#8217;re not sure what state a bill is in, what state do we put it in in our data? There&#8217;s also the more important question- What do the lawmakers do if they disagree about what state a bill is in? (Actually, I would prefer to phrase it as &#8220;what state <b>they</b> are in&#8221;, but that&#8217;s another story.) Wikipedia <a href="http://en.wikipedia.org/wiki/Pocket_veto">describes</a> (what the editors of the page claim is) a current debacle over <a href="http://www.govtrack.us/congress/bill.xpd?bill=h110-1585">H.R. 1585: National Defense Authorization Act FY 2008</a>:</p>
<blockquote><p>In December of 2007, President George W. Bush pushed the pocket veto into murky waters by claiming that he had pocket vetoed H.R. 1585, the &#8220;National Defense Authorization Act for Fiscal Year 2008,&#8221; even though the House of Representatives had designated agents to receive presidential messages before adjourning. The bill had been previously passed by veto-proof majorities in both the House and the Senate [JT: and thus a traditional veto would have been futile].</p></blockquote>
<p>So was the bill (pocket) vetoed or not? Is the bill still in-progress? Assuming it was not pocket vetoed, after 10 legislative days without a traditional veto it becomes law, and us citizens would hate to be on that 11th day without either resolution on the pocket veto matter or a traditional veto, because then we as a country will not know whether this bill has become law. (Another question: How might the Supreme Court assert jurisdiction over this question.)</p>
<p>But back to the data. At one point, some time after Dec. 28, someone in the House responsible for updating the bill status information shown on THOMAS entered a new status line:</p>
<blockquote><p>Dec 28, 2007: Pocket Vetoed by President.</p></blockquote>
<p>GovTrack picked up on the change and shows that status currently, much to the confusion of several people emailing me about it. Looking back at THOMAS, it seems like someone realized that that was apparently quite a constitutional (if not political) claim and retracted that update, because <a href="http://thomas.loc.gov/cgi-bin/bdquery/z?d110:HR01585:@@@X">it not longer says that</a>. </p>
<p>In many cases citizens complain when the government takes things back, hiding information previously made public. That&#8217;s definitely not what I am getting at here. THOMAS is forced to show *something*, and when it doubt&#8230; well, what can you do but roll back history until we figure out what the next legislative step actually *was*.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.theopenhouseproject.com/2008/01/09/procedural-uncertainty-normalization/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

