<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Webdogs 3.0 &#187; planning</title>
	<atom:link href="http://webdogs.org/tag/planning/feed/" rel="self" type="application/rss+xml" />
	<link>http://webdogs.org</link>
	<description>site sentient since 2006</description>
	<lastBuildDate>Tue, 15 May 2012 00:10:53 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Selecting GSA targets &#8211; Part Three: Quantification, Revision and Finalization</title>
		<link>http://webdogs.org/2009/03/22/selecting-gsa-targets-part-three-quantification-revision-and-finalization/</link>
		<comments>http://webdogs.org/2009/03/22/selecting-gsa-targets-part-three-quantification-revision-and-finalization/#comments</comments>
		<pubDate>Mon, 23 Mar 2009 04:03:26 +0000</pubDate>
		<dc:creator>Brian Lawlor</dc:creator>
				<category><![CDATA[planning]]></category>
		<category><![CDATA[specs]]></category>
		<category><![CDATA[gsa]]></category>
		<category><![CDATA[pika]]></category>
		<category><![CDATA[reality]]></category>
		<category><![CDATA[tfp]]></category>

		<guid isPermaLink="false">http://www.webdogs.org/findability/?p=729</guid>
		<description><![CDATA[There is a great deal of work proceeding behind the scenes and several key project elements are converging as we move toward finalizing this public project. Among other things, we have been working through modest but practical solutions for better placement and targeting of our existing 300,000+ repository documents, while solidifying all the additional Google [...]]]></description>
			<content:encoded><![CDATA[<p>There is a great deal of work proceeding behind the scenes and several key project elements are converging as we move toward finalizing this public project. Among other things, we have been working through modest but practical solutions for better placement and targeting of our existing 300,000+ repository documents, while solidifying all the additional Google Search Appliance (GSA) targets in our enterprise search sights, described in <a href="http://www.webdogs.org/2008/11/10/selecting-gsa-targets-part-two-the-practical-realities/">Part Two</a>. At the same time, we are in our own March (and April and May) Madness as we mount a rapid-fire round of trainings for each of our eight remote offices (spread out over 50,000 square miles of Northern California) on their new role in making Google Enterprise search work for them, which is to say for all of us. And as mentioned in earlier posts, we are working every bit as earnestly on our latest in-house build of the <a href="http://pikasoftware.com/">Pika CMS</a>.</p>
<p>The infusion of new, future content into the <a href="http://www.webdogs.org/2008/09/30/tfp-taxonomy-part-two-the-practice/">simplified structural taxonomy</a> we created is a separate challenge we will be posting about later. Dealing with our existing files is more immediate, more concrete. Groking those files has been one of the more interesting, <a href="http://www.webdogs.org/2008/09/30/tfp-taxonomy-part-three-the-anecdotes/">at times hilarious</a> parts of this project.</p>
<p>For those legal services programs interested in how the existing files from our eight local offices break out, here are the percentages for the seven most common &#8220;document&#8221; types:</p>
<ul style="margin-left: 50px;">
<li><strong>67%</strong> &#8211; WordPerfect (WPD)</li>
<li><strong>18%</strong> &#8211; Word (DOC)</li>
<li><strong>9%</strong> &#8211; Portable Document Format (PDF)</li>
<li><strong>3%</strong> &#8211; Excel (XLS)</li>
<li><strong>1%</strong> &#8211; Text (TXT)</li>
<li><strong>0.9%</strong> &#8211; Rich Text Format (RTF) </li>
<li><strong>0.6%</strong> &#8211; PowerPoint (PPT)</li>
</ul>
<p>(Discretion prohibits us from detailing the other file flotsam discovered on local office servers. That said, allow us to observe that some within our organization have extraordinarily good taste in photos taken by National Geographic, and not such good taste in music.)</p>
<p>We are totally on track for targeting most of our planned GSA targets: The existing office archive files listed above have long been targeted (although we still have a lot of work left to fit them into our structural taxonomy); over the last several months we have worked very hard to refresh and update (and remove, as warranted) the targeted content at LSNC&#8217;s various public websites; and we are very pleased with the quality of the <a href="http://www.webdogs.org/2009/02/26/comparing-google-sites-and-gsa-search-results-with-release-52-in-place/">GSA results we are getting out of Google Sites</a>.</p>
<p>This is all good news. In addition, we are putting in place a few more content channels: Targeting the content in our organization&#8217;s seven private Google discussion groups, and a program-wide canvass for select hard-copy training resource materials for <a href="http://www.webdogs.org/2008/11/12/converting-hard-copy-documents-for-addition-to-the-shared-repository/">digital conversion and addition to the shared repository</a>. It&#8217;s all good.</p>
<p>We have had one major disappointment: We discovered that there are significant, unanticipated technical challenges unique to the Pika CMS that thus far have prevented effective use of the GSA to target Pika content. The problem is not the GSA itself or configuring the GSA to target Pika. The GSA by design performs wholly benign, non-destructive crawls as it indexes targeted records. We did a huge amount of target testing and <a href="http://en.wikipedia.org/wiki/Search_engine_results_page">SERP</a> evaluation, and we were very pleased &#8212; actually, thrilled is a better word &#8212; with the results we were getting from Pika. The unanticipated problem is that the current version of Pika is not well optimized for use as an enterprise search target. There are code anomalies in Pika that, among other things, cause it to auto-generate new case intakes and case records when it is crawled by the GSA.</p>
<p>After an assessment by Pika Software, it is now apparent it will take something in the neighborhood of 200 hours of work to make Pika more receptive, shall we say, to an external crawl by the GSA. (Ouch!) So for now, we have to put that part of the project to the side. File under: Lessons Learned.</p>
<h2  class="related_post_title">Other posts of possible interest...</h2><ul class="related_post"><li><a href="http://webdogs.org/2008/11/10/selecting-gsa-targets-part-two-the-practical-realities/" title="Selecting GSA targets &#8211; Part Two: The Practical Realities">Selecting GSA targets &#8211; Part Two: The Practical Realities</a></li><li><a href="http://webdogs.org/2009/09/17/what-the-lsnc-shared-portal-now-looks-like/" title="Revised: What the LSNC Shared Portal now looks like">Revised: What the LSNC Shared Portal now looks like</a></li><li><a href="http://webdogs.org/2009/08/03/tig-final-evaluation-report-for-the-findability-project/" title="TIG final evaluation report for The Findability Project">TIG final evaluation report for The Findability Project</a></li><li><a href="http://webdogs.org/2009/03/26/metadata-schmetadata-relevance-and-reality/" title="Metadata Schmetadata, Relevance and Reality">Metadata Schmetadata, Relevance and Reality</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://webdogs.org/2009/03/22/selecting-gsa-targets-part-three-quantification-revision-and-finalization/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Presumptive Shareability</title>
		<link>http://webdogs.org/2008/12/18/presumptive-shareability/</link>
		<comments>http://webdogs.org/2008/12/18/presumptive-shareability/#comments</comments>
		<pubDate>Thu, 18 Dec 2008 22:39:00 +0000</pubDate>
		<dc:creator>Brian Lawlor</dc:creator>
				<category><![CDATA[concepts]]></category>
		<category><![CDATA[gsa]]></category>
		<category><![CDATA[planning]]></category>
		<category><![CDATA[findability]]></category>
		<category><![CDATA[shareability]]></category>
		<category><![CDATA[tfp]]></category>

		<guid isPermaLink="false">http://www.webdogs.org/findability/?p=389</guid>
		<description><![CDATA[After the first of the year, we&#8217;ll be cranking up as we complete porting of our existing target documents into our new taxonomic organization, resolve some filtering and usability touches we want to integrate into our default GSA front end, primp and polish the layout and presentation of the front end, implement a few basic [...]]]></description>
			<content:encoded><![CDATA[<p>After the first of the year, we&#8217;ll be cranking up as we complete porting of our existing target documents into our new <a href="http://www.webdogs.org/2008/09/30/tfp-taxonomy-part-two-the-practice/">taxonomic organization</a>, resolve some filtering and usability touches we want to integrate into our default <a href="http://code.google.com/apis/searchappliance/documentation/50/help_gsa/serve_frontends.html">GSA front end</a>, primp and polish the layout and presentation of the front end, implement a few basic <a href="http://www.google.com/enterprise/gsa/onebox.html">OneBox</a> modules, and set in motion what we&#8217;re now referring to as the &#8220;Rolling Thunder Roadshow&#8221; to all our eight office locations.</p>
<p>The RTR will be our way to recognize and promote among all our staff the changes in how documents and other files are made easily and intuitively findable, and given a new level of access and usability throughout our non-profit organization. After all, that is the core purpose of enterprise search. And a key element of all this is changing deeply rooted individual notions or assumptions about what can or should be &#8220;shareable.&#8221;</p>
<p>In working on this project within a non-profit environment, we have learned that most employees have an inclination to undershare, not overshare. Not because they are selfish or secretive; rather, because the type of transparent sharing that enterprise search makes possible is foreign to most of them. It is familiar to them to be asked to provide a document to others on request in person, by phone or by email. It is foreign to them to decide in advance that a document they created or have received from someone else should be transparent to the rest of the entire organization. The concepts of creation and possession are severed from the concept of findability.</p>
<p>To be sure, the increasing use of collaborative web-based document tools within our organization &#8212; principally our adoption two years ago of <a href="http://www.google.com/apps/">Google Apps</a> &#8212; has helped us on this journey. Most staff at this point are familiar with the concept, if not the practice in their individual work, of creating or editing or uploading documents that can be &#8220;shared&#8221; from a common web location. They get that, even if they don&#8217;t do it themselves, because increasingly others demand they do so&#8230; when they get a &#8220;share&#8221; message email from Google Docs about a document someone created or edited there; when they get an email with a link to something someone else posted in our domain&#8217;s Google Sites; or when they get a message to fill out a Google Docs form for, well, whatever.</p>
<p>As we prepare for the RTR, the team working on this project have brainstormed about what we can say or demonstrate to the staff in each office, to prompt them to rethink (OK, in some cases just think) what types of documents should be shared with others by adding them to the new document repositories.</p>
<p>We now refer to this as &#8220;presumptive shareability.&#8221; In particular situations, it may not be appropriate to make the document or file transparent through enterprise search, but in most cases it will be because all are situations where the document or file has served a shareable purpose, i.e, use by more than one person or re-use by one or more persons.</p>
<p>Among the situations we think should trigger staff to think to add the document or file in question to the shared repository are the following:</p>
<ul class="content">
<li>An attachment to an email message you send or forward to someone else.</li>
<li>You request or receive a file as an email attachment from someone within the organization.</li>
<li>You receive a non-confidential file attachment from someone outside the organization.</li>
<li>Every time you re-use a document or form as part of your work.</li>
<li>You learn that the PowerPoint (or other presentation format) for a training or conference event you attended is now available for viewing or downloading.</li>
<li>You lug home substantive hard-copy handouts distributed from a training or conference.</li>
<li>Can you say, &#8220;presentation&#8221; and/or &#8220;portable&#8221;? Whatever it is, if it is a PDF or PPT file it is presumptively shareable.</li>
<li>If it is the &#8220;final&#8221; version of a case-related pleading, memorandum, exhibit or correspondence and you think others may find it usable, share it.</li>
<li>Usable documents you discover and think to save to your desktop as part of research on the Web, regardless of file type (PDF, DOC, XLS, etc.)</li>
<li>Similarly, when doing work-related research on the Web, anytime you think to bookmark a web page or save the page to your desktop as an HTML or TXT file.</li>
<li>&#8230; you get the drift.</li>
</ul>
<p>Shareability promotes findability. That&#8217;s our story and we&#8217;re stickin&#8217; to it.</p>
<h2  class="related_post_title">Other posts of possible interest...</h2><ul class="related_post"><li><a href="http://webdogs.org/2008/09/30/tfp-taxonomy-part-two-the-practice/" title="The Findability Project Taxonomy &#8211; Part Two: The Practice">The Findability Project Taxonomy &#8211; Part Two: The Practice</a></li><li><a href="http://webdogs.org/2009/05/17/tfp-taxonomy-part-four-revisions-to-the-projects-structural-taxonomy/" title="TFP Taxonomy &#8211; Part Four: Revisions to the project&#039;s structural taxonomy">TFP Taxonomy &#8211; Part Four: Revisions to the project&#039;s structural taxonomy</a></li><li><a href="http://webdogs.org/2008/12/04/the-search-box-as-a-findability-design-concept/" title="The search box as a findability (design) concept">The search box as a findability (design) concept</a></li><li><a href="http://webdogs.org/2008/11/10/selecting-gsa-targets-part-two-the-practical-realities/" title="Selecting GSA targets &#8211; Part Two: The Practical Realities">Selecting GSA targets &#8211; Part Two: The Practical Realities</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://webdogs.org/2008/12/18/presumptive-shareability/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Selecting GSA targets &#8211; Part Two: The Practical Realities</title>
		<link>http://webdogs.org/2008/11/10/selecting-gsa-targets-part-two-the-practical-realities/</link>
		<comments>http://webdogs.org/2008/11/10/selecting-gsa-targets-part-two-the-practical-realities/#comments</comments>
		<pubDate>Mon, 10 Nov 2008 18:52:06 +0000</pubDate>
		<dc:creator>Brian Lawlor</dc:creator>
				<category><![CDATA[planning]]></category>
		<category><![CDATA[findability]]></category>
		<category><![CDATA[gsa]]></category>
		<category><![CDATA[reality]]></category>
		<category><![CDATA[tfp]]></category>

		<guid isPermaLink="false">http://www.webdogs.org/findability/?p=286</guid>
		<description><![CDATA[In an earlier post about selecting Google Search Appliance (GSA) targets for this project, the narrative definitely edged toward the more abstract. We highlighted four principal sets of GSA targets: files on our newly created &#8220;shared document repositories&#8221;; repurposed intranet content being moved over from an MediaWiki installation on an old server; cherry-picked content available [...]]]></description>
			<content:encoded><![CDATA[<p>In an <a href="http://www.webdogs.org/2008/08/31/you-are-what-you-find-selecting-gsa-targets/">earlier post about selecting Google Search Appliance (GSA) targets</a> for this project, the narrative definitely edged toward the more abstract. We highlighted four principal sets of GSA targets: files on our newly created &#8220;shared document repositories&#8221;; repurposed intranet content <a href="http://www.webdogs.org/?p=309">being moved over</a> from an MediaWiki installation on an old server; cherry-picked content available on LSNC&#8217;s varied public websites (LSNC maintains 13 distinct public websites and subsites); and select records in <a href="http://pikasoftware.com/">Pika CMS</a>, the secured, web-based case management system used by all our advocates.</p>
<p>As far as it goes, this abstract list of GSA target sets fairly summarizes what we, as an organization, want to make transparent via enterprise search, which is to say make &#8220;findable&#8221; in ways not practicable without the GSA. This abstract list of GSA targets, however, fails to convey what we have done at non-abstract, practical level to make those targets useful to our larger search goals.</p>
<p>So, let me hit a few notes about several practical decisions we&#8217;ve made at launch as we target the GSA at real files offering real search results.</p>
<p>As described in <a href="http://www.webdogs.org/2008/08/31/you-are-what-you-find-selecting-gsa-targets/">Part One</a>, when we first unpacked our GSA and aimed it, uh, somewhat aimlessly at any and every file on one of our local servers, the GSA did its job in killer fashion&#8230; and blew out our file limit. While one can proceed that way, we were always mindful that we had to sort out how to organize and structure the shared content that we seriously wanted to make searchable and findable. So, one of the first tasks we confronted on this project was to work out our thinking about &#8220;taxonomy,&#8221; resulting in the <a href="http://www.webdogs.org/2008/09/30/tfp-taxonomy-part-two-the-practice/">basic directory structures we have adopted</a>.</p>
<p>That taxonomic &#8220;organization&#8221; step was essential to this project, but completing that particular project objective doesn&#8217;t translate directly into searchable content organized in a particular way. You see, there is this pesky little detail: Real people need to actually identify the existing and/or newly created files to be included and then somehow get the files in the directories on the shared document repository that are the target of the GSA.</p>
<p>Easier said than done.</p>
<p>In our case &#8212; particularly given the limited IT and support staff resources available to us as a typical legal services field program &#8212; we had to come up with some practical approaches to move existing files from any number of different locations to the designated shared locations or document repositories. (I will discuss how we handle adding newly created files, in a later post.) Here&#8217;s what we did with our existing files to fold them into the content targeted by our GSA:</p>
<p><strong>1. Initially, include all existing &#8220;staff-specific&#8221; content, with an opt-out</strong></p>
<p>We did find ourselves on the receiving end of a lot of staff enthusiasm about this project. Truly. But it is impractical and unrealistic to expect your individual legal services advocates and other staff to comb through all their thousands of files and then move them over to a different file server location. (Maybe it <em>should</em> be realistic to expect them to do this, but in our experience it just ain&#8217;t gonna happen. No way.) But there are tons of content gold in them thar files, so we had to figure out a way to <em>initially</em> get all that good stuff in place, even if not parsed out in a taxonomic sense, so we could target it.</p>
<p>To accomplish this, we first vetted with, and got buy-in from, all our local offices to do the following:</p>
<p>On the local project file server for each local office, we created a special project &#8220;archive&#8221; directory. Then each local office manager copied each individual staff member&#8217;s files wholesale over to a user-specific directory in this so-called archive directory. Having an unequivocal &#8220;opt-out&#8221; option was important to the success of this approach. Again and again, in formal meetings and informal discussions, we reminded office staff that they could ask that any or all files to be removed from these initial archive-file targets. No questions asked. There were a few such requests, but not a lot: One advocate asked that her files not be targeted at all, so we removed all her files; two others had less than a dozen files they wanted removed as targets, so we did so. No biggie.</p>
<p>The net effect is that this makes the targeted advocate files initially non-taxonomic, but in short order you have a <em>huge</em> repository that has a (allow me to exaggerate here, for literary effect) 99% chance  of including pretty much everything the individual staff members would add if they &#8220;woulda, coulda, shoulda,&#8221; so to speak. In our case, this initially amounts to about 300,000 document files, the vast majority of which are advocate-generated files.</p>
<p>At launch, this does mean that these office-specific, bulk compilations of <em>existing</em> files added as targets include a significant number of drafts and duplicates that one would normally not include as a shared file if it were being added as a newly created file. For example, within our office culture it is not only common but actually expected that advocates <em>not</em> work in isolation on major cases. (We discourage the &#8220;lone eagle&#8221; model.) So, our early search-results testing shows that often the same file shows up in more than one target location because more than one advocate has a copy of the file in their archive.</p>
<p>It bears mentioning two other factors we kept in mind as part of this initial targeting of shared files: We double- and triple-checked with all management staff to assure nothing management-sensitive or -confidential was moved to a location where it could be targeted. Also, before we moved anything over wholesale, as described, we asked all staff to remove certain types of files that no one would reasonably expect to be part of the searchable content. Examples: Family photos, MP3 music downloads, YouTube videos, yada yada yada. Enough said.</p>
<p>We do have an approach in mind for &#8220;peeling off&#8221; these office-specific archives over time, to separate out the drafts and duplicates and place them within our taxonomic directory structures. More about that later.</p>
<p><strong id="google-sites">2. Using Google Sites as the platform for our existing intranet content</strong></p>
<p>I recall having a passing conversation with Gabrielle Hammond at last year&#8217;s TIG conference about how we were holding off on further intranet development while waiting to see how Google implements its <a href="http://www.jot.com/">JotSpot-based</a> wiki application, now known as <a href="http://www.google.com/sites/overview.html">Google Sites</a>.</p>
<p>Well, people, we now know what Google Sites is all about and we love it! For the last several years we had been using MediaWiki as the publication platform for our intranet, but we are in the process of <a href="http://www.webdogs.org/?p=309">replacing it</a> for our internal wiki needs. We are about half way through that process, which should be completed shortly after the first of the year.</p>
<p>One big bonus of moving our intranet content to Google Sites is that it is quasi-tailor made to work with both the GSA and <a href="http://www.google.com/analytics/">Google Analytics</a>. I say &#8220;quasi&#8221; because the interactions between them are good but hardly optimal at the moment. For example, only days ago <a href="http://googleenterprise.blogspot.com/2008/11/google-analytics-for-google-apps.html">Google Analytics for Google Apps</a> was rolled out, but the quality of the data we are getting so far is not so easy to get a handle on. More importantly, Google promises <a href="http://www.google.com/support/sites/bin/answer.py?answer=90921">GSA integration with Google Sites</a>, but it is still a buggy implementation. We have easily targeted test site pages within our domain&#8217;s Google Sites, but have hit a wall with getting the GSA to properly return search results on the indexed content <em>within</em> files <em>uploaded</em> to Google Sites. Turns out we are one of several organizations that have identified this problem and Google Enterprise support assures it will have a fix with its next software upgrade, in about a month or two. We (and <a href="http://www.mcplusa.com/">our GSA consultant</a>) are confident this will work in due time, but it&#8217;s one of those details we have to wait on for now.</p>
<p><strong id="public">3. Updating our public web content</strong></p>
<p>Over the last 10 years, LSNC has placed an enormous amount of its advocate content out on the public Web. But one recent example is the <a href="http://www.foodstampguide.org/">California Food Stamp Guide</a>, a prime example of public content that our advocates can search at that site, but would want to be able to search directly via our GSA shared portal. It is also one example of a content cluster that can be part of or its own <a href="http://googleenterprise.blogspot.com/2006/10/tech-tip-collection-math.html">GSA collection</a>. (&#8220;Collections are logical views of information in the index, as defined by URL patterns. This allows you, for example, to index the entire contents of your intranet, but then divide it up into logical groups of content.&#8221;)</p>
<p>Implementation of <strong>The Findability Project</strong> has prompted some public housekeeping. Our target testing of our public content, predictably, reveals that we have stuff out there that is, well&#8230; past its shelf-life, shall we say. So we are working on a systematic way to thoroughly review and clean up that public content. It is obvious but important: Current and correct public content means better search results via the GSA. (Apologies to the larger legal services community for not doing it sooner.)</p>
<p><strong>4. Targeting our case management system</strong></p>
<p>We consider our <a href="http://pikasoftware.com/">Pika case management system</a> a key, long-term GSA target. But we are not there yet. We have prioritized getting all the other targeted content organized and in position, with clear protocols in place. We also are busy reworking on our shared portal, which will integrate the GSA search functions and provide users with (hopefully) intuitive ways to filter their search results, search select content collections, and provide the users with some nice Google GSA touches like <a href="http://www.google.com/enterprise/gsa/onebox.html">OneBox</a> searches, among <a href="http://www.google.com/enterprise/gsa/features.html">other features</a>.</p>
<p>That all said, being able to target our case management system is a total no-brainer and perhaps the most practical of necessities. In a given day, there is likely nothing more common or more vital to our work for clients than the search for information within our case management system. The native search functions built into the current version 3.07 of Pika are good. But we are optimistic that we can exploit the GSA to make those searches even better. And certainly more integrated with everything else in our new enterprise search universe.</p>
<h2  class="related_post_title">Other posts of possible interest...</h2><ul class="related_post"><li><a href="http://webdogs.org/2009/09/17/what-the-lsnc-shared-portal-now-looks-like/" title="Revised: What the LSNC Shared Portal now looks like">Revised: What the LSNC Shared Portal now looks like</a></li><li><a href="http://webdogs.org/2009/03/26/metadata-schmetadata-relevance-and-reality/" title="Metadata Schmetadata, Relevance and Reality">Metadata Schmetadata, Relevance and Reality</a></li><li><a href="http://webdogs.org/2009/03/22/selecting-gsa-targets-part-three-quantification-revision-and-finalization/" title="Selecting GSA targets &#8211; Part Three: Quantification, Revision and Finalization">Selecting GSA targets &#8211; Part Three: Quantification, Revision and Finalization</a></li><li><a href="http://webdogs.org/2008/08/31/you-are-what-you-find-selecting-gsa-targets/" title="Selecting GSA targets &#8211; Part One: Four abstract targets">Selecting GSA targets &#8211; Part One: Four abstract targets</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://webdogs.org/2008/11/10/selecting-gsa-targets-part-two-the-practical-realities/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Going Forward: Document &#8220;best practices&#8221; and protocols</title>
		<link>http://webdogs.org/2008/10/23/going-forward-document-best-practices-and-protocols/</link>
		<comments>http://webdogs.org/2008/10/23/going-forward-document-best-practices-and-protocols/#comments</comments>
		<pubDate>Thu, 23 Oct 2008 17:36:14 +0000</pubDate>
		<dc:creator>Brian Lawlor</dc:creator>
				<category><![CDATA[planning]]></category>
		<category><![CDATA[findability]]></category>
		<category><![CDATA[tfp]]></category>

		<guid isPermaLink="false">http://www.webdogs.org/findability/?p=253</guid>
		<description><![CDATA[In earlier posts I have shared memoranda distributed at a recent organization-wide meeting, including an explanation of our taxonomic structures and details of various file-naming conventions adopted for this project. Attached to this post are two additional memos: Download: Best practices and protocols for GSA Documents Download: Organization of advocate-user directories As I so often [...]]]></description>
			<content:encoded><![CDATA[<p>In earlier posts I have shared memoranda distributed at a recent organization-wide meeting, including an explanation of <a href="http://www.webdogs.org/2008/09/30/tfp-taxonomy-part-two-the-practice/">our taxonomic structures</a> and details of various <a href="http://www.webdogs.org/2008/10/03/dont-overthink-file-naming-conventions/">file-naming conventions</a> adopted for this project. Attached to this post are two additional memos:</p>
<ul class="content">
<li>Download: <a href="http://www.webdogs.org/project_files/gsa_document_protocols_09-18-08.pdf">Best practices and protocols for GSA Documents</a></li>
<li>Download: <a href="http://www.webdogs.org/project_files/user_directories_09-18-08.pdf">Organization of advocate-user directories</a></li>
</ul>
<p>As I so often like to say, allow me to explain:</p>
<p>In making practical decisions about handling files targeted by our Google Search Appliance (GSA), we look both backward and forward in time. This dichotomy between the past and the future is one that Google Enterprise itself promotes with its cursory recommendations that its customers decide for themselves where to locate <a href="http://code.google.com/apis/searchappliance/documentation/50/planning/planning.html#PlanningExistingContent">existing content</a> and <a href="http://code.google.com/apis/searchappliance/documentation/50/planning/planning.html#PlanningNewContent">new content</a>.</p>
<p>Based on our experience working on this project, there are considerable differences in how to handle the &#8220;past.&#8221; A separate post on those issues will be coming forth, soon. But going forward into the &#8220;future,&#8221; we have thrashed out the practices and protocols detailed in the two memos linked, above. While there are institutional contexts for some things described in the memos that may be lost on those not part of our organization, the memos are (hopefully) self-explanatory. There are other practical details about the document protocols that will be expanded on in later posts, including how the Shared Repository web interface works, the integration of our metadata models, and so on. (All good things come to those who wait, at least with this project.)</p>
<p>Among the most practical observations in these memos, I think, is breaking through the common but incorrect perception that one needs to save a document to &#8220;the&#8221; correct directory, as opposed to &#8220;a&#8221; correct directory, however it is done. And while staff are instintively bewildered, somewhat, by concepts of &#8220;taxonomy&#8221; and &#8220;metadata&#8221; and wonder how they will be able to find things if they are not located in &#8220;the&#8221; correct directory, it is also extraordinarily reassuring to them to know that we&#8217;re talking about Google here. Even if they do not understand how it all works, typically they have great faith that Google search will find it for them, as described in an <a href="http://www.webdogs.org/2008/11/19/advocate-search-anecdotes/">earlier ancecdote</a>.</p>
<p><img style="float: right; margin: 0 0 15px 15px;" src="http://www.webdogs.org/project_files/example_shared_drive_02.png" alt="" /></p>
<p>The memos also attempt to address some of the practical realities and limits of a non-profit, legal services work environment. LSNC has neither the resources nor motivation to micro-manage how users organize their own file directories. Life is too short. But as detailed in the &#8220;advocate-user directories&#8221; memo, we do now require all LSNC staff to have a user-specific, user-named directory, and that the name used be the user&#8217;s full name. The primary motivation for this requirement is a practical need to standardize directory-name conventions throughout the organization, so that the location and targeting of files is predictable, manageable and findable. And, if for no other reason, doing so eliminates the need to guess whether something is located in the Shareen, Shari, Shelly or Sherri directory &#8212; a real-world example from our Auburn office, illustrated to the right.</p>
<h2  class="related_post_title">Other posts of possible interest...</h2><ul class="related_post"><li><a href="http://webdogs.org/2008/12/18/presumptive-shareability/" title="Presumptive Shareability">Presumptive Shareability</a></li><li><a href="http://webdogs.org/2008/11/10/selecting-gsa-targets-part-two-the-practical-realities/" title="Selecting GSA targets &#8211; Part Two: The Practical Realities">Selecting GSA targets &#8211; Part Two: The Practical Realities</a></li><li><a href="http://webdogs.org/2008/10/03/dont-overthink-file-naming-conventions/" title="Don&#039;t overthink file naming conventions">Don&#039;t overthink file naming conventions</a></li><li><a href="http://webdogs.org/2008/09/30/tfp-taxonomy-part-two-the-practice/" title="The Findability Project Taxonomy &#8211; Part Two: The Practice">The Findability Project Taxonomy &#8211; Part Two: The Practice</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://webdogs.org/2008/10/23/going-forward-document-best-practices-and-protocols/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Don&#039;t overthink file naming conventions</title>
		<link>http://webdogs.org/2008/10/03/dont-overthink-file-naming-conventions/</link>
		<comments>http://webdogs.org/2008/10/03/dont-overthink-file-naming-conventions/#comments</comments>
		<pubDate>Fri, 03 Oct 2008 22:51:03 +0000</pubDate>
		<dc:creator>Brian Lawlor</dc:creator>
				<category><![CDATA[concepts]]></category>
		<category><![CDATA[planning]]></category>
		<category><![CDATA[findability]]></category>
		<category><![CDATA[tfp]]></category>

		<guid isPermaLink="false">http://www.webdogs.org/findability/?p=198</guid>
		<description><![CDATA[This is a quick post to discuss some basics about file naming conventions worked out as part of this project. At a recent program-wide meeting to discuss details of this project with all our office managers, among the memos distributed was the following: Download: File-naming &#8220;best practices&#8221; If you take a look at the files [...]]]></description>
			<content:encoded><![CDATA[<p>This is a quick post to discuss some basics about file naming conventions worked out as part of this project. At a recent program-wide meeting to discuss details of this project with all our office managers, among the memos distributed was the following:</p>
<ul class="content">
<li>Download: <a href="http://www.webdogs.org/project_files/file_naming_conventions_09-18-08.pdf">File-naming &#8220;best practices&#8221;</a></li>
</ul>
<p>If you take a look at the files in your advocate staff directories, you are likely to see individualized albeit typical patterns in the file names. There is a discernible Darwinism to the conventions individual users adopt: They use both directory structures and name files in a way that makes them &#8220;findable&#8221; for them later, if not for others. (OK, basically &#8220;unfindable&#8221; by others, in a lot of instances.) Common naming patterns include, in almost all instances, at least a generic name descriptive of the type of document (e.g., petition or complaint or writ), plus other descriptive elements that help the user to later locate it, such as a client or project name, the date of the document, and/or whether the file is a draft or a final or a version copy.</p>
<p>There are any number of ways one can go with file naming conventions, as well illustrated in the article at CompuJurist, <a href="http://compujurist.com/2006/04/07/50/">Are there any recognized &#8220;best&#8221; practices for file naming conventions?</a> Akin to what is discussed in that article, we&#8217;ve adopted the following template for use by advocates to name their files:</p>
<p><strong>[draft/final]&nbsp;&nbsp;[document type]&nbsp;&nbsp;[party/case]&nbsp;&nbsp;[subject]&nbsp;&nbsp;[date] [file extension]</strong></p>
<p>Yes, there are other longstanding concerns about how files are named, beyond what may concern your attorneys and other advocates. These concerns include naming conventions driven by the <a href="http://www.portfoliofaq.com/pfaq/FAQ00352.htm">demands of particular operating systems</a>, all too often <a href="http://www.43folders.com/2006/10/23/file-naming">non-intuitive but technical project requirements</a>, or the <a href="http://www.mattcutts.com/blog/dashes-vs-underscores/">recommendations of 800-pound gorillas like Google</a> (explaining why Google favors dashes over underscores).</p>
<p>All that said, if you look at the memo linked above, you may notice that the examples promote the use of underscores, rather than spaces, between words in the filenames.</p>
<p>Our thinking is this:</p>
<ul class="content">
<li>As a practical matter, using spaces between words in file names creates file transfer problems when moving the file from one server to the other. Not a good thing. Especially when you are dealing with relocating files that count in the hundreds of thousands.</li>
<li>Using spaces creates readability problems when viewing the path of the file in a GSA search result, because the GSA <a href="http://en.wikipedia.org/wiki/URL_normalization">normalizes the URL</a> by inserting special characters wherever the file has a space in its name. Even if you didn&#8217;t know what to call it, you&#8217;ve seen this phenomenon. Here&#8217;s a real world example in a GSA search result from one of our test bed sites:<br />
<br />
<strong>/Ukiah%20Office/Former%20Staff/Kan&#39;s%20Transfer%20File/letter%20to%20jake.wpd</strong><br />
<br />
Look familiar?</li>
<li>File names with underscores are easier to read than files with dashes. They just are, OK? To be fair, not everyone is going to agree with that proposition. But we did some admittedly unscientific user testing (hey, Glenn, you get what you pay for), where we asked staff to read the same file name listed three ways: With spaces, with underscores, and with dashes. Without exception, our crack team of testers said they found it easiest to read the file names if they had spaces (duh!); less easy to read if there were underscores; and least easy where dashes were used.</li>
<li>A usability corollary: If you use underscores, a linked file name in a search result is easier to read because as a link the file name appears underlined, so words appear as if they have spaces, which is the easiest of the three formats to read (see test results, above).</li>
</ul>
<p>This is fairly prosaic stuff and bears some thought, but is not worth overthinking. Or much of an enforcement regime. The project goal is not to have nice, neat, compliant file names. That&#8217;s an objective. It is not the goal. We are not investing a lot of time worrying about those who paint outside the lines. The point of the project is to get the files targeted properly so that users can find the content they contain.</p>
<p>We are confident that, as users see file names displayed in the GSA search results, it will sink in why it matters how one names the files, and they will adapt.</p>
<h2  class="related_post_title">Other posts of possible interest...</h2><ul class="related_post"><li><a href="http://webdogs.org/2008/12/18/presumptive-shareability/" title="Presumptive Shareability">Presumptive Shareability</a></li><li><a href="http://webdogs.org/2008/11/10/selecting-gsa-targets-part-two-the-practical-realities/" title="Selecting GSA targets &#8211; Part Two: The Practical Realities">Selecting GSA targets &#8211; Part Two: The Practical Realities</a></li><li><a href="http://webdogs.org/2008/10/23/going-forward-document-best-practices-and-protocols/" title="Going Forward: Document &#8220;best practices&#8221; and protocols">Going Forward: Document &#8220;best practices&#8221; and protocols</a></li><li><a href="http://webdogs.org/2008/09/30/tfp-taxonomy-part-two-the-practice/" title="The Findability Project Taxonomy &#8211; Part Two: The Practice">The Findability Project Taxonomy &#8211; Part Two: The Practice</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://webdogs.org/2008/10/03/dont-overthink-file-naming-conventions/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>The Findability Project Taxonomy &#8211; Part Two: The Practice</title>
		<link>http://webdogs.org/2008/09/30/tfp-taxonomy-part-two-the-practice/</link>
		<comments>http://webdogs.org/2008/09/30/tfp-taxonomy-part-two-the-practice/#comments</comments>
		<pubDate>Tue, 30 Sep 2008 19:47:07 +0000</pubDate>
		<dc:creator>Brian Lawlor</dc:creator>
				<category><![CDATA[planning]]></category>
		<category><![CDATA[concepts]]></category>
		<category><![CDATA[findability]]></category>
		<category><![CDATA[taxonomy]]></category>
		<category><![CDATA[tfp]]></category>

		<guid isPermaLink="false">http://www.webdogs.org/findability/?p=188</guid>
		<description><![CDATA[We&#8217;ve laid out our take on the theoretical approach to the TFP taxonomy. But in practice, how is LSNC actually implementing those organizational concepts or principles? That is what this post is about. I&#8217;ll just give you the end-product upfront and then explain how LSNC sorted out the basic taxonomic structures for its shared document [...]]]></description>
			<content:encoded><![CDATA[<p>We&#8217;ve laid out our take on the <a href="http://www.webdogs.org/2008/09/23/tfp-taxonomy-part-one-the-theory/">theoretical approach</a> to the TFP taxonomy. But in practice, how is LSNC actually implementing those organizational concepts or principles? That is what this post is about.</p>
<p>I&#8217;ll just give you the end-product upfront and then explain how LSNC sorted out the basic taxonomic structures for its shared document repository. The two PDF files linked below are copies of what was distributed at a program-wide meeting a few weeks ago to address and resolve what the basic organization structures would look like.</p>
<ul class="content">
<li>Download: <a href="http://www.webdogs.org/project_files/top_level_directory_structure_05-11-09.pdf">29 Top-level Project Directories</a></li>
<li>Download: <a href="http://www.webdogs.org/project_files/simplified_structural_taxonomy_05-11-09.pdf">Simplified Project Taxonomy</a></li>
</ul>
<p>It was actually quite easy to come up with an initial (if bloated) proposed list of likely substantive advocacy content targets, their location, and how the content would be organized, but even that required process.</p>
<p>LSNC has what it calls a &#8220;regional counsel&#8221; model, which means there are three designated advocacy leaders with senior substantive, litigation and advocacy experience who are expected to provide just that, &#8220;leadership.&#8221; (One of the three, by the way, is Mona Tawatao who is the recipient of the <a href="http://www.nlada.org/Training/Train_Annual/Annual_Home_thirdcol_Awards#section6">2007 NLADA Reginald Heber Smith award</a>.) The regional counsel, with feedback from other management leadership (including the executive director, a few local office managing attorneys interested in this particular project detail, and the senior office manager representing support staff interests) worked up the list, later vetted more broadly with the entire management team, who in turn vetted it with each of their local offices or other program unit.</p>
<p>In the initial proposal, the substantive advocacy content was organized based on the ten LSC Problem categories in current use by legal services programs, plus roughly an additional 30 or so other general categories. The latter included additional substantive categories (economic development, disaster relief, etc.), practice matters (e.g., federal and state court practice issues, discovery, etc.), and other work-related content (self-help clinic content, specialized training materials, etc.) that reflect what LSNC and other legal services field programs actually do for a living. In response to any number of discussions and comments by the smaller group thrashing the details out, the list at times expanded and contracted, went deeper and then sometimes more shallow. This initial organization structure also included targeted content related to local office and central administrative office work. A similar vetting process was undertaken by the senior office manager with all the other office managers in all the core local offices, as well as administrative and business office managers. As mentioned earlier, each of those, in turn, were asked to vet the structures with their respective staff.</p>
<p>This process did not operate in a project vacuum. As not only one of the three regional counsel but also the person responsible for managing this project, I also did what I think managers should always do: I talk to the people affected. I took the time, a lot of it, to speak directly and individually with all of the forgoing to explain the overall project and its technical demands, and in a non-technical fashion (well, at least I tried) the significance of developing an organizational structure, and other, related issues, such as the use metadata models to attribute value to the targeted content, and so on. The point being, to take the time to assure leadership understood from more than a memo what the project is about, why it matters, and answer their questions or concerns. In response to the vetting and these dialogs, real changes were made in the proposed organization and additional content targets were identified. Time investments paid dividends, at least in this case.</p>
<p>By the time our GSA consultant showed up for a scheduled three-day thrashing of our test-bed installation in Sacramento, we had a taxonomy with over 40 top-level directories and a lot of two- and three-level deep subdirectories. He looked at this, in a non-committal fashion said &#8220;that&#8217;s fine,&#8221; and then began to suggest reasons why it should be simplified. This push by the GSA consultant was prompted by notions of usability and manageability of the content areas. As mentioned in the <a href="http://www.webdogs.org/2008/09/23/tfp-taxonomy-part-one-the-theory/">prior post on project taxonomy</a>, there are not significant advantages or improvements to search results in a repository structure beyond a second-level directory. The consultant also emphasized that most users are not likely to locate or use a directory substructure below the second level. (This has to do with users navigating directory structures to add, remove or modify files, for whatever reason.)</p>
<p>Since a significant portion of the metadata models we are adopting rely on the organization structures in order to build logical, searchable &#8220;collections,&#8221; we simplified the structures in response to the consultant&#8217;s recommendation in this regard. Hence, the 29 top-level directories and the &#8220;simplified&#8221; taxonomy you can see in the memos linked at the beginning of this post, and the reliance on only one-level deeper for those directories.</p>
<p>As we get life experience with this organization structure, my guess is that we may expand to add a few additional top-level directories but not many, if any. I think we have things pretty much covered at the top-level, at this point. But apart from the rigid yet practical exploitation of the dated &#8212; but undeniably familiar &#8212; LSC Problem Codes for a large chunk of the substantive organization, my guess also is that the one-level down subdirectory structures will likely change as users give us feedback, and we discover that some subdirectories are not particularly used or useful. Proof&#8217;s in the pudding, people.</p>
<p>This all came full circle with our program-wide meeting a few weeks back. By the time of that meeting, every manager within LSNC had seen the organization proposals, every manager had a one-on-one conversation with project staff about the project and the organization structure, every manager had vetted the proposal to his or her people, and the memos you see linked here had been distributed to all offices.</p>
<p>That&#8217;s how we roll.</p>
<h2  class="related_post_title">Other posts of possible interest...</h2><ul class="related_post"><li><a href="http://webdogs.org/2009/05/17/tfp-taxonomy-part-four-revisions-to-the-projects-structural-taxonomy/" title="TFP Taxonomy &#8211; Part Four: Revisions to the project&#039;s structural taxonomy">TFP Taxonomy &#8211; Part Four: Revisions to the project&#039;s structural taxonomy</a></li><li><a href="http://webdogs.org/2008/12/18/presumptive-shareability/" title="Presumptive Shareability">Presumptive Shareability</a></li><li><a href="http://webdogs.org/2008/09/23/tfp-taxonomy-part-one-the-theory/" title="The Findability Project Taxonomy &#8211; Part One: The Theory">The Findability Project Taxonomy &#8211; Part One: The Theory</a></li><li><a href="http://webdogs.org/2009/03/26/metadata-schmetadata-relevance-and-reality/" title="Metadata Schmetadata, Relevance and Reality">Metadata Schmetadata, Relevance and Reality</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://webdogs.org/2008/09/30/tfp-taxonomy-part-two-the-practice/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

