<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>ShapeSpace &#187; Data Cleansing</title>
	<atom:link href="http://www.shapespace.com/topics/consultancies/data-cleansing-consultancies/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.shapespace.com</link>
	<description>ShapeSpace Website</description>
	<lastBuildDate>Mon, 04 Feb 2013 12:52:46 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Cleaning up for CAD &amp; PLM</title>
		<link>http://www.shapespace.com/consultancies/data-cleansing-consultancies/cleaning-up-for-cad-plm/</link>
		<comments>http://www.shapespace.com/consultancies/data-cleansing-consultancies/cleaning-up-for-cad-plm/#comments</comments>
		<pubDate>Tue, 21 Sep 2010 14:18:54 +0000</pubDate>
		<dc:creator>drewsherlock</dc:creator>
				<category><![CDATA[Data Cleansing]]></category>

		<guid isPermaLink="false">http://www.shapespace.com/?p=517</guid>
		<description><![CDATA[Here&#8217;s a story we&#8217;ve seen a few times in the last 12 months at ShapeSpace: A small or medium-sized engineering company has been happily using CAD for 10 or more years. Things have gone well, perhaps the design team has doubled in size, and tens of thousands of parts and assemblies have been created. But now it&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<div id="_mcePaste">
<p><a href="http://bookofjoe.typepad.com/photos/uncategorized/2007/07/02/chainclean01.jpg"><img class="aligncenter" title="Cleaning up" src="http://bookofjoe.typepad.com/photos/uncategorized/2007/07/02/chainclean01.jpg" alt="" width="279" height="300" /></a></p>
<p><strong>Here&#8217;s a story we&#8217;ve seen a few times in the last 12 months at ShapeSpace:</strong></p>
</div>
<div id="_mcePaste">A small or medium-sized engineering company has been happily using CAD for 10 or more years. Things have gone well, perhaps the design team has doubled in size, and tens of thousands of parts and assemblies have been created. But now it&#8217;s becoming clear that the organisation of the data is getting out of hand. A lot of time is spent looking for data. Duplication is everywhere because files have been copied between folders. Something needs to be done. So the decision is made &#8211; we need a PDM or PLM system.</div>
<p>
<div id="_mcePaste"><strong>Now, how to get all the data in?</strong></div>
<p>
<div>Yes, we can use batch tools to upload files and product structure, but we know our data is badly organised, so we&#8217;ll need to clean things up first. We can bring in consultants and set up an in-house team to tackle the data. But now we&#8217;re faced with the scale of the problem. Perhaps we have 50,000 files and 10% need some attention prior to migration into the PLM system. That&#8217;s 5,000 files. Say, as a wild underestimate, each needs 10 minutes of someone&#8217;s time to open, do a where-used, and make some decision on what to do. That&#8217;s 50,000 minutes, or 833 hours, or 104 eight-hour working days &#8211; solid. So we throw our hands in the air, decide to park the data somewhere and migrate it into our PLM system as it&#8217;s needed.</div>
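<div>The estimate above is easy to re-run with your own numbers. Here is a minimal sketch of the arithmetic; the function name and the eight-hour working day are assumptions made for illustration, not part of any ShapeSpace tool.</div>

```python
# Back-of-envelope estimate of manual clean-up effort before a PLM migration.
# Assumption: an 8-hour working day (consistent with 833 hours ~ 104 days).

def cleanup_effort(total_files, percent_needing_attention, minutes_per_file,
                   hours_per_day=8):
    """Return (files, minutes, hours, days) of manual effort."""
    files = total_files * percent_needing_attention // 100
    minutes = files * minutes_per_file
    hours = minutes / 60
    days = hours / hours_per_day
    return files, minutes, hours, days


files, minutes, hours, days = cleanup_effort(50_000, 10, 10)
print(f"{files} files, {minutes} minutes, {hours:.0f} hours, {days:.0f} working days")
# prints: 5000 files, 50000 minutes, 833 hours, 104 working days
```

<div>Changing the minutes per file to something more realistic shows how quickly the total grows.</div>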
<div id="_mcePaste"></div>
<p>
<div>Yes, PLM will help to keep <strong>future</strong> data in order, but it is no help with the problem we started with &#8211; organising our <strong>existing</strong> data.</div>
<div id="_mcePaste">Oleg Shilovitsky <a title="PLM and Legacy Data" href="http://beyondplm.com/2010/07/26/plm-and-legacy-data/">has written about legacy data here</a>. However, &#8216;legacy&#8217; implies that the data is old &#8211; in fact, at this point all of the data is legacy data.</div>
<p>
<div id="_mcePaste"><strong>We should look for help in the data itself</strong>.</div>
<p>
<div>Most of the information needed to help with the migration is already present in the data; we just lack the tools to use it. François Guillaumin wrote an interesting <a title="Semantic search, Classification and Data migration: the winning team" href="http://plmforest.free.fr/?p=176">blog post</a> on how he used semantic analysis to classify parts prior to a migration. We (ShapeSpace, along with our partners AESSiS) have <a title="Reducing Part Counts &amp; Material Spend" href="http://www.aessis.com/Blog/post/Reducing-Part-Counts-Material-Spend.aspx">used</a> our shape similarity algorithms to identify groups of geometrically identical parts in the CAD data, and then analysed their attributes to determine duplication.</div>
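<div>To make the two-step idea concrete (group by geometry first, then compare attributes), here is a rough sketch. It is not ShapeSpace&#8217;s algorithm: the <code>shape_sig</code> field is a hypothetical placeholder for whatever a shape similarity tool would output, and the file names and attributes are invented sample data.</div>

```python
from collections import defaultdict

# Hypothetical part records. "shape_sig" stands in for the output of a
# geometric similarity tool; parts with the same value are assumed to be
# geometrically identical. All names and values are made up.
parts = [
    {"file": "bracket_001.prt", "shape_sig": "A1", "material": "steel"},
    {"file": "copy_of_bracket_001.prt", "shape_sig": "A1", "material": "steel"},
    {"file": "bracket_alu.prt", "shape_sig": "A1", "material": "aluminium"},
    {"file": "spacer_10mm.prt", "shape_sig": "B7", "material": "nylon"},
]


def find_duplicate_candidates(parts, attribute_keys=("material",)):
    """Group parts by shape, then by attributes; return groups of 2+ files."""
    # Step 1: group geometrically identical parts.
    by_shape = defaultdict(list)
    for part in parts:
        by_shape[part["shape_sig"]].append(part)

    # Step 2: within each shape group, compare the chosen attributes.
    candidates = []
    for same_shape in by_shape.values():
        by_attrs = defaultdict(list)
        for part in same_shape:
            attrs = tuple(part[key] for key in attribute_keys)
            by_attrs[attrs].append(part["file"])
        for files in by_attrs.values():
            if len(files) > 1:
                candidates.append(sorted(files))
    return candidates


duplicates = find_duplicate_candidates(parts)
print(duplicates)
# prints: [['bracket_001.prt', 'copy_of_bracket_001.prt']]
```

<div>The aluminium bracket shares its shape with the other two but differs in material, so it is not flagged. Which attributes count as significant is a per-company decision.</div>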
<div id="_mcePaste">I think tools to data-mine and data-cleanse product data are an area where much more should be done. After all, your PLM system can only be as good as the data in it.</div>
<p>
<div id="_mcePaste">I&#8217;d be interested to hear your thoughts&#8230;</div>
<div><a title="Data cleanse consultancy" href="http://www.shapespace.com/solutions/data-cleanse-cad-and-plm/"><br />P.S. More about ShapeSpace&#8217;s data cleansing services can be found here&#8230;</a></div>
]]></content:encoded>
			<wfw:commentRss>http://www.shapespace.com/consultancies/data-cleansing-consultancies/cleaning-up-for-cad-plm/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
	</channel>
</rss>
