<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: DISCUSSION: Flash + Google Indexing</title>
	<atom:link href="http://thesaj.wordpress.com/2008/03/17/discussion-flash-google-indexing/feed/" rel="self" type="application/rss+xml" />
	<link>http://thesaj.wordpress.com/2008/03/17/discussion-flash-google-indexing/</link>
	<description>Simple And Stupid Software Implementation Examples</description>
	<pubDate>Fri, 04 Jul 2008 00:01:54 +0000</pubDate>
	<generator>http://wordpress.org/?v=MU</generator>
		<item>
		<title>By: devdave</title>
		<link>http://thesaj.wordpress.com/2008/03/17/discussion-flash-google-indexing/#comment-196</link>
		<dc:creator>devdave</dc:creator>
		<pubDate>Sun, 27 Apr 2008 15:33:18 +0000</pubDate>
		<guid isPermaLink="false">http://thesaj.wordpress.com/2008/03/17/discussion-flash-google-indexing/#comment-196</guid>
		<description>I agree with Josh - I don't want bots trawling through every string in my code; a lot of the strings will be misleading for search engine purposes.

Using an XML document as a content provider and XSL is ingenious - however, do you reckon there is a chance that at some point Google might classify this as grey hat since the bot is indexing something which doesn't really appear on the page.

For the majority of my clients we usually end up grabbing the data from some kind of CMS (either XML or a database) and creating a simple HTML microsite which either redirects to the Flash (also a little dangerous from a blacklisting perspective) or has links to the real site.

I'm sure I heard a rumour that Adobe had some kind of big and cunning plan for improving this in the next generation of Flash Player, but hey, they've been saying that for a while :)

Dave
http://www.actionscriptfreelancer.com</description>
		<content:encoded><![CDATA[<p>I agree with Josh - I don&#8217;t want bots trawling through every string in my code; a lot of the strings will be misleading for search engine purposes.</p>
<p>Using an XML document as a content provider and XSL is ingenious - however, do you reckon there is a chance that at some point Google might classify this as grey hat since the bot is indexing something which doesn&#8217;t really appear on the page.</p>
<p>For the majority of my clients we usually end up grabbing the data from some kind of CMS (either XML or a database) and creating a simple HTML microsite which either redirects to the Flash (also a little dangerous from a blacklisting perspective) or has links to the real site.</p>
<p>I&#8217;m sure I heard a rumour that Adobe had some kind of big and cunning plan for improving this in the next generation of Flash Player, but hey, they&#8217;ve been saying that for a while <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>Dave<br />
<a href="http://www.actionscriptfreelancer.com" rel="nofollow">http://www.actionscriptfreelancer.com</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Brian</title>
		<link>http://thesaj.wordpress.com/2008/03/17/discussion-flash-google-indexing/#comment-178</link>
		<dc:creator>Brian</dc:creator>
		<pubDate>Thu, 10 Apr 2008 06:46:49 +0000</pubDate>
		<guid isPermaLink="false">http://thesaj.wordpress.com/2008/03/17/discussion-flash-google-indexing/#comment-178</guid>
		<description>For Google Analytics to work on a Flash/Flex site, you already have to manually notify the tracker that a pageview has occurred.  It'd be nice if GoogleBot could at least tie into that somehow.</description>
		<content:encoded><![CDATA[<p>For Google Analytics to work on a Flash/Flex site, you already have to manually notify the tracker that a pageview has occurred.  It&#8217;d be nice if GoogleBot could at least tie into that somehow.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: thesaj</title>
		<link>http://thesaj.wordpress.com/2008/03/17/discussion-flash-google-indexing/#comment-146</link>
		<dc:creator>thesaj</dc:creator>
		<pubDate>Tue, 18 Mar 2008 13:47:38 +0000</pubDate>
		<guid isPermaLink="false">http://thesaj.wordpress.com/2008/03/17/discussion-flash-google-indexing/#comment-146</guid>
		<description>Josh/Jim - Thanks for the link to Ted's directory. That is definitely one of the cleaner ways I've seen a transition implemented.

Josh - My idea was not to have search engines try to understand the SWF format any deeper. Rather to simply give them a channel to listen to.  And to perhaps provide a method that would optionally allow for differentiation of states.

Andrew - Most definitely would have to be internal to Flash Player. 

John - We have numerous clients, some have some pretty high relevancy on the web.  Just pulling one off the top of my head, Home Media Magazine ranks 4th for the following keywords: "Blu-Ray format war over".

So there is validity to our desire to improve indexing results.</description>
		<content:encoded><![CDATA[<p>Josh/Jim - Thanks for the link to Ted&#8217;s directory. That is definitely one of the cleaner ways I&#8217;ve seen a transition implemented.</p>
<p>Josh - My idea was not to have search engines try to understand the SWF format any deeper. Rather to simply give them a channel to listen to.  And to perhaps provide a method that would optionally allow for differentiation of states.</p>
<p>Andrew - Most definitely would have to be internal to Flash Player. </p>
<p>John - We have numerous clients, some have some pretty high relevancy on the web.  Just pulling one off the top of my head, Home Media Magazine ranks 4th for the following keywords: &#8220;Blu-Ray format war over&#8221;.</p>
<p>So there is validity to our desire to improve indexing results.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John Dowdell</title>
		<link>http://thesaj.wordpress.com/2008/03/17/discussion-flash-google-indexing/#comment-145</link>
		<dc:creator>John Dowdell</dc:creator>
		<pubDate>Tue, 18 Mar 2008 04:06:42 +0000</pubDate>
		<guid isPermaLink="false">http://thesaj.wordpress.com/2008/03/17/discussion-flash-google-indexing/#comment-145</guid>
		<description>On which search terms do you think people might search for your services, and on which of those might you be able to place within the first page of Google results?</description>
		<content:encoded><![CDATA[<p>On which search terms do you think people might search for your services, and on which of those might you be able to place within the first page of Google results?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jim</title>
		<link>http://thesaj.wordpress.com/2008/03/17/discussion-flash-google-indexing/#comment-144</link>
		<dc:creator>Jim</dc:creator>
		<pubDate>Tue, 18 Mar 2008 01:48:54 +0000</pubDate>
		<guid isPermaLink="false">http://thesaj.wordpress.com/2008/03/17/discussion-flash-google-indexing/#comment-144</guid>
		<description>One way is to place the textual data that is to be supplied to the SWF in the HTML page that contains the SWF, and use XSL to transform it for the browser. This is easier to show by example than it is to describe.

To see an example, go to http://directory.onflex.org/ and you'll see a Flex UI. Now, view the HTML source in the browser, and you'll see an XML page that does *not* embed any SWF. Instead, it holds all the data. This is what the search engines will see, and as you can see it is highly indexable. This page also supplies the data for the SWF, so it performs double duty. 

Now look at the XSL for the above page at http://directory.onflex.org/template003.xsl and you'll see that the XML is transformed into a simple HTML page that uses JavaScript to embed the SWF. This supplies what the user sees in his browser - the Flex-built SWF. The SWF, in turn, loads its data from the above XML.

So, really, it isn't any more work than standard Flash, it's just a slightly different twist: Rather than using a plain-Jane HTML file to embed the SWF and a separate XML file to supply the data to the SWF, you let the XML file double as a search-engine-friendly page and use an XSL file to transform it into HTML that embeds the SWF.</description>
		<content:encoded><![CDATA[<p>One way is to place the textual data that is to be supplied to the SWF in the HTML page that contains the SWF, and use XSL to transform it for the browser. This is easier to show by example than it is to describe.</p>
<p>To see an example, go to <a href="http://directory.onflex.org/" rel="nofollow">http://directory.onflex.org/</a> and you&#8217;ll see a Flex UI. Now, view the HTML source in the browser, and you&#8217;ll see an XML page that does *not* embed any SWF. Instead, it holds all the data. This is what the search engines will see, and as you can see it is highly indexable. This page also supplies the data for the SWF, so it performs double duty. </p>
<p>Now look at the XSL for the above page at <a href="http://directory.onflex.org/template003.xsl" rel="nofollow">http://directory.onflex.org/template003.xsl</a> and you&#8217;ll see that the XML is transformed into a simple HTML page that uses JavaScript to embed the SWF. This supplies what the user sees in his browser - the Flex-built SWF. The SWF, in turn, loads its data from the above XML.</p>
<p>So, really, it isn&#8217;t any more work than standard Flash, it&#8217;s just a slightly different twist: Rather than using a plain-Jane HTML file to embed the SWF and a separate XML file to supply the data to the SWF, you let the XML file double as a search-engine-friendly page and use an XSL file to transform it into HTML that embeds the SWF.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Andrew Westberg</title>
		<link>http://thesaj.wordpress.com/2008/03/17/discussion-flash-google-indexing/#comment-143</link>
		<dc:creator>Andrew Westberg</dc:creator>
		<pubDate>Tue, 18 Mar 2008 00:22:16 +0000</pubDate>
		<guid isPermaLink="false">http://thesaj.wordpress.com/2008/03/17/discussion-flash-google-indexing/#comment-143</guid>
		<description>Interesting thoughts.  It would have to be embedded at the flash player layer though as putting it any higher in the stack would have people hacking the content for the way google sees them.</description>
		<content:encoded><![CDATA[<p>Interesting thoughts.  It would have to be embedded at the flash player layer though as putting it any higher in the stack would have people hacking the content for the way google sees them.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Josh</title>
		<link>http://thesaj.wordpress.com/2008/03/17/discussion-flash-google-indexing/#comment-142</link>
		<dc:creator>Josh</dc:creator>
		<pubDate>Mon, 17 Mar 2008 22:33:34 +0000</pubDate>
		<guid isPermaLink="false">http://thesaj.wordpress.com/2008/03/17/discussion-flash-google-indexing/#comment-142</guid>
		<description>Personally, I don't want to see search engines try to understand the SWF format any more than they do now. Being a complex binary format, SWFs are a poor format for crawling. Heck, many SWFs don't even have data that &lt;em&gt;should&lt;/em&gt; be considered searchable. I want to see SWF content authors find better ways to give the data to search engines in ways those search engines already understand or can understand with much more useful enhancements (beyond just the Flash use-case).

The best solution that I've seen so far is the one that Ted Patrick used for &lt;a href="http://directory.onflex.org/" rel="nofollow"&gt;Flex Directory&lt;/a&gt;. He uses XSL to transform a simple, semantic XML/HTML document by replacing it with SWF embed code. The SWF then reloads the page without the XSL transform, and it displays the data as part of an RIA.

Ted's solution is a good first step. I'd like to see it brought to a finished product's level of quality, though. If JavaScript is disabled, the embed code fails. I'd like to see decent error handling such that if the content cannot be embedded, the raw data will be formatted nicely as a regular webpage (probably with CSS too). Basically, have it fail gracefully such that the user may not even notice that the preferred Flash Player view didn't work.

On the search engine front, I think Google/Yahoo!/others should have better handling of pure XML and other non-HTML text data formats. Ted found that he had to change his page's content from XML to HTML because the search engines would completely ignore the page's content if they didn't understand the tags being used. The web is moving beyond HTML. We have other XML-based formats like RSS, ATOM, JSON, and others that could provide all sorts of new information to search engines. Even better, these formats could be very useful in the crawling of RIAs created with technologies like Flash, Silverlight, and others.</description>
		<content:encoded><![CDATA[<p>Personally, I don&#8217;t want to see search engines try to understand the SWF format any more than they do now. Being a complex binary format, SWFs are a poor format for crawling. Heck, many SWFs don&#8217;t even have data that <em>should</em> be considered searchable. I want to see SWF content authors find better ways to give the data to search engines in ways those search engines already understand or can understand with much more useful enhancements (beyond just the Flash use-case).</p>
<p>The best solution that I&#8217;ve seen so far is the one that Ted Patrick used for <a href="http://directory.onflex.org/" rel="nofollow">Flex Directory</a>. He uses XSL to transform a simple, semantic XML/HTML document by replacing it with SWF embed code. The SWF then reloads the page without the XSL transform, and it displays the data as part of an RIA.</p>
<p>Ted&#8217;s solution is a good first step. I&#8217;d like to see it brought to a finished product&#8217;s level of quality, though. If JavaScript is disabled, the embed code fails. I&#8217;d like to see decent error handling such that if the content cannot be embedded, the raw data will be formatted nicely as a regular webpage (probably with CSS too). Basically, have it fail gracefully such that the user may not even notice that the preferred Flash Player view didn&#8217;t work.</p>
<p>On the search engine front, I think Google/Yahoo!/others should have better handling of pure XML and other non-HTML text data formats. Ted found that he had to change his page&#8217;s content from XML to HTML because the search engines would completely ignore the page&#8217;s content if they didn&#8217;t understand the tags being used. The web is moving beyond HTML. We have other XML-based formats like RSS, ATOM, JSON, and others that could provide all sorts of new information to search engines. Even better, these formats could be very useful in the crawling of RIAs created with technologies like Flash, Silverlight, and others.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
