Jump to content
The Inquirer-Home

Googlebots crawl for Flash

Kiss fast-loading wibblery goodbye
Tuesday, 1 July 2008, 15:35

A RECENT POSTING on the official Googlebog has confirmed that the search giant will soon be indexing Flash content, no doubt to the delight of over-exhuberant web designers everywhere.

Until now, search engine indexing spiders have ignored Flash content much to the annoyance of those enamored by over-designed twirly-whirly animated nonsense.

Ron Adler and Janis Stipins - software engineers on Google's indexing team - explained how the new functionality did its funky thang.

"We've improved our ability to index textual content in SWF files of all kinds. This includes Flash 'gadgets' such as buttons or menus, self-contained Flash websites, and everything in between.

"If your web site contains Flash, the textual content in your Flash files can be used when Google generates a snippet for your web site. Also, the words that appear in your Flash files can be used to match query terms in Google searches. "

In addition to finding and indexing the textual content in Flash files, Google will also discover URLs which appear in Flash files, and feed them into its crawling pipeline. If a Flash application contains links to pages inside a web site, Google may now discover and crawl more of that site.

Non-contextual content like images and videos are still providing difficult. At present the engineers are only discovering and indexing textual content. Image files will not be recognised or indexed as well as any text that may appear in those images.

The Google boffins have developed an algorithm which explores Flash files in the same way that a person would, by clicking buttons, entering input, and so on. The algorithm remembers all of the text that it encounters along the way, and that content is then available to be indexed.

Google was not prepared to reveal all of the proprietary details, but it did say that the algorithm's effectiveness was improved by utilizing Adobe's new Searchable SWF library.

Webmasters won't have to recode standard content on existing sites as the changes will be implemented automatically, but Flash content which is loaded via Javascript will have to be rethought as the Googlebots won't see it. Offsite flash content will also be indexed seperately.

Lastly, anyone using a bidirectional language, such as Hebrew or Arabic, will have to wait for a forthcoming fix. µ

Share this:

Comments
What?

Bidirectional Language?
WHAT ARE THEY? - CRAZY / YZARC - ?YEHT ERA TAHW

why would anyone want bidirectional language?
Is that better somehow?
It seems confusing to me, like if you switched the road signs direction at whim.

posted by : eh?, 01 July 2008 Complain about this comment
Bidirectional

Bidirectoinal languages are the languages whose writting goes from right to left (you start writing from the right of the page to the left), as opposed to the left to right writting we see in western languages (english)

Have you ever seen an arabic news channel, were the news text at the botton of the page crawls completely different as you would expect?? That's what i mean..

posted by : Dimitris K, 01 July 2008 Complain about this comment
Hmm

Bidirectional is denoting 2 directions, and does not mean reversed from western, I guess you could argue hebrew/arabic/etc is bidirectional if you count numbers as part of the language, but I think it's simply just a mistake to use that word in this case possibly.

http://en.wikipedia.org/wiki/Bi-directional_text


posted by : W.-, 02 July 2008 Complain about this comment
Yuck

Indexed schlockwave.

posted by : North, 02 July 2008 Complain about this comment
Advertisement
Subscribe to the INQ Newsletter
Sign-up for the INQBot weekly newsletter
Click here to sign up Existing user
Advertisement
INQ Poll

Christmas computer sales

Will you be buying a new computer this Christmas?