Yahoo Research Labs

Jan 20th

Me Too! Google is frequently cooking up something in the Google Labs. Yahoo today announces the creation of "Yahoo Research Labs."

Latent Semantic Indexing

Jan 18th

(GEEK STUFF) One of the largest problems many search engines run into is that after they get to a few hundred million documents their algorithms and hardware hit a wall.

For those companies that can afford the investment to get past this point they still run into the problem that each additional resource makes their job a bit harder.

One of the major ways around this problem is to take advantage of the natural patterns in human language. Using Latent Semantic Indexing allows indexing search results based on the pairing of like words within documents.

Many complex searches may lack exact matches in the results as well. Being able to find near matches will allow search engines to provide more comprehensive results.

Its hard to get computers to understand anything human, but the process of latent semantic indexing delivers conceptual results while being entirely mathematically driven.

There are two main ways to do this, single variable decomposition and multi dimentional scaling.

Some of the steps of the single variable decomposition process are to:

  • create a database of all words in relevant documents
  • remove common stop words
  • stemming
  • remove words appearing in all results
  • remove words only appearing in one result
  • create a database of relavent keywords
  • weight the pages based on the frequency of keyword distribution
  • increasing the relevance of terms which appear in a small number of pages (as they are more likely to be on topic than words that appear in most all documents)
  • normalize the page to remove the pagelength as a factor
  • create relevancy vectors for the keywords

The single variable decomposition process is not scalable enough to work on large scale search engines though as it requires too much processor time. Multi dimentional scaling allows us to take snapshots of the topicology of different documents. "Instead of deriving the best possible projection through matrix decomposition, the MDS algorithm starts with a random arrangement of data, and then incrementally moves it around, calculating a stress function after each perturbation to see if the projection has grown more or less accurate. The algorithm keeps nudging the data points until it can no longer find lower values for the stress function."

This does not provide exact results, but only a rough approximation. When combined with other factors this approximation improves scalability and quality of search.

Good Reading on latent semantic indexing

This technology is so amazing that it may eventually help lead to a cure for cancer. Already the technology is being refined for cognitive improvements and test grading!

Top Search Keywords List

Jan 18th

So many people ask what the top search keywords are. Generally this is an unimportant topic. What is important is the top keywords in the subject you know or are interested in. Looking at the general terms means you must compete with the entire web. Thinking of your specific segment (and those who compete with it) makes it easier to determine which keywords are important to you.

Here are some of the top keywords lists though just for the heck of it...

Yahoo Buzz index (also in UK version)
Lycos 50
Google Zeitgiest
Ask Jeeves

and of course the traditional Keyword tools
Overture Search Term Suggestion (also in UK verion)
Espotting
Wordtracker

and Hitwise offers a monthly search term report

many of these were found in a thread at Highrankings Forum

  • Over 100 training modules, covering topics like: keyword research, link building, site architecture, website monetization, pay per click ads, tracking results, and more.
  • An exclusive interactive community forum
  • Members only videos and tools
  • Additional bonuses - like data spreadsheets, and money saving tips
We love our customers, but more importantly

Our customers love us!

Custom 404 Error Page

Jan 18th

Many sites do not have a custom 404 error page. When a site visitor clicks on a dead link the visitor is most likely gone. Here is a perfect article about creating the perfect 404 error page.

Google ads in email

Jan 18th

Syndication AdSense ads are going to appear more regularly in email newsletters.

Did-It Keyword Suggestion Tool

Jan 17th

Did it has released a new free keyword suggestion tool located at http://www.did-it.com/suggest.php. They analyzed the meta tags of millions of websites to create keyword relationships between them.

While not as robust as WordTracker, this is a great tool for free!

MSN Using Inktomi, Drops LookSmart

Jan 16th
posted in
msn

Divoriced! MSN drops LookSmart. Results now powered by Inktomi.

Kindling Inktomi now powers MSN, Yahoo announces change to Inktomi will occur in Q1

Pay Per Click is Broken?

Some recent articles ("Google's House of Cards" "A Perfect Storm for Pay Per Click") have been saying that the ROI for paid advertising is going away. It has been. It was not very competitive a few years ago, but now with over 150,000 people in the market you need to be more effective to extract profits from a campaign.

I honestly think many of the articles are suggested / written by people who want to make their own jobs easier and make more money while doing it. Some articles may even be written to scare away competition or drive leads to firms who provide the services.

In the past a sloppy website with low conversion rates was ok because there was little competition. Now some areas are requiring a smooth ad, which is well targeted, that leads to a smooth site, which has great usability, and is customer centric. In essence the shakeup of the organic listings and the rising costs of pay per click ads are forcing websites (and the internet as a whole) to be more functional.

There are few mediums which have feedback as rapid as AdWords does. Pay per click is here to stay. Those who know how to use it will make a ton of money.

How to Make Dynamic URLs Static

Jan 15th

Many of these tips originate from members of the I search discussion list (which is an amazing resource well worth the money).

This guy has an datebase ASP website and makes his dynamic content look static to the search engines using a custom 404 error pag build.

Additional ideas are a server side filter softwarehttp://www.smalig.com/url_rewrite-en.htm and URL rewriting software http://www.opcode.co.uk/components/rewrite.asp.

Here is the Apache Mod Rewrite page for you Apache people...

General tips to make a dynamic site get spidered
1.) Do not force feed the spider a cookie
2.) Use 3 or less variables
3.) Have each query string 10 or less digets
4.) Create a sitemap which links to many of the main database locations.
5.) Build up link popularity from a few quality inbound links. The PageRank (or link popularity in search engines other than Google) will make the spider more inclined to spider deep through your site.

Google Keyword Density Analyzer Tool

Jan 14th

The most important part of SEO is off the page factors, but for those of you heavily interested in on the page criteria...
Over at GoRank they created a tool for comaring the keyword density of various pages. In addition they have a small report about their keyword density findings in the post Florida Google.

Pages






    Email Address
    Pick a Username
    Yes, please send me "7 Days to SEO Success" mini-course (a $57 value) for free.

    Learn More

    We value your privacy. We will not rent or sell your email address.