Submit Your Article Forum Rules Search
WebProWorld
Register FAQ Calendar Mark Forums Read
Google Discussion Forum Google Discussion forum is for topics specifically related to Google. There is a subforum dedicated to AdSense/AdWords subjects.

Share Thread: & Tags

Share Thread:

Reply
 
LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 01-10-2006, 12:44 PM
greeneagle's Avatar
WebProWorld 1,000+ Club
 
Join Date: Dec 2003
Location: Houston
Posts: 5,285
greeneagle RepRank 0
Default LSI/LSA – Is MSN Ahead of GOOGLE in AI?

LSI/LSA – Is MSN Ahead of GOOGLE in AI?

In another thread it was claimed that MSN has the advantage over GOOGLE in their algorithms in the use of Artificial Intelligence (AI) utilizing LSI/LSA.

I have yet to see anything substantial indicating that MSN has endeavored to that extent. Yet, GOOGLE selects what they determine to be the most accurate description of a page from multiple on page and off page components for SERP listings including a recent revisited importance favoring the “Description Metatag” when determined relevant. IMO – That indicates to me that LSI/LSA has been established to some extent.

What are your thoughts?

Ken
__________________
Mountain Eagle Marketing
Contemporary Art News
Modern Art News
Reply With Quote
  #2 (permalink)  
Old 01-10-2006, 12:53 PM
williamc's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Jul 2003
Location: GoogleVille
Posts: 1,652
williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9
Default

I don't personally see it as MSN being ahead of google, only using it first in their algo. That does not say they are actually ahead in what they can do with it yet.

But this is Mels baby, so I will shut up now until he posts :)
__________________
William Cross
Expert Search Engine Optimization
Man's Best Friend: If you don't believe it, just try this experiment.
Put your dog and your wife in the trunk of the car for an hour. When you open the trunk, who is really happy to see you?
Reply With Quote
  #3 (permalink)  
Old 01-10-2006, 02:06 PM
kgun's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Norway
Posts: 5,944
kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10
Default

Some of you WPW members must not know what LSI/LSA are.

On the latent semantic indexing webpage you find a lot of articles that explain the concepts.

Click on papers in the left menu. A related subject is term vectoring.
__________________
Mini Network:: Financial information at your fingertips
Learn object oriented programming where it started

I will use a search engine before I ask dumb questions.
Reply With Quote
  #4 (permalink)  
Old 01-10-2006, 02:13 PM
williamc's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Jul 2003
Location: GoogleVille
Posts: 1,652
williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9
Default

Actually, I was one of the first people saying it was going to happen along with Jake baille and others. I am well familliar with it.
__________________
William Cross
Expert Search Engine Optimization
Man's Best Friend: If you don't believe it, just try this experiment.
Put your dog and your wife in the trunk of the car for an hour. When you open the trunk, who is really happy to see you?
Reply With Quote
  #5 (permalink)  
Old 01-10-2006, 03:01 PM
WebProWorld Veteran
 
Join Date: Jan 2006
Posts: 352
SPC2 RepRank 0
Default

MSN wasn't the first. Fast has had niche engines using LSI for quite a while. The problem hasn't been implementing LSI in a search engine, but in implementing it in a very large engine.
Reply With Quote
  #6 (permalink)  
Old 01-10-2006, 03:20 PM
WebProWorld Pro
 
Join Date: Nov 2005
Location: Devon UK
Posts: 102
westie RepRank 0
Default

There's a new search engine based in UK which claims to be most advanced in the world!! So press release said! They use AI I believe.

Have a look at previewseek - I use it a lot as I like the clustering and the results.

Regards
Reply With Quote
  #7 (permalink)  
Old 01-10-2006, 06:09 PM
kgun's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Norway
Posts: 5,944
kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10
Default

Is it a meta SE?
Previewseek was also more comprehensive, he said. "It visits all the major search engines on your behalf, so that you can be sure that you are not missing any result. We put the results back together and then we grade them according to our BehaveRank algorithm that continuously evolves based on user behaviour."

My underline or a combination of a unique AI and a meta SE?

Then you have to define what you mean by better? Better at finding new sites / pages?

They say it is more comprehensive and uses what they call their BehaveRank algorithm than changes with user actions.

Good, but should like to see that algorithm.
__________________
Mini Network:: Financial information at your fingertips
Learn object oriented programming where it started

I will use a search engine before I ask dumb questions.
Reply With Quote
  #8 (permalink)  
Old 01-10-2006, 06:17 PM
DrTandem1's Avatar
WebProWorld 1,000+ Club
 
Join Date: Oct 2003
Location: Encinitas, CA
Posts: 1,830
DrTandem1 RepRank 2
Default

I think your premise is false. Google relies on IBLs, not content. There is little that is intelligent about Google.
__________________
DrTandem's San Diego Web Page Design, drtandem.com
Reply With Quote
  #9 (permalink)  
Old 01-10-2006, 06:21 PM
kgun's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Norway
Posts: 5,944
kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10
Default Good evening DrTandem1

It is evening in Norway.

You disagree as usual, as I with you :-)

"PageRank or PR(A) can be calculated using a simple iterative algorithm, and corresponds to the principal eigenvector of the normalized link matrix of the web. Also, a PageRank for 26 million web pages can be computed in a few hours on a medium size workstation. There are many other details which are beyond the scope of this paper."

Yes mathematics is trivial as my professor told me, but Google is much more then algorithms.

It is n datacenters, hardware ....

How large is the global Google index?
__________________
Mini Network:: Financial information at your fingertips
Learn object oriented programming where it started

I will use a search engine before I ask dumb questions.
Reply With Quote
  #10 (permalink)  
Old 01-10-2006, 06:43 PM
dburdon's Avatar
WebProWorld 1,000+ Club
 
Join Date: Oct 2004
Location: Kent, England
Posts: 1,461
dburdon RepRank 2
Default All other things being equal

If all other things were equal google would work just fine. The trouble is they are not.

The Google algo is predicated on the basis that somehow a selection of links taken from the web accurately reflects the relevance of a page to the needs of a searcher.

Now that half the world has a fair idea how the Google algorithm works, they are out there trying to manipulate it. Optimising pages, buying links etc.

Its against this background, that you can see the intrinsic honesty of the straightfoward salesmanship encompassed in pay per click.

Whatever algorithm or update Google puts in place, there will always be attempts to manipulate it.
__________________
Simply Clicks | SEO | SEO Training| Pay Per Click Advertising | Search Engine Powered Marketing
Reply With Quote
  #11 (permalink)  
Old 01-10-2006, 07:05 PM
kgun's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Norway
Posts: 5,944
kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10
Default

buying links etc I dislike that feature myself. Buy your way to the top.

Do we have any ida about wich SE is best at

1. Fighting spam?
2. Finding new sites?

I wote for Google.

If we should mention another SE, one of mine favourites is the Australian SE

Factbites that gives good results on other regions of the world.

Example KW's:

Finance in Russia

Look at the great SERP's. Directory like listings.

Quality is sometimes more important than quantity.
__________________
Mini Network:: Financial information at your fingertips
Learn object oriented programming where it started

I will use a search engine before I ask dumb questions.
Reply With Quote
  #12 (permalink)  
Old 01-10-2006, 07:18 PM
DrTandem1's Avatar
WebProWorld 1,000+ Club
 
Join Date: Oct 2003
Location: Encinitas, CA
Posts: 1,830
DrTandem1 RepRank 2
Default

Quote:
Originally Posted by kgun
buying links etc I dislike that feature myself. Buy your way to the top.

Do we have any ida about wich SE is best at

1. Fighting spam?
2. Finding new sites?

I wote for Google.

Quality is sometimes more important than quantity.
When a new site is buried for up to a year, simply being indexed does no good. Spam? Massive artificial IBLs is all it takes to manipulate Google.

I agree about quality v. quantity. Google has lost the quality battle. Tons of 404 errors that are ages old. This is beyond a quality issue, it goes to purposeful dilution of the SERPs. Like the bar that waters down its liquor on its "special" drinks, you need to pay.
__________________
DrTandem's San Diego Web Page Design, drtandem.com
Reply With Quote
  #13 (permalink)  
Old 01-10-2006, 07:35 PM
dburdon's Avatar
WebProWorld 1,000+ Club
 
Join Date: Oct 2004
Location: Kent, England
Posts: 1,461
dburdon RepRank 2
Default Mystery of SEO

In many respects SEO has become like product placement in films. Do the stars really prefer brand X to smoke, drink, drive etc? Of course not. The brands pay to feature in films. Now the brands pay to feature on the SERPs. Either through an intermediary SEO or through pay per click.

All the SE's have become corrupted. It's just a question of degree and definition. Spam is an arbitrary definition of illicit manipulation. Sure there's crude black hat. But most SEO involves shades of grey where one man's spam is another man's garlic sausage. Given the combined market cap of Google and Yahoo is close to $200 billion it would be a surprise if everything stayed clean and above board.
__________________
Simply Clicks | SEO | SEO Training| Pay Per Click Advertising | Search Engine Powered Marketing
Reply With Quote
  #14 (permalink)  
Old 01-10-2006, 07:36 PM
kgun's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Norway
Posts: 5,944
kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10
Default

Massive artificial IBLs is all it takes to manipulate Google.

Agree.

I am sure, Google are working on other algorithms. I await a new update, where that is the next spam issue.

Since everybody's eyes are on Google, Google is also the number one spam object.

I think that it will be more and more difficult to spam Google.

Look at the meta tag discussions here at WPW. What emphasis Google places on the various tags, will in my view be more and more difficult to know in advance. By the time you have figured it out, there is a new update.

The weakest point in the Google algorithm now is in my opinion exactly link buying.

"To be or not to be, that is the question."

Translated to SE's, IMO

Staying power / consitency over time, that is the question.
__________________
Mini Network:: Financial information at your fingertips
Learn object oriented programming where it started

I will use a search engine before I ask dumb questions.
Reply With Quote
  #15 (permalink)  
Old 01-10-2006, 07:51 PM
kgun's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Norway
Posts: 5,944
kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10
Default

In many respects SEO has become like product placement in films. Do the stars really prefer brand X to smoke, drink, drive etc? Of course not. The brands pay to feature in films. Now the brands pay to feature on the SERPs. Either through an intermediary SEO or through pay per click.

Yes, may be.

One of the most popular Tv programs on Norwegian Tv, "Hotel Cesar" does exactly what you say, David. Brands are shown in seconds. I have been told that they are shown so fast that you (consciously) nearly do not register them.

But people love the program, I think it has been there for more than 4 years.
__________________
Mini Network:: Financial information at your fingertips
Learn object oriented programming where it started

I will use a search engine before I ask dumb questions.
Reply With Quote
  #16 (permalink)  
Old 01-10-2006, 10:39 PM
williamc's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Jul 2003
Location: GoogleVille
Posts: 1,652
williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9
Default

Once again I have read thru an entire thread here where the first 3-4 posts are about the actual topic at hand, and then it just degrades into multiple other topics that have nothing to do with the actual topic.

Nice.
__________________
William Cross
Expert Search Engine Optimization
Man's Best Friend: If you don't believe it, just try this experiment.
Put your dog and your wife in the trunk of the car for an hour. When you open the trunk, who is really happy to see you?
Reply With Quote
  #17 (permalink)  
Old 01-10-2006, 11:59 PM
Mel Mel is offline
WebProWorld 1,000+ Club
 
Join Date: Jul 2003
Posts: 1,919
Mel RepRank 3Mel RepRank 3
Default

OK first off what do we all agree what latent semantic indexing is about?

IMO LSI is best described with regard to search engines as a solution to the problem of "vocabulary mismatch" which means that search engines might not find the same pages for say SEO and Search Engine Optimization and Search engine marketing even though the writers of pages using these terms migght in fact be talking about the same thing but using different words. For a more formal discussion of what LSI as applied to a search engine might be see http://www.cs.utk.edu/~berry/lsi++/node2.html

Next is Google even using LSI in its ranking algorithm?

There are those who say that the Googles purchase of Applied Semantics (who have systems based on LSI) is indicative of the fact that Google is using LSI in the ranking algo.

There are others who say that since you can do a special search in Google by prefixing the search term with a tilde (~) that this search must be based on LSI and therefor Google is using LSI.

My understanding of the Applied Symantics situation is that Google bought Applied Semantics because they were already using one of thier products to determine which Adwords ads were applicable to display for various search terms. I believe that Google are still using this technology in one way or another both Adwords and Adsense, but seriously doubt that it is being used in organic search to any degree.

As an example you can do a bit of research on this by doing a search and noticing that the pages displayed by Adwords to not match up very well with the pages displayed in the organic search alongside, hence the ranking systems must be somewhat different, which leadsme to the conclusion that LSI as used in the Adwords algo is not a major component of the organic ranking algo.


Likewise the ~ search which google says adds synonyms to your search is quite a bit different than LSI in that LSI determines the symantic connections between words in documents and synonyms are simply lists of words that mean the generally the same thing. As an example Search Engine marketingmay be symantically connected to Search engine optimization in many documents but they are not synonyms.

Still others have suggested that Google may be applying LSI not to the text content of webpages but to the anchor text of links that point to them, and for that reason you should vary your anchor text to achieve higher LSI scores, but I have not seen anything more than speculation that this is in fact the case.

I welcome comments on any evidence that Google is in fact using LSI.
__________________
Mel Nelson
Expert SEO | Cheap used cars
Reply With Quote
  #18 (permalink)  
Old 01-11-2006, 12:05 AM
Mel Mel is offline
WebProWorld 1,000+ Club
 
Join Date: Jul 2003
Posts: 1,919
Mel RepRank 3Mel RepRank 3
Default

OK first off what do we all agree what latent semantic indexing is about?

IMO LSI is best described with regard to search engines as a solution to the problem of "vocabulary mismatch" which means that search engines might not find the same pages for say SEO and Search Engine Optimization and Search engine marketing even though the writers of pages using these terms migght in fact be talking about the same thing but using different words. For a more formal discussion of what LSI as applied to a search engine might be see http://www.cs.utk.edu/~berry/lsi++/node2.html

Next is Google even using LSI in its ranking algorithm?

There are those who say that the Googles purchase of Applied Semantics (who have systems based on LSI) is indicative of the fact that Google is using LSI in the ranking algo.

There are others who say that since you can do a special search in Google by prefixing the search term with a tilde (~) that this search must be based on LSI and therefor Google is using LSI.

My understanding of the Applied Symantics situation is that Google bought Applied Semantics because they were already using one of thier products to determine which Adwords ads were applicable to display for various search terms. I believe that Google are still using this technology in one way or another both Adwords and Adsense, but seriously doubt that it is being used in organic search to any degree.

As an example you can do a bit of research on this by doing a search and noticing that the pages displayed by Adwords to not match up very well with the pages displayed in the organic search alongside, hence the ranking systems must be somewhat different, which leads me to the conclusion that LSI as used in the Adwords algo is not a major component of the Google organic ranking algo.


Likewise the ~ search which google says adds synonyms to your search is quite a bit different than LSI in that LSI determines the symantic connections between words in documents and synonyms are simply lists of words that mean the generally the same thing. As an example Search Engine marketing may be semantically connected to Search engine optimization in many documents but they are not synonyms.

Still others have suggested that Google may be applying LSI not to the text content of webpages but to the anchor text of links that point to them, and for that reason you should vary your anchor text to achieve higher LSI scores, but I have not seen anything more than speculation that this is in fact the case.

I welcome comments on any evidence that Google is in fact using LSI.
__________________
Mel Nelson
Expert SEO | Cheap used cars
Reply With Quote
  #19 (permalink)  
Old 01-11-2006, 06:47 AM
kgun's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Norway
Posts: 5,944
kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10
Default

Is it possible to explain the mathematics?

Think of a threedimensional space (room) and arrows pointing from the origo to different places in the space (room). Place documents on the tip of each arrow (vector). Documents that are related, lie close to each other (euclidian distance) in that room. Even if you can not visualize higher dimensions than three, mathematically the computation is done in exactly the same manner (eucledian distance etc. in higher dimensional spaces is analogus).

LSI represents terms and documents in a rich, high-dimensional space, allowing the underlying ("latent"), semantic relationships between terms and documents to be expolited during searching. Each dimension is merely assumed to represent one ore more semantic relationship in the term-document space.

Mathematical methods (singular value decomposition - SVD) are used to reduce the dimension of the term-document space, to filter out noise etc. and find related documents. In this way the underlying semantic relationships between documents are revealed.

LSI statistically analyses the pattern of word usage across the entire document collection, placing documents with similar word usage near each other in the term-document space, and allowing semantically related documents to be near each other even though they may not share terms.

This is simple mathematics (mostly linear algebra learned in under graduate cources in mathematics) and the methods have been known for many years, so it should be no problem for Google to use LSI.
__________________
Mini Network:: Financial information at your fingertips
Learn object oriented programming where it started

I will use a search engine before I ask dumb questions.
Reply With Quote
  #20 (permalink)  
Old 01-11-2006, 07:56 AM
Mel Mel is offline
WebProWorld 1,000+ Club
 
Join Date: Jul 2003
Posts: 1,919
Mel RepRank 3Mel RepRank 3
Default

The math may be rather simple to you (but in all likelyhood not to most posters here) but have you considered the computational necesary resources to run that calculation on 8 billion pages at ranking time?
__________________
Mel Nelson
Expert SEO | Cheap used cars
Reply With Quote
  #21 (permalink)  
Old 01-11-2006, 07:57 AM
Mel Mel is offline
WebProWorld 1,000+ Club
 
Join Date: Jul 2003
Posts: 1,919
Mel RepRank 3Mel RepRank 3
Default

Double post
__________________
Mel Nelson
Expert SEO | Cheap used cars
Reply With Quote
  #22 (permalink)  
Old 01-11-2006, 09:01 AM
kgun's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Norway
Posts: 5,944
kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10
Default

1. Explaining latent semantic indexing (LSI) in more detail as I see it.
Let the space be three (x,y,z) dimensional n=3 (a room, with the origin in the center of the room so you can have negative elements on the x,y and z axis).

Pick two documents in that room. If the arrows stands vertical on each other (orthogonal) there is no relation between the two documents. They are not candidates in the same search query.

If they lay in the same two dimensional plane, the space collapses to a plane, n=2. If they lay on the same arrow it collapses to the real line. In both cases, the documents have related content.

If two documents are placed on the same arrowhead, they should be (near) identical depending on how accurate the algorithme is (at flitering out noise (spam) etc).

2. Computational burden.
How are the space represented? If you think of C++ that is used in LSI, a three dimensional space is a pointer to a pointer to a pointer (pointer***). Similarily for higher dimensions.

Today, there are a lot of C++ liberaries with container classes. The computational burden of course, dependes on the dimensionality, number of elements (documents) and methods that are implemented to operate on the vectors (arrows). Should not be a problem with high dimensional systems with todays computing power and paralellism.

May be (distributed) beta (objects) - see toolbar - may be used to make the computations more compact and efficient.
__________________
Mini Network:: Financial information at your fingertips
Learn object oriented programming where it started

I will use a search engine before I ask dumb questions.
Reply With Quote
  #23 (permalink)  
Old 01-11-2006, 11:13 AM
Mel Mel is offline
WebProWorld 1,000+ Club
 
Join Date: Jul 2003
Posts: 1,919
Mel RepRank 3Mel RepRank 3
Default

Yes Kgun, You can explain it that way if you like, but I doubt that such an explaination will be helpful to many who do not practice mathematics daily.

You seem to have a good handle on LSI, do you agree that for the average searcher, webmaster and SEO that it simply is a way for search engines to find related pages which cannot be found by lexilogical analysis?

IMO we do not have to have a mathematical understanding of the process to know if it works or is useful, sort of like you don't have to know the stress levels in the connecting rods of you car engine in order to use it and find it useful.
__________________
Mel Nelson
Expert SEO | Cheap used cars
Reply With Quote
  #24 (permalink)  
Old 01-11-2006, 11:27 AM
kgun's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Norway
Posts: 5,944
kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10
Default

You seem to have a good handle on LSI, do you agree that for the average searcher, webmaster and SEO that it simply is a way for search engines to find related pages which cannot be found by lexilogical analysis?

Correct as far as I understand. I am not an expert. I have only tried to boil down my mathematical understanding of LSI in the hope that it may be helpful to other members / surfers.


IMO we do not have to have a mathematical understanding of the process to know if it works or is useful, sort of like you don't have to know the stress levels in the connecting rods of you car engine in order to use it and find it useful.

Agree. See my two last posts in the thread about information.
__________________
Mini Network:: Financial information at your fingertips
Learn object oriented programming where it started

I will use a search engine before I ask dumb questions.
Reply With Quote
  #25 (permalink)  
Old 01-12-2006, 11:52 PM
Mel Mel is offline
WebProWorld 1,000+ Club
 
Join Date: Jul 2003
Posts: 1,919
Mel RepRank 3Mel RepRank 3
Default

Well two days have passed with no one responding to my request for any substantiation that Google is in fact using LSI as a part of its ranking alog, so I guess its fair to say that no one has any evidence to support this assumption, and I guess then we should just let the matter die of old age.
__________________
Mel Nelson
Expert SEO | Cheap used cars
Reply With Quote
  #26 (permalink)  
Old 01-13-2006, 04:53 AM
kgun's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Norway
Posts: 5,944
kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10
Default Archimedis: "Do not touch my circles" :-)

I think that to give a definite answer to that question is like saying that you can square the circle, that is construct a square with side lenght r*sqrt(pi). Seemingly a very simple problem.

You can not solve that problem numerically with all available computing power on the earth.

The Google algorithm is perhaps so complex now, that it is nearly impossible to simulate every aspect of it. If you look at the original paper above, Google uses linear algebra to compute the page rank. There is no problem as I said above for Google to use LSI as an element in their indexing. Most probably they use it or a more general method that has LSI as a special case.

Only my guessing, even without having studied the original paper or its relation to LSI in any detail.

GoogleBOT: "Do not touch my circles" :-)
__________________
Mini Network:: Financial information at your fingertips
Learn object oriented programming where it started

I will use a search engine before I ask dumb questions.
Reply With Quote
  #27 (permalink)  
Old 01-13-2006, 01:19 PM
williamc's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Jul 2003
Location: GoogleVille
Posts: 1,652
williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9williamc RepRank 9
Default

Mel didnt ask to prove that google was using it, merely to supply evidence that they might be using it. He wanted to see what reasons people had for thinking google might be using LSI at this stage.
__________________
William Cross
Expert Search Engine Optimization
Man's Best Friend: If you don't believe it, just try this experiment.
Put your dog and your wife in the trunk of the car for an hour. When you open the trunk, who is really happy to see you?
Reply With Quote
  #28 (permalink)  
Old 04-14-2008, 01:22 PM
kgun's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Norway
Posts: 5,944
kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10
Default Re: LSI/LSA – Is MSN Ahead of GOOGLE in AI?

Quote:
Originally Posted by Mel View Post
The math may be rather simple to you (but in all likelyhood not to most posters here) but have you considered the computational necesary resources to run that calculation on 8 billion pages at ranking time?
Late updated reply, but a reply, though.

You may use grid hosting for search tool.

The position of independent documents, should be orthogonal to each other. So you may use "orthogonal principal components decomposition" to find less related documents. The inverted link matrix of the web will give related (linked) documents.

Of course, Google, know all about this.
__________________
Mini Network:: Financial information at your fingertips
Learn object oriented programming where it started

I will use a search engine before I ask dumb questions.

Last edited by kgun; 04-14-2008 at 01:34 PM.
Reply With Quote
  #29 (permalink)  
Old 04-15-2008, 06:13 PM
Peter (IMC)'s Avatar
WebProWorld MVP
WebProWorld MVP
 
Join Date: Dec 2003
Posts: 1,485
Peter (IMC) RepRank 4Peter (IMC) RepRank 4Peter (IMC) RepRank 4Peter (IMC) RepRank 4
Default Re: LSI/LSA – Is MSN Ahead of GOOGLE in AI?

Quote:
Originally Posted by DrTandem1 View Post
I think your premise is false. Google relies on IBLs, not content. There is little that is intelligent about Google.
You just don't like Google do you?

Quote:
LSI/LSA – Is MSN Ahead of GOOGLE in AI?
At the risk of showing my lack of knowledge about AI and LSI/LSA , I'm going to share my thoughts. Artificial Intelligence to me is more about a machine that learns as information is fed into it, like our brains. Latent Semantic Indexing is more like some mathematical logic to see how words relate to eachother inside an "index". I don't see how you can call LSI/LSA, Artificial Intelligence.

Perhaps because the index of a search engine is constantly updated and you can therefore say that new information is fed into it. But as I understand it, LSI/LSA does not include some kind of memory function, so when data is no longer available in the index, it's not going to use memorized information. To me that would be an important part of AI. (Though I guess it is possible to make the algorithms use no longer available, memorized info, as well.)

As to MSN, as far as I understood it, they´re trying to really use artificial intelligence, where Google is still more going after the algorithmical approach. Perhaps these 2 things are pretty much the same thing. Overall though, I think we´re just on the level of "information retrieval" and far from actual Artificial Intelligence.
__________________
FREE SEO ! Really? YES! All you have to do is implement it!
Follow me on Twitter PeterIMC
Reply With Quote
  #30 (permalink)  
Old 04-15-2008, 06:27 PM
Peter (IMC)'s Avatar
WebProWorld MVP
WebProWorld MVP
 
Join Date: Dec 2003
Posts: 1,485
Peter (IMC) RepRank 4Peter (IMC) RepRank 4Peter (IMC) RepRank 4Peter (IMC) RepRank 4
Default Re: LSI/LSA – Is MSN Ahead of GOOGLE in AI?

Quote:
Originally Posted by Mel View Post
Well two days have passed with no one responding to my request for any substantiation that Google is in fact using LSI as a part of its ranking alog, so I guess its fair to say that no one has any evidence to support this assumption, and I guess then we should just let the matter die of old age.
Actually I do believe they use LSI as part of their algorithms. You can test it by using a * in your searches.

Search for: * something

The star can be any word Google seems fit. The ideal is that they come up with something that makes sense in language and not just what ever. LSI would make that a lot easier.

Take 2 combinations of 2 words, where the last word is the same. Choose them such that you know for sure that one combination is more commonly used than the other. If LSI is used, It is more likely that they will choose the most used one to rank high when doing the * something search.
__________________
FREE SEO ! Really? YES! All you have to do is implement it!
Follow me on Twitter PeterIMC
Reply With Quote
  #31 (permalink)  
Old 04-15-2008, 07:36 PM
kgun's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Norway
Posts: 5,944
kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10
Default Re: LSI/LSA – Is MSN Ahead of GOOGLE in AI?

Quote:
Originally Posted by Peter (IMC) View Post
At the risk of showing my lack of knowledge about AI and LSI/LSA , I'm going to share my thoughts. Artificial Intelligence to me is more about a machine that learns as information is fed into it, like our brains. Latent Semantic Indexing is more like some mathematical logic to see how words relate to eachother inside an "index". I don't see how you can call LSI/LSA, Artificial Intelligence.

Perhaps because the index of a search engine is constantly updated and you can therefore say that new information is fed into it. But as I understand it, LSI/LSA does not include some kind of memory function, so when data is no longer available in the index, it's not going to use memorized information. To me that would be an important part of AI. (Though I guess it is possible to make the algorithms use no longer available, memorized info, as well.)
My bolding:

Great comment Peter. Clever man.
__________________
Mini Network:: Financial information at your fingertips
Learn object oriented programming where it started

I will use a search engine before I ask dumb questions.

Last edited by kgun; 04-15-2008 at 07:42 PM.
Reply With Quote
  #32 (permalink)  
Old 09-19-2008, 10:49 AM
kgun's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Norway
Posts: 5,944
kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10kgun RepRank 10
Default Re: LSI/LSA – Is MSN Ahead of GOOGLE in AI?

Any updated information on this subject?
__________________
Mini Network:: Financial information at your fingertips
Learn object oriented programming where it started

I will use a search engine before I ask dumb questions.
Reply With Quote
Reply

  WebProWorld > Search Engines > Google Discussion Forum

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On



All times are GMT -4. The time now is 01:11 PM.



Search Engine Optimization by vBSEO 3.3.0