|
|
||||||
|
||||||
| Index Link To US Private Messages Archive FAQ RSS | ||||||
| Google Discussion Forum Google Discussion forum is for topics specifically related to Google. There is a subforum dedicated to AdSense/AdWords subjects. |
Share Thread: & Tags
|
||||
|
![]() |
|
|
LinkBack | Thread Tools | Display Modes |
|
|||
|
Quote:
Then they do one of two things:- (1) they go and get the previous link text words and pointers, and store them in the short barrels, because they now have the page itself, or (2) they do nothing, because that data is already in the short barrels. I see no reason NOT to put the link data in the short barrels straight away, and I see good reason to put it there straight away. Some circumstantial evidence is that we sometimes see a URL-only listing in the normal serps, when Supplementals have not been included. That could happen because the link text data alone (in the short barrels) is sufficient to get the URL ranked. |
|
|||
|
Quote:
|
|
|||
|
Quote:
Are there a lot of sites which have their rankings back? |
|
|||
|
Addition to my reply to Dave a few posts back...
Remember that the barrels contain individual words, together with some data about each word - they don't contain bunches of text from pages or from links. Also remember that the words in links texts and the words in a page are different things - they are usually on different pages, even though the URL (page) that they are associated with is the same. So no duplication occurs in the short barrels if they store the link text words in them, and then later store the words from the page in them. The only difference between a URL-only listing, and a listing that is based solely on link text (e.g. miserable failure) is that Google has the page that is associated with the miserable failure listing, but they don't have it for the URL-only listing. I.e. the queries are handled identically, and then the programme checks to see if they have a page or not for the URL. If yes, then get a snippet; if no, then only show the URL. That's the way I see it, anyway. |
|
|||
|
Quote:
Dutch site, young site, ... I don't think they are going to spend much time about that. No problem, this has brought to my attention that i should optimise for other SE's also(Google has 85% here) and hope that they win popularity :-) . I learned a lot with this Gproblem, started experienting with adwords, ... So it's not al bad but i prefered to see my site grew. I was planning to go for only organic results and see how long it takes to get results and at whiwh time you get which results. So it was something to learn but i think the learning stage is over. This has a to big effect to take as example of how a site grows natural. But Google has lost my faith here and that never happened before. That is something that is going to play when i make future decissions about advertising and all. I wil also look at other players now. |
|
||||
|
Dave,
I reallly believe that the associated issues here surrounding on-line stores right now are an important and timely issue, so I broke it off over here: http://www.webproworld.com/viewtopic.php?t=61723 Ken |
|
||||
|
Quote:
Take a look at my URL only result on page 18 of one of the affected DC's... http://216.239.59.99/search?q=%22St....&sa=N&filter=0 Every single result for this search, all 18 pages, is marked supplemental except my URL only partially indexed result. Now what do you think? Dave |
|
|||
|
I can see what you are getting at, Dave. But it's not the last result, and non-supplementals don't always outrank supplementals.
I've explained why I see no reason to store a link's data in a different index to where it will be when the target page is fetched and parsed (it will end up there anyway, and the data is useable for rankings without the associated page's data), and your example doesn't change my thinking. I can suggest a reason why the results are as they are ... Since a query in the short and long barrels only returned one result (yours), they went to the supplemental index to try and find more. That's the normal way that they do things. Having found more, they then ranked the results set, and returned them in the order that we see. |
|
|||
|
I find it very strange that Google keeps silence about this.
There's something very weird with this case. I also don't find any info in any SEOnewsletter. And that after a week of heavy problems.(Why? Isn't it important enough?) Is there any clue on how much sites are affected and how many recovered? Are it just a few sites and why just those sites? Answering to these questions is what Google must do, correction, what they had to.
__________________
deInternetMarketeer - SEO |
|
|||
|
I'm not insisting on it. I'm only saying that it makes good sense to me to put the data in the short barrels, and poor sense to me to put it elsewhere. I can't say that I'm right about it, because there may be plenty of things that I haven't considered. I'm only seeing it as an overview.
I can't see the data being in the supplemental index though - else why isn't the listing marked as supplemental. |
|
||||
|
Quote:
Nearly 2 years ago they found the URL. Shortly after finding the URL they went and fully indexed that page. No reason for them to keep the URL only partially indexed page anymore now that they have fresh data. They're not going to throw it away, and keeping that page in the same index as the newly crawled page makes no sense. If the URL only was in the regular index, then doing a search for it would pull it up on one of the unaffected DC's. Guess what... it does not. Why? Because they have the fully indexed page showing as a regular result. Dave |
|
|||
|
Just because the page hasn't been crawled, parsed and stored, doesn't mean that the listing can't be marked as supplemental in the serps, when it's been acquired from the supplemental index. There's nothing preventing them from including the words "Supplemental result" with a URL-only listing.
You seem to have overlooked something, Dave. You are still talking as though the partially indexed data and the crawled page's data are the same things - at least in part. But they aren't. They are completely different things, and completely different data. In other words, the data that they have for the URL-only listing is not only different from the data that they have when the page has been crawled and parsed, but it's also needed in addition to the page's data, AND it is stored in the short barrels along with any data from the page that they also store in the short barrels (e.g. the words in the Title). The data for the URL-only listing consists of the words that were found in the link text on a different page, and the target URL. They don't come from the URL's page at all. So when the target URL's page is fetched and parsed, there is no duplication of data whatsoever. Both lots of data are needed, and both the link text (URL-only) data and some of the page's data, are stored in the short barrels. That's why I say that it makes very good sense to store the URL-only data in the short barrels as soon as they get it. It's going to finish up there anyway (it's never going to be thrown away or moved away - it's the very backbone of their ranking algo), and in the meantime it can be used for rankings without the page itself needing to be fetched and stored. The URL-only data is the link text data that Google bases their rankings on to such a great extent. The data from the page is the content data, and it's completely different from the link text data. They acquire both lots of data at different times, and there's no duplicated data to move to someplace else. You are assuming that there's an overlap of data, but there isn't an overlap. One is link text for the URL, and the other is content for the URL. |
|
||||
|
Phil,
No, there's nothing preventing them from marking the listing as "supplemental", I'm suggesting that they are not doing it. Prior to this mess, the only page that could be found was the fully indexed page. You could not find the URL only page. This tells me that the URL page was not in either the short or the long barrel as a separate page otherwise it would have showed up in a search. Currently, in the affected data centers, you can only find the URL only listing. You cannot find the fully indexed page. This tells me that the fully indexed page was not dumped into the supplemental index but rather had been lost or the data could not be retrieved. Currently, in the repaired DC's, the only page that shows up is the fully indexed page. You cannot find the URL only page. Finally, here's a quote from GoogleGuy... Quote:
|
|
|||
|
Quote:
If they are happy with this then i think that they are going to loose marketshare. SERP's are less relevant then before and this started in februari. They have blown it in my eyes, the longer they wait to make it clear, the longer it's going to take for recovery. Perhaps they lost the battle and have to start all over |
|
|||
|
I could discuss alternative interpretations about the rest of your post with you, Dave, but the most interesting part is the GG quote. How old is it?
The quote makes it sound like the data (link text) from uncrawled URLs really is stored in supplemental, and I can't argue with that. The only thing I can do now is notice when a URL-only listing is shown in the serps to see if other supplementals are also included, or not. If no others are included, I'd still go with my theory, in spite of GG's comment, but if they are, I'd have to abandon my theory. Either way, it still makes much more sense to me to store the URL-only data in the short index from the start (it's never stored in the long index), because it has to be copied to there at some stage, or the links-based algo won't work. |
|
|||
|
I just had a thought.
Google has talked about the Supplemental index as being part of their Auxiliary index. I wonder if another part of the Auxiliary index is an index where URL-only data is stored until the URL's content is fetched and indexed. How's that for a spanner in the works? :-) |
|
||||
|
Quote:
Now, it is possible for a URL only listing to be in the regular index. The thing to note is when you see a URL only result in the SERP's if there is a cache. This is an indication that the URL has been previously crawled but on a subsequent crawl there were unable to retrieve any data. In this is the case, then it would be in the regular index. In the case that we have been discussing regarding my site, it's still my belief that every one of those URL only results are in the supplemental index and directly related to the problem Google is having right now. Too many things point to this being true to think otherwise. If I am indeed correct, I believe I am, then none of the pages have been "moved" to the supplemental index. They are pages that have been there all along and the pages in the regular index have either been lost or they are unable to be retrieved because of this problem. Since when does Google makes sense? :) Dave |
|
||||
|
Quote:
Dave |
|
||||
|
Quote:
But they have stopped talking about it, sounds like to me there leaving it as is. Has anyone seen anything change since they said they had fixed the problem? |
|
|||
|
Quote:
Maybe they were aware that this could happen. Maybe they knew it would happen. Who wil tell? |
|
|||
|
Why ask for feedback from those still effected if they know what the problem is.
They are obviously as confused as we are. However they know their algos better than we do so lets give them just a little more time to fix it all before storming the castle.... |
|
||||
|
Quote:
Dave |
|
|||
|
I have received a reply from Matt Cutts via his blog:
Quote:
|
|
||||
|
Quote:
But like you said before it also sounds like they do not have a clue what is causing the problem. |
|
||||
|
Quote:
Dave |
|
|||
|
Reminds me of the good ole' "Google is Broken" days.
I will be excited to see this "fixed". My experience historically is that Sups are pages on the way out of the G Index, not nice to see your nice DMOZ listed site sitting in mostly in Supplemental Land. :rolleyes: |
|
||||
|
williamc,
Seems like "gray" usually turns "black"... Just another greeneagle philosophical "rant"! There's good "medicine" here: http://www.seo4fun.com/notes/supplementals.html IMO - I believe they have an antedote, for some. Ken |
|
||||
|
Once again Ken, you pasted that before, that has nothing to do with the current google screw-up. Those are the old regular things about the supplemental index. I have actually gone thru it and compared it to some sites still messed up and see no corelation.
|
|
||||
|
Quote:
What I suspect the common thread may be is a particular time period when sites were being crawled, reindexed, etc., or perhaps a particular bot, and the data that was being retrieved was unusable. The data in the main index was replaced with the unusable data. This lead to them having to use the supplemental results. Dave |
|
|||
|
Quote:
I have no idea what effect it has on the bot when i deny it to crawl my site otherwise i tried it. If you deny acces for a few houres and then reallow it, will the bot then return immediately? |
|
|||
|
Can I get back to the thought that this issue is really about the loss of "good" pages from the index leaving only the "Good" homepage.
I think the supplemental pages are a bit of a red herring as in our cases most of them should be there as they are "bad" (old, orphaned, deleted etc) and were probably there all along. They have been simply hidden from view as there were previously more important pages to list before they dissapeared. The issue is where have the "good" pages gone and when are they coming home! PS: Love the "greeneagle effect" gauge. |
|
|||
|
Assuming that the problems we see/saw are the same, then the pages *were* being put in the supplemental index - good pages, et al.
I can't see it being a crawling or bot problem, Dave, because the homepages weren't supplementaled along with the rest. |
|
|||
|
Quote:
Since a week it is not visited. Last time i checked it was at least 2 times a day. Somebody else who have this? |
![]() |
|
| Thread Tools | |
| Display Modes | |
|
|
|
WebProWorld |
Advertise |
Contact Us |
About |
Forum Rules |
MVP's |
Archive |
Newsletter Archive |
Top |
WebProNews
WebProWorld is an iEntry, Inc. ® site - © 2009 All Rights Reserved Privacy Policy and Legal iEntry, Inc. 2549 Richmond Rd. Lexington KY, 40509 |