Comments on: Relevance in Mini and GSA searches
/2006/03/07/relevance-in-mini-and-gsa-searches/
Google Search Appliance and Google Mini development
Fri, 14 Mar 2014 15:00:46 +0000

By: Joel (Thu, 10 Jan 2008 16:25:14 +0000) /2006/03/07/relevance-in-mini-and-gsa-searches/comment-page-1/#comment-5969

Currently the nutch-IICE open source project is similar to the Google GSA. You can take a look at it.

http://nutch-iice.sourceforge.net/

By: Paul (Fri, 24 Mar 2006 22:34:15 +0000) /2006/03/07/relevance-in-mini-and-gsa-searches/comment-page-1/#comment-16

Hi Jim,

Nope, I’d have to put a software update in to change something like that and there haven’t been any for the Mini recently. Funnily enough, I’ve just done a software update on the GSA I work on and RK is still there and acting like it did before.

The Mini & GSA don’t use the same ranking algorithm as big Google, and you can use RK to give an indication of how good a result is for a search (i.e. turn it into stars or something), so I doubt they’ll turn it off in the search appliances.
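
As a rough illustration of the "turn it into stars" idea, here is a minimal Python sketch. It assumes the appliance's XML results list each hit as an `R` element containing `T` (title) and `RK` children, and that RK runs from 0 to 10; the sample XML, the URLs and the function names are made up for the example, not taken from a real appliance.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample shaped like the XML results (assumed layout);
# real output contains many more elements per result.
SAMPLE_XML = """<GSP><RES>
  <R N="1"><U>http://intranet.example/a</U><T>First</T><RK>6</RK></R>
  <R N="2"><U>http://intranet.example/b</U><T>Second</T><RK>5</RK></R>
  <R N="3"><U>http://intranet.example/c</U><T>Third</T><RK>0</RK></R>
</RES></GSP>"""


def rk_to_stars(rk, max_rk=10, max_stars=5):
    """Map an RK value (assumed to be 0..max_rk) to a star string."""
    rk = max(0, min(rk, max_rk))  # clamp out-of-range values
    # Integer arithmetic that rounds halves up (5/10 -> 3 of 5 stars).
    filled = (rk * max_stars + max_rk // 2) // max_rk
    return "*" * filled + "-" * (max_stars - filled)


def results_with_stars(xml_text):
    """Return (title, rk, stars) for each R element in the results."""
    root = ET.fromstring(xml_text)
    rows = []
    for result in root.iter("R"):
        rk = int(result.findtext("RK", default="0"))
        rows.append((result.findtext("T", default=""), rk, rk_to_stars(rk)))
    return rows


if __name__ == "__main__":
    for title, rk, stars in results_with_stars(SAMPLE_XML):
        print(f"{title}: RK {rk} {stars}")
```

The five-star scale and the rounding rule are arbitrary choices; any monotonic mapping from RK to a display rating would do the same job.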

It’s rather interesting that they have turned it off within the main API though. If it wasn’t doing something, you’d have thought they’d leave it on (and indeed if it wasn’t doing something, what’s it doing there in the first place?)

By: Jim Westergren (Fri, 24 Mar 2006 20:32:35 +0000) /2006/03/07/relevance-in-mini-and-gsa-searches/comment-page-1/#comment-15

Hi,

Google has now put all RK values to zero for all URLs.

Either some temporary glitch OR Google didn’t like that value to be public …

Is it the same for the Google Mini?

By: Paul (Wed, 08 Mar 2006 16:17:39 +0000) /2006/03/07/relevance-in-mini-and-gsa-searches/comment-page-1/#comment-13

There’s no doubt ‘RK’ is a way of scoring a page in the Mini and GSA. What it may be in the main Google API is another matter entirely.

Working out the ranking system in the Mini is one of the things on my ‘to do’ list, which is unfortunately filled with other stuff as well.

From what I’ve seen, on-page factors have a much greater effect than they do in big Google, but interlinking does still have an effect. I’m not sure yet about the effect of where a document sits in an overall site or directory tree. It can be difficult to assess that without outputting large numbers of test pages, which I haven’t had time for.

Of course, it might be that interlinking doesn’t have much effect because there isn’t much interlinking in the relatively small datasets that I’m working with. That’s another thing I’m going to have to test. It could be that relatively few links will have a very large effect, because there generally wouldn’t be a lot of linking to the same place on an intranet, which is what the search appliances were generally made to search.

By: Dayo_UK (Wed, 08 Mar 2006 14:45:02 +0000) /2006/03/07/relevance-in-mini-and-gsa-searches/comment-page-1/#comment-12

Ok, thanks.

So there is little doubt that the rk tag is a way of scoring a page. Obviously the way of scoring a page in Google Mini is a lot different to Big Google.

Is it all on-page factors?

Or would Google Mini recognize a more important document by the number of references to it or where it sits in the directory tree?

By: Paul (Wed, 08 Mar 2006 12:01:05 +0000) /2006/03/07/relevance-in-mini-and-gsa-searches/comment-page-1/#comment-11

Yup, the results are shown in relevancy order by default. Several results can share the same RK rating, so it must keep a decimal value internally, or sort by something else as well, so you can get…

First – 6
Second – 6
Third – 5
Fourth – 5
Fifth – 5
Sixth – 2
Seventh – 2
Eighth – 0
Ninth – 0
etc.

This is consistent with what I’ve seen. It’s not like PageRank in big Google, where you can have a PR2 page come higher up than a PR5 page in a set of results.
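
The ordering claim can be checked mechanically once the RK values of a result page have been read off in display order. This is only a sketch: the helper names are made up, and the two lists are the example values quoted in this thread rather than output from a live appliance.

```python
from itertools import groupby


def is_sorted_by_rk(rk_values):
    """True if no result has a higher RK than the result listed before it."""
    return all(a >= b for a, b in zip(rk_values, rk_values[1:]))


def tie_groups(rk_values):
    """Collapse runs of equal RK values into (rk, count) pairs."""
    return [(rk, len(list(run))) for rk, run in groupby(rk_values)]


# RK values in display order, as in the list above.
observed = [6, 6, 5, 5, 5, 2, 2, 0, 0]
# The ordering asked about in the thread: an RK of 4 listed after an RK of 2.
hypothetical = [5, 4, 2, 4]

if __name__ == "__main__":
    print(is_sorted_by_rk(observed))      # non-increasing order
    print(tie_groups(observed))           # runs of tied RK values
    print(is_sorted_by_rk(hypothetical))  # order broken by the last value
```

The tie groups show where a hidden decimal score or a secondary sort key would have to decide the order, since RK alone cannot.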

NB: The top result doesn’t always have an RK of 10, with the rest decreasing from that, so the Mini must have some sort of relevancy algorithm that says “this is the most relevant page for the search, but I still only give it a 5/10 for relevancy for the term.”

By: Dayo_UK (Wed, 08 Mar 2006 11:44:05 +0000) /2006/03/07/relevance-in-mini-and-gsa-searches/comment-page-1/#comment-10

Oops, the comment form does not accept tags. The rk tag should appear before the numbers in the above posts, e.g. RK of 5, 4, 2 and 4.

Cheers

Dayo

By: Dayo_UK (Wed, 08 Mar 2006 11:42:47 +0000) /2006/03/07/relevance-in-mini-and-gsa-searches/comment-page-1/#comment-9

Hi,

So results are returned in relevancy order by default.

So if you choose to display the value, you will get results like this?

First Title
First Desc
First URL – 5

Second Title
Second Desc
Second Url – 4

etc

So the RK seems to directly reflect the relevancy in a relevancy search? Or can an RK of 2 show higher than an RK of 4 in a relevancy-ordered search?
