Relevance challenges for search engines
Hamlet Batista put up an excellent piece on SEOMoz a few days ago, entitled 7 Reasons Why Search Engines Don’t Return Relevant Results 100% of the Time. In it, he describes, funnily enough, the seven reasons why search engines don’t return relevant results:
- Relevance is subjective
- Natural language searches
- Poor queries
- Synonymy
- Polysemy
- Imperfect performance
- Spam
Batista really does a superb job of exploring each of these reasons; I’m just going to additionally touch on a few of them here.
1. Relevance is subjective
Let’s start with the first one, ‘relevance is subjective’. Batista describes it this way:
You can do a search for ‘coffee’ in Canada and find Tim Horton’s website as the most relevant. Makes sense, as that’s the most popular coffee chain in Canada, but for somebody in Seattle, Starbucks might be the most relevant result. You can do a search for the ‘49ers’ and be looking for the football team, but a historian may be looking for research material on California. And you might even do a search today for ‘bones’ trying to find where to buy your dog a treat, but tomorrow you do that same search looking for an episode of the TV series ‘Bones’ that you missed the night before.
…So far the best approaches the search engines have come up with are the use of human quality raters and personalized search. The better the search engines profile the searcher, the higher the chances of producing relevant results. This method obviously raises a lot of privacy concerns.
At VortexDNA, of course, we take the concept of subjective relevance a step further than location or job description; we suggest, and have shown, that it is profoundly affected by the user’s core purpose and values.
He also suggests that personalization inevitably leads to privacy concerns—only true for methods that rely on tracking history and demographics. When values are used to calculate relevance, there’s no need to track search history or clickstream.
2. Natural language searches
Next on Batista’s list is the use of natural language in search queries:
A search engine, on the other hand, receives ‘who has smith as last name in chicago’ or ’smith last name chicago’. The query is in natural language — our language.
Is it, though? When was the last time you spoke with a person and said, ‘Smith last name chicago’? I submit to you that we are far more demanding of our search engines than we are of any human being. Look at Batista’s examples under the previous point about ‘bones’ and ‘coffee’. Would you go t an information desk and ask, ‘Bones?’ When they’re put forth as search examples, though, we don’t question them; it’s in fact a highly plausible scenario for us to a word or two at a search engine and then be disappointed when they’re unable to disambiguate our queries.
That’s not natural language; it’s unreasonable expectations. It also leads into Batista’s next point:
3. Poor queries
His description of poor queries include colloquialisms (like ’sucker’ for vacuum) and misspellings. As I stated above, I think poor queries also includes minimalist terms and odd syntax. We couldn’t expect a human being to know what we were after with those words, but we do hope for a machine to guess our intent.
6. Imperfect performance
As I said, I’m only going to touch on some of the seven, so we’ll skip synonymy and polysemy and go straight to imperfect performance. Batista says that the two criteria that define search performance are precision and recall:
Precision is a measure of how efficient the search engine is in returning only the relevant results for the search. The more irrelevant results, the lower the precision. Recall, on the other hand, measures how good the search engine is in returning all the relevant results. (Of course, this assumes the researcher knows how many relevant results there are.) The more relevant results missing from the search, the lower the recall.
Ideally, a search engine should identify all relevant documents without returning any irrelevant ones (100% precision and 100% recall). In practice, this has been proven to be impossible, as precision and recall are inversely proportional.
It sounds like Heisenberg’s Uncertainty Principle, which states that
…it is impossible to perfectly measure a particle’s position and velocity at the same time. The more accurately you measure a particle’s position, the more inaccurate your measure of its velocity, and vice versa.
It may sound strange that I’m citing quantum physics when we’re talking about search engine performance, but I call parallels where I see ‘em, thank you very much.
The point here is not only that it appears to be impossible to achieve perfect precision and perfect recall simultaneously, but also that the aim should be to find the optimum tension or balance between the two. At what point does declining recall produce diminishing returns for incremental increases in precision, and vice versa?
I have some additional questions about these measures; namely, how precision and recall can be defined when we already know that relevance is subjective (see point 1). They do, however, serve as valuable parameters for putting search improvement efforts in context.
I really appreciate Batista’s skill in describing these seven challenges, and I believe we’re only scratching the surface here. What do you see as the biggest challenge search engines face in delivering the results you want?





July 24th, 2007 at 4:58 am
Kaila - This is an excellent analysis of my post. Thank you!
Sounds like you have something big going on here. Great content and impressive product!
July 25th, 2007 at 9:08 am
Thanks, Hamlet! I appreciate the kind words, and welcome any feedback you have for us.
All the best,
Kaila
July 25th, 2007 at 12:41 pm
I see that you are interested in search engines. You should take a look at www.linguisticagents.com. It’s a start-up company that has developed a natural language understanding technology that will be used in many applications in addition to search. Let me know what you think - techshrek@gmail.com.
September 27th, 2007 at 8:15 am
thank you for your post.
I think that there are 2 real problem because they are fundametal:
#Relevance is subjective
#Natural language searches
All others is the sybject of work for search-engines companies and I think we will get more and more improvments in this field.
StumbleUpon is a good step ahead. It can fight some of other reasons.
October 1st, 2007 at 3:30 am
Hello Vasiliy,
Thank you for your comment! Those are certainly two significant issues in getting more relevant search results. I would suggest that even those fundamental issues are being addressed by search engine companies. Powerset and Hakia are hard at work deconstructing natural language. Our own VortexDNA technology is a solution for subjective relevance.
There are so many possibilities open to us—this is truly an exciting time.