Pages

Saturday, December 10, 2016

How to deal with the synonyms. Awesome examples

http://opensourceconnections.com/blog/2016/12/02/solr-elasticsearch-synonyms-better-patterns-keyphrases/

Monday, October 24, 2016

Visualizing Geohash

I've rescued a lost article on visualizing geohashes

Wednesday, October 5, 2016

Good explanation of how the DisMax works (despite that it based on Solr, but not my favorite Elasticsearch)

http://opensourceconnections.com/blog/2013/03/07/the-anatomy-of-a-dismax-query/
There you can find some useful ideas how to combine search and graphs together to improve your relevance.

http://blog.comperiosearch.com/blog/2014/02/05/dynamic-search-ranking-using-elasticsearch-neo4j-and-piwik/ 

Sunday, October 2, 2016

Very interesting article that has a link on the interesting tool.

Friday, August 19, 2016

http://lucidworks.com/blog/2016/08/17/learning-to-rank-solr/ - very interesting presentation on ranking search results (and measuring the results) on Bloomberg
http://lucidworks.com/blog/2016/08/17/learning-to-rank-solr/ - very interesting presentation on automatic ranking the search results (and measuring the results) on Bloomberg

Friday, July 15, 2016

Finding You the Best Hotel at LateRooms.com with Elasticsearch

https://www.elastic.co/blog/finding-you-the-best-hotel-at-laterooms-com-with-elasticsearch

Good overview. Some of the ideas and approaches are the same as ours.

Monday, June 13, 2016

proper sorting is not that easy

Trying to sort site-wide search results on most e-commerce sites typically end up a mess, with irrelevant search results being propelled to the top of the list as users sort by price, customer ratings, etc

http://baymard.com/blog/faceted-sorting

Tuesday, June 7, 2016

Basic NLP with the bash https://gibrown.com/2013/01/26/unix-bi-grams-tri-grams-and-topic-modeling/

Thursday, May 5, 2016

Bags of Words vs. Bags of Things

 What most users don’t know is that the search engine is answering a different question than they are asking – it returns the documents that have the words that they entered (or synonyms if the application designers have provided this) which is sometimes not exactly “what” they are looking for.

https://lucidworks.com/blog/2014/07/02/automatic-phrase-tokenization-improving-lucene-search-precision-by-more-precise-linguistic-analysis/
The user is telling the search engine what they want. They are looking for specific things, not specific words. The more that we can do to redress this mismatch between tokens and things, the better the user experience will be.

Monday, April 11, 2016

I've found new awesome blog on search - http://opensourceconnections.com/blog/

Some of the articles in that blog:
And many other

Wednesday, February 10, 2016

Product Insights for Airbnb

Got a lot of fresh ideas how to improve our ranking model

Monday, February 1, 2016

lucene solr commit history 1

Tuesday, January 19, 2016

Martijn van Groningen at #bbuzz 2014

Very clear introduction to percolators

Query understanding

Good short overview