ABOUT SEARCHING AND SEARCH PRACTICES

437 bookmarks

Applying Context Aware Spell Checking in Spark NLP

Introduction

·medium.com·Oct 6, 2022

Visualizing search metrics

·nathanday.shinyapps.io·Oct 6, 2022

Do all-stopword queries matter?

Many search engines don’t index “stopwords”, words that are very common and have little meaning by themselves. The stopword list is often just the most frequent words in the langu…

·observer.wunderwood.org·Oct 6, 2022

BM25 The Next Generation of Lucene Relevance - OpenSource Connections

There’s something new cooking in how Lucene scores text. Lucene just switched to something called BM25 in trunk. That means a new scoring formula for Solr and Elasticsearch.

·opensourceconnections.com·Oct 6, 2022
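
For readers who have not seen the formula, here is a minimal sketch of BM25 scoring in its common Okapi form. It is an illustration only, not Lucene's actual implementation: the function name `bm25_scores`, the toy documents, and the whitespace tokenization are all assumptions made for this example. The defaults `k1=1.2` and `b=0.75` are the commonly cited ones.

```python
import math
from collections import Counter


def bm25_scores(query_terms, docs, k1=1.2, b=0.75):
    """Score each tokenized document in `docs` against `query_terms`.

    Hypothetical helper for illustration; `docs` is a list of token lists.
    """
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: how many documents contain each term.
    doc_freq = Counter(term for d in docs for term in set(d))

    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query_terms:
            if term not in tf:
                continue
            # Rare terms get a higher inverse document frequency.
            idf = math.log(1 + (n_docs - doc_freq[term] + 0.5) / (doc_freq[term] + 0.5))
            # Term frequency saturates via k1; b controls length normalization.
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


# Toy corpus (made up for this example).
docs = [
    "lucene scores text with bm25".split(),
    "solr and elasticsearch build on lucene".split(),
    "a post about cooking".split(),
]
scores = bm25_scores(["bm25", "lucene"], docs)
```

The first document matches both query terms and should score highest; the third matches neither and scores zero.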

What is Learning To Rank? - OpenSource Connections

What is Learning to Rank? Learning to rank ties machine learning into the search engine relevance.

·opensourceconnections.com·Oct 6, 2022

How is search different than other machine learning problems? - OpenSource Connections

In this blog, we explore what makes search distinct from other machine learning problems. How does one approach search ranking as a machine learning problem? We go through a couple of approaches that give you an intuition on how to evaluate a learning to rank method.

·opensourceconnections.com·Oct 6, 2022

How to Implement a Normalized Discounted Cumulative Gain (NDCG) Ranking Quality Scorer in Quepid - OpenSource Connections

Our search relevancy engineer at OpenSource Connections (OSC) uses Quepid every day! Read on to learn how to implement a normalized discounted cumulative gain ranking quality scorer in Quepid.

·opensourceconnections.com·Oct 6, 2022

An Introduction to Search Quality - OpenSource Connections

Welcome, dear reader, to my first OSC blog post. Let’s dive in! While search relevance is often equated with ensuring customers find what they need, that is only part...

·opensourceconnections.com·Oct 6, 2022

The Unreasonable Effectiveness of Collocations - OpenSource Connections

Recently while experimenting with word2vec-based features with Learning to Rank, I was exploring using collocations to improve the accuracy of my embeddings. If you read the original word2vec paper...

·opensourceconnections.com·Oct 6, 2022

Falsehoods Programmers Believe About Search - OpenSource Connections

105 falsehoods programmers believe about search, a complex field where competence is hard-won through training, practice, and experience.

·opensourceconnections.com·Oct 6, 2022

Understanding BERT and Search Relevance - OpenSource Connections

This article gives an overview into the opportunities and challenges when applying advanced transformer models such as BERT to search.

·opensourceconnections.com·Oct 6, 2022

Demystifying nDCG and ERR - OpenSource Connections

We unwrap the mystery behind two popular search relevance metrics nDCG and ERR through visualization, and discuss their pros and cons.

·opensourceconnections.com·Oct 6, 2022
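
As a quick reference for the two metrics named above, here is a minimal sketch of nDCG and ERR over a list of graded relevance judgments. It assumes the exponential-gain form of DCG and the usual ERR stopping probability `(2**g - 1) / 2**max_grade`; other variants exist (for example linear gain), and the function names are hypothetical, not taken from the article.

```python
import math


def dcg(gains):
    """Discounted cumulative gain with exponential gain and log2 discount."""
    return sum((2 ** g - 1) / math.log2(i + 2) for i, g in enumerate(gains))


def ndcg(gains, k=None):
    """DCG of the top-k results divided by the DCG of the ideal ordering."""
    k = len(gains) if k is None else k
    ideal = dcg(sorted(gains, reverse=True)[:k])
    return dcg(gains[:k]) / ideal if ideal > 0 else 0.0


def err(gains, max_grade):
    """Expected reciprocal rank: models a user who stops at the first satisfying result."""
    score = 0.0
    p_continue = 1.0  # probability the user reached this rank unsatisfied
    for rank, g in enumerate(gains, start=1):
        p_stop = (2 ** g - 1) / (2 ** max_grade)
        score += p_continue * p_stop / rank
        p_continue *= 1 - p_stop
    return score


# Grades on a 0-3 scale, listed in the order the engine returned the results.
perfect = ndcg([3, 2, 1, 0])
reversed_order = ndcg([0, 1, 2, 3])
single_hit = err([3], max_grade=3)
```

A ranking already sorted by grade gets an nDCG of 1.0, while the reversed ranking scores lower. ERR rewards placing a highly relevant result first and discounts everything after it.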

What is a 'Relevant' Search Result? - OpenSource Connections

Five years ago, I wrote an article called What is Search Relevance?. Back then, I had to shout to convince people to even notice whether search results were accurate...

·opensourceconnections.com·Oct 6, 2022

Choosing your search relevance evaluation metric - OpenSource Connections

Ensuring results are relevant is tricky but critical to a good search experience. Choosing an evaluation metric to summarize the performance of the search engine can be equally challenging because...

·opensourceconnections.com·Oct 6, 2022

Feedback debt: what the Segway teaches search teams - OpenSource Connections

Does anyone else remember the Segway? Segway was billed as the most revolutionary transportation innovation, well, ever. We would use it instead of cars; instead of biking and walking....

·opensourceconnections.com·Oct 6, 2022

5 Right Ways to Measure How Search Is Performing - OpenSource Connections

Search is a big deal — how else could we navigate the digital age where information overload is the status quo? But despite search being omnipresent, it can still...

·opensourceconnections.com·Oct 6, 2022

Building an Effective Search Team: the key to great search & relevancy - OpenSource Connections

What are the key roles in a Search Team and how can you find people to fill them both from within and without your organisation?

·opensourceconnections.com·Oct 6, 2022

E-commerce Site-Search KPIs - Part 1 - Customers - OpenSource Connections

We will outline all the things you need to know about measuring the quality and effectiveness of your store’s site-search.

·opensourceconnections.com·Oct 6, 2022

E-commerce Site-Search KPIs - Part 2 - Products - OpenSource Connections

Today we’ll discuss KPIs for the actual product items that are presented to a buyer when they use your site-search.

·opensourceconnections.com·Oct 6, 2022

E-Commerce Site-Search KPIs - Part 3 - Queries - OpenSource Connections

How much money are you losing or gaining, depending on a site-search query? You need to be able to know if search is working, and you can with these KPIs!

·opensourceconnections.com·Oct 6, 2022

Fundamentals of query rewriting (part 1): introduction to query expansion - OpenSource Connections

Johannes Peter introduces query expansion in the first part of a series on fundamentals of query rewriting

·opensourceconnections.com·Oct 6, 2022

Metacrap

·people.well.com·Oct 6, 2022

Hybrid search sum of its parts? Berlin Buzzwords 2022

Over the decades, information retrieval has been dominated by classical methods such as BM25. These lexical models are simple and effective yet vulnerable to vocabulary mismatch. With the introduction of pre-trained language models such as BERT and its relatives, deep retrieval models have achieved superior performance with their strong ability to capture semantic relationships. The downside is that training these deep models is computationally expensive, and suitable datasets are not always available for fine-tuning toward the target domain. While deep retrieval models work best on domains close to what they have been trained on, lexical models are comparatively robust across datasets and domains. This suggests that lexical and deep models can complement each other, retrieving different sets of relevant results. But how can these results effectively be combined? And can we learn something from language models to learn new indexing methods? This talk will delve into both these approaches and exemplify when they work well and not so well. We will take a closer look at different strategies to combine them to get the best of both, even in zero-shot cases where we don't have enough data to fine-tune the deep model. The Search track is presented by OpenSource Connections.

·pretalx.com·Oct 6, 2022
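
One widely used way to combine a lexical ranking with a dense ranking is reciprocal rank fusion (RRF), sketched below. This is an illustration of the general idea only; the abstract does not say which combination strategies the talk covers. The function name, the document ids, and the constant `k=60` (a commonly used default) are assumptions made for this example.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document ids into one ranking.

    Each document earns 1 / (k + rank) from every list it appears in,
    so no score calibration between the retrievers is needed.
    """
    fused = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            fused[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by document id for determinism.
    return sorted(fused, key=lambda d: (-fused[d], d))


# Hypothetical results from a BM25 retriever and a dense retriever.
lexical = ["d1", "d2", "d3"]
dense = ["d3", "d1", "d4"]
combined = reciprocal_rank_fusion([lexical, dense])
```

Because RRF uses only rank positions, it can be applied in zero-shot settings where the two retrievers produce scores on incomparable scales.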
