Attention Residuals: A Comprehensive Understanding
This paper addresses a fundamental problem in training deep transformer models: uncontrolled hidden-state magnitude growth as model depth increases. The auth...
Ranking and recommendation systems power everything from Google Search to Netflix suggestions. While today’s systems use deep learning and large language models (LLMs), their foundations were laid decades earlier — with ideas that are still relevant in production today.
This post curates the classic works (1970s–2010s) that every AI engineer working in ranking / recommendation systems should know before diving into modern architectures.
Robertson & Jones, 1976 – The 2-Poisson Model
Early probabilistic formulation of information retrieval, paving the way for modern ranking.
Okapi BM25 (Robertson & Walker, 1994)
The most influential bag-of-words ranking function ever created. Despite neural models, BM25 is still used as a baseline in 2025.
RankNet (Burges et al., 2005)
First neural approach to ranking with a pairwise loss. Established the move from heuristics to machine-learned ranking.
LambdaRank (Burges, 2006)
Adjusted gradients to directly optimize NDCG, bridging the gap between ML training and ranking metrics.
ListNet (Cao et al., 2007)
First listwise approach, training directly on permutations of ranked lists.
LambdaMART (Burges, 2010)
Combined LambdaRank with gradient-boosted decision trees.
Still the de facto industry standard for large-scale ranking pipelines.
Breese, Heckerman & Kadie, 1998 – Empirical Analysis of Predictive Algorithms
Compared user-based vs. item-based collaborative filtering. A foundational evaluation.
Item-based CF (Sarwar et al., 2001)
First scalable CF method, widely adopted in e-commerce platforms.
Matrix Factorization for Recommender Systems (Koren et al., 2009)
The Netflix Prize breakthrough: decomposing users and items into latent vectors. Still a benchmark today.
Temporal Dynamics in MF (Koren, 2009)
Extended MF with time-sensitive embeddings, modeling user preference shifts.
Factorization Machines (Rendle, 2010)
Generalized MF to arbitrary sparse features, allowing second-order interactions.
Direct ancestor of modern feature interaction models like DeepFM.
Precision, Recall, F-measure (van Rijsbergen, 1979)
Core IR evaluation metrics still taught today.
Discounted Cumulative Gain (DCG/NDCG) – Järvelin & Kekäläinen, 2002
A metric designed specifically for ranking quality.
Still the gold standard for search and recommendation evaluation.
LETOR Benchmark (Liu et al., 2007)
First public learning-to-rank dataset, crucial for standardizing comparisons.
TREC Evaluation Methodology (Voorhees & Harman, 2005)
Large-scale shared evaluation tasks that shaped the culture of IR research.
@article{reneejia2025classic-foundational-papers-on-ranking-recommendation-systems,
title = "Classic Foundational Papers on Ranking & Recommendation Systems",
author = "Renee Jia",
journal = "renee-jia.github.io",
year = "2025",
url = "https://renee-jia.github.io/ai-learning-guide/classic-foundational-papers-ranking-recommendation-systems/"
}
This paper addresses a fundamental problem in training deep transformer models: uncontrolled hidden-state magnitude growth as model depth increases. The auth...
Browser agents are getting better fast, but the web is full of things that try to steer behavior. If that already works on humans, why would agents be immune?
How ads ranking systems went from aggregated feature counts to retrieve-and-compress architectures that handle 10,000+ user events under millisecond latency ...
From specification gaming in classical RL to deceptive alignment and jailbreaks in LLMs—a survey of how reward hacking has become a central challenge in AI s...
Reasoning in Large Language Models: A Research-Centric Overview
A comprehensive guide to sequence-based recommendation techniques and key research papers, tracing the evolution from early statistical methods to modern tra...
Contemporary RecSys: Industry-Scale Architectures & Multimodal Systems (2020–2025)
Explore how deep learning transformed ranking and recommendation systems, from Wide & Deep to BERT4Rec, covering core technologies powering platforms lik...
Classic Foundational Papers on Ranking & Recommendation Systems
AI Beginner’s Guide: Learning Artificial Intelligence from Scratch
Welcome to My Blog