Renee Jia

The Evolution of Reward Hacking and Jailbreak Research in AI

25-30 min read

From specification gaming in classical RL to deceptive alignment and jailbreaks in LLMs—a survey of how reward hacking has become a central challenge in AI s...

Reasoning in Large Language Models: A Research-Centric Overview

15-25 min read

“Can LLMs reason?” is one of those questions that generates more heat than light, mostly because people mean very different things by “reasoning.” This post ...

Sequential Learning in Recommendation Systems: From Markov Chains to Transformers

20-30 min read

A comprehensive guide to sequence-based recommendation techniques and key research papers, tracing the evolution from early statistical methods to modern tra...

Contemporary RecSys: Industry-Scale Architectures & Multimodal Systems (2020–2025)

15-25 min read

This era represents a pivotal shift toward production-ready, billion-scale architectures that power today’s major platforms. This comprehensive guide covers ...

Deep Learning Era of Ranking & Recommendation Systems: Must-Read Papers (2016–2020)

15-25 min read

Explore how deep learning transformed ranking and recommendation systems, from Wide & Deep to BERT4Rec, covering core technologies powering platforms lik...
