Blog Posts

Welcome to my blog! Here I share my thoughts, research insights, and experiences in AI and machine learning.

Generative AI and LLM

What If AI Was Never Meant to Learn From Us

What If AI Was Never Meant to Learn From Us

March 23, 2026 15-20 min read

If human data isn't the optimal medium — would we even know? We've spent years feeding models everything humans have ever written, and just assumed...

地基没人查过,楼已经一百层了

Reward Hacking & Jailbreak Research

The Evolution of Reward Hacking and Jailbreak Research in AI

March 03, 2026 25-30 min read

From specification gaming in classical RL to deceptive alignment and jailbreaks in LLMs—a survey of how reward hacking has become a central challenge in AI...

LLM Reasoning

Reasoning in Large Language Models: A Research-Centric Overview

May 25, 2025 15-25 min read

“Can LLMs reason?” is one of those questions that generates more heat than light, mostly because people mean very different things by “reasoning.” This post...

AI Agents

The Web Is Not a Neutral Environment for Agents

The Web Is Not a Neutral Environment for Agents

March 15, 2026 5-10 min read

Browser agents are getting better fast, but the web is full of things that try to steer behavior. If that already works on humans, why...

Long Sequence User Modeling in Ads Ranking

Modeling Long User Histories for Ads Ranking

March 07, 2026 20-30 min read

How ads ranking systems went from aggregated feature counts to retrieve-and-compress architectures that handle 10,000+ user events under millisecond latency constraints.

Sequential Learning in Ranking AI

Sequential Learning in Recommendation Systems: From Markov Chains to Transformers

January 22, 2025 20-30 min read

A comprehensive guide to sequence-based recommendation techniques and key research papers, tracing the evolution from early statistical methods to modern transformer-based architectures

Contemporary Recommendation Systems

Contemporary RecSys: Industry-Scale Architectures & Multimodal Systems (2020–2025)

January 20, 2025 15-25 min read

This era represents a pivotal shift toward production-ready, billion-scale architectures that power today’s major platforms. This comprehensive guide covers the essential papers that define contemporary...

Deep Learning Era in Recommendation Systems

Deep Learning Era of Ranking & Recommendation Systems: Must-Read Papers (2016–2020)

January 20, 2025 15-25 min read

Explore how deep learning transformed ranking and recommendation systems, from Wide & Deep to BERT4Rec, covering core technologies powering platforms like YouTube, Facebook, and Amazon...

Classic Foundations Recommendation Systems

Classic Foundational Papers on Ranking & Recommendation Systems

January 15, 2025 15-20 min read

Ranking and recommendation systems power everything from Google Search to Netflix suggestions. While today’s systems use deep learning and large language models (LLMs), their foundations...


More posts coming soon! Feel free to reach out if you have any questions or suggestions.