Dating, Dining, and Doomscrolling: The Algorithms Behind Modern Choice

Overview of Recommendation system - Workshop for Pycon India 2025
deep-learning
attention
Author

Aman Pandey

Published

December 9, 2025

This was workshop for Pycon India 2025

Dating, Dining, and Doomscrolling: The Algorithms Behind Modern Choice

Abstract

Recommendation systems are no longer just about suggesting the next movie, they now shape everything from what we eat and wear to who we date, what we watch, and even what we believe. In this hands-on workshop, we’ll explore the powerful systems behind these “invisible puppeteers” of modern digital life.

You’ll learn not just how recommendation systems work, but how they shape your choices, reinforce (or challenge) your biases, and how companies fine-tune them to maximize retention, revenue, and engagement. We’ll build and break down a recommendation pipeline from retrieval to ranking and dive into production challenges, optimization tricks, and the ethical trade-offs involved.

We’ll also discuss how users can hack these systems to personalize their feeds and how developers can design fairer, more diverse systems. Whether you’re into dating apps, content feeds, or building ML pipelines, this workshop will help you understand the full spectrum of RecSys.

Description of the workshop

1. Introduction (10 mins)

  • Welcome and Overview

    • Introduction to the workshop: What are recommendation systems and why do they matter?

    • Real-world examples:

      • Dating apps (e.g., Tinder)
      • Content feeds (e.g., Instagram)
      • Food recommendations (e.g., Zomato)
    • Goals of the workshop:

      • Learn how RecSys work
      • Dive into their challenges
      • Understand their ethical implications
  • Audience Poll / Engagement

    • Quick questions to gauge familiarity with RecSys

    • Brief survey:

      • Who uses these systems?
      • Who’s familiar with the tech behind them?

2. Foundations of Recommendation Systems (30 mins)

Goal: Give a strong conceptual grounding before diving into implementation.

  • Types of Training Data

    • Explicit Feedback: Ratings, thumbs-up/down
    • Implicit Feedback: Clicks, dwell time, views, purchases
  • Interaction Types

    • Positive-only vs. Positive & Negative Feedback
  • System Categories by Use-Case

    • General-purpose, Context-aware, Sequential, Knowledge-based RecSys
  • Core Approaches

    • Collaborative Filtering (User-based vs. Item-based)
    • Content-Based Filtering
    • Hybrid Models
  • Application Examples

    • Netflix → Hybrid
    • Amazon → Item-based CF
    • Tinder → Context-aware
    • Spotify → Sequential
    • Zomato → Content + CF
    • Pratilipi → Long-tail engagement via diversity-aware models
  • Recommendation Pipeline

    • Retrieval Stage: Candidate generation
    • Ranking Stage: Sorting by relevance
    • Exploration vs. Exploitation
  • Ranking Methods

    • Pointwise / Pairwise / Listwise
  • Metrics that Matter

    • Offline Metrics: Precision@K, Recall, MAP, NDCG

    • Business Goal Alignment:

      • YouTube → Watch Time
      • Tinder → Match Rate

3. Hands-On Session: Building a Recommendation System (40 mins)

Goal: Implement and understand the inner workings of a real model.

  • Exercise 1: Building a Two-Tower Model (30 mins)

    • Use PyTorch to build a simple two-tower architecture
    • Demonstrate user-item embedding and dot-product similarity
    • Step-by-step: Data loading → Model → Loss → Training
  • Exercise 2: Evaluating Your Model (10 mins)

    • Use standard metrics: Precision, Recall, MAP
    • Compare online metrics vs. offline metrics

Break (10 mins)


4. Challenges in Recommendation Systems (30 mins)

Real-World Problems

  • Bias in Training Data: Echo chambers, skewed preferences
  • Cold Start: New users/items with little interaction data
  • Scalability: Latency and compute at scale

Ethical Challenges

  • Algorithmic Bias & Fairness: Impact on sensitive attributes

  • Diversity vs. Personalization: Avoiding homogenized content

  • Debiasing Techniques:

    • Exposure Bias: Overexposure to certain items
    • Popularity Bias: Trend dominance
    • Selection Bias: Skewed relevance due to interaction patterns

Case Study: Pratilipi

  • How an Indian storytelling platform balances fairness, diversity, and engagement

5. Hands-On Recap & Final Exercise (30 mins)

Mini Lab: Use your model to explore bias, diversity, and fairness.

  • Apply:

    • Diversity boosting
    • Popularity debiasing
    • Freshness filters
  • Re-evaluate:

    • Compare fairness, diversity, and relevance metrics before and after
  • Group Reflection:

    • Discuss improvements, regressions, and trade-offs

6. Gaming the System: How Users Can Personalize Their Feeds (10 mins)

  • How users can influence their recommendation feed (e.g., YouTube, Spotify)
  • Implications of over-personalization (filter bubbles, doomscrolling)
  • Tips for better and diverse recommendations

7. Conclusion & Q&A (20 mins)

  • Wrap-Up

  • Q&A: Open floor for questions on theory, implementation, or ethics

  • Additional Resources:

    • GitHub repo with code, datasets, and readings
    • Encourage continued learning and experimentation

Here is the presentation I used in the talk:

Slides

Explore the Code

You can interact with the notebook on Google Colab.

Back to top