Recommendation systems power some of the most successful products in technology—Netflix’s movie suggestions, Amazon’s product recommendations, Spotify’s playlists, and YouTube’s endless video queues. The sophistication of these systems might suggest they require extensive machine learning expertise and thousands of lines of code to implement. In reality, you can build surprisingly effective recommendation systems with remarkably little code by leveraging the right libraries and focusing on core principles rather than complex optimizations.
The key to building recommendations with minimal code is understanding that effective recommendations don’t require cutting-edge algorithms or massive infrastructure. Simple approaches based on similarity and collaborative filtering often perform remarkably well for small to medium-sized datasets and user bases. By using Python libraries like pandas, scikit-learn, and scipy, you can implement functional recommendation systems in under 100 lines of code that deliver genuine value to users. The challenge is not coding complexity but understanding which approach fits your data and use case.
Understanding Recommendation Approaches
Before writing code, you need to understand the fundamental approaches to recommendations. This conceptual foundation determines what type of system you’ll build and what code you’ll need. Recommendation systems generally fall into three categories, each with different data requirements and implementation complexity.
Content-based filtering recommends items similar to those a user has liked previously. If someone watches action movies, recommend more action movies. This approach requires understanding item attributes—genres, keywords, descriptions, features. The advantage is that it works for new users who have interacted with just one or two items and doesn’t require data about other users. The disadvantage is that it tends to recommend similar items repeatedly, creating a filter bubble where users don’t discover diverse content.
Collaborative filtering recommends items based on patterns in user behavior—if User A and User B liked similar items in the past, and User A likes a new item, recommend it to User B. This approach requires user-item interaction data but no understanding of item attributes. It naturally discovers complex patterns like “people who like X also like Y” that might not be obvious from content alone. The downside is the cold start problem: new users with no history and new items with no interactions cannot be recommended effectively.
Hybrid approaches combine both methods, using content similarity when collaborative data is sparse and collaborative signals when available. These systems provide the best user experience but are more complex to implement. For minimal code implementations, it makes more sense to start with either pure content-based or pure collaborative filtering, choosing based on what data you have available.
Building Content-Based Recommendations
Content-based recommendations are the simplest to implement and require only item metadata and a single user’s preferences. The core concept is measuring similarity between items based on their attributes, then recommending the most similar items to those the user already likes. This can be implemented in surprisingly few lines of code.
The foundation is representing items as vectors of features. For movies, this might include genres, director, actors, keywords from descriptions, release year, and ratings. For products, it might include categories, brands, specifications, and review keywords. For articles, it might be topics, keywords, and author. The key is converting these attributes into numerical vectors that algorithms can process.
Text-based attributes benefit from TF-IDF (Term Frequency-Inverse Document Frequency) vectorization, which converts text into numerical vectors that capture which words are important to each item while downweighting common words that appear everywhere. Python’s scikit-learn library provides a TF-IDF vectorizer that handles this transformation automatically, requiring just a few lines of code.
Once items are vectors, calculating similarity becomes straightforward. Cosine similarity measures the angle between vectors—items with similar feature patterns have high cosine similarity regardless of vector magnitude. If two movies share many genres, actors, and keywords, they’ll have high cosine similarity. Computing all pairwise similarities creates a similarity matrix where you can quickly look up the most similar items to any given item.
Here’s a complete content-based recommender in minimal code:
import pandas as pd
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Load your data (movies with titles and descriptions)
movies = pd.DataFrame({
    'title': ['The Matrix', 'Inception', 'Toy Story', 'Finding Nemo', 'The Shawshank Redemption'],
    'description': [
        'Hacker discovers reality is simulation fights machines',
        'Thief enters dreams to plant ideas corporate espionage',
        'Toys come alive when humans leave adventure friendship',
        'Fish searches ocean for lost son underwater adventure',
        'Wrongfully imprisoned man escapes through tunnels friendship redemption'
    ]
})
# Convert descriptions to TF-IDF vectors
tfidf = TfidfVectorizer(stop_words='english')
tfidf_matrix = tfidf.fit_transform(movies['description'])
# Compute similarity between all movies
similarity_matrix = cosine_similarity(tfidf_matrix, tfidf_matrix)
# Function to get recommendations
def get_recommendations(title, n=3):
    # Find the index of the movie
    idx = movies[movies['title'] == title].index[0]
    # Get similarity scores for this movie with all others
    sim_scores = list(enumerate(similarity_matrix[idx]))
    # Sort by similarity (excluding the movie itself)
    sim_scores = sorted(sim_scores, key=lambda x: x[1], reverse=True)[1:n+1]
    # Return movie titles
    movie_indices = [i[0] for i in sim_scores]
    return movies['title'].iloc[movie_indices].tolist()
# Get recommendations
print(get_recommendations('The Matrix'))  # Prints the 3 most similar titles by description overlap
This approach scales well up to thousands of items and works immediately—no training period required. You can enhance it by incorporating multiple features: combining text descriptions with categorical features like genres, using weighted combinations where certain attributes matter more, and normalizing numerical features so they contribute appropriately to similarity calculations.
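As a minimal sketch of that enhancement, the snippet below extends the movies DataFrame from the example above with a hypothetical genres column, vectorizes descriptions and genres separately, and blends the two similarity matrices with weights. The genre labels and the 0.4/0.6 weights are illustrative assumptions, not tuned values:

# Hypothetical genre labels for the five movies defined earlier
movies['genres'] = ['sci-fi action', 'sci-fi thriller', 'animation family',
                    'animation family', 'drama']

# Vectorize each feature set separately
desc_matrix = TfidfVectorizer(stop_words='english').fit_transform(movies['description'])
genre_matrix = TfidfVectorizer().fit_transform(movies['genres'])

# Weighted blend: here genre overlap counts more than description overlap
combined_similarity = (0.4 * cosine_similarity(desc_matrix)
                       + 0.6 * cosine_similarity(genre_matrix))

Swapping combined_similarity in for similarity_matrix in get_recommendations gives the same lookup logic a richer notion of similarity.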
Implementing Collaborative Filtering Simply
Collaborative filtering feels more complex because it discovers patterns in user behavior rather than following explicit rules about item similarity. However, user-based collaborative filtering can be implemented with the same simplicity as content-based approaches—the core idea is just finding similar users instead of similar items.
The starting point is a user-item matrix showing which users interacted with which items. This might be explicit ratings (1-5 stars), implicit feedback (watched/didn’t watch, clicked/didn’t click), or purchase history (bought/didn’t buy). The matrix is typically sparse—most users interact with only a small fraction of available items—but algorithms handle this gracefully.
User-based collaborative filtering finds users similar to the target user based on their interaction patterns, then recommends items those similar users liked. If you and I rated movies similarly, and I loved a movie you haven’t seen, the system recommends it to you. Calculating user similarity works exactly like item similarity: treat each user’s interactions as a vector and compute cosine similarity between users.
The implementation mirrors content-based filtering, but the vectors come from user-item interactions rather than item attributes. Instead of finding similar items, you find similar users. Instead of recommending items similar to what the user liked, you recommend items that similar users liked. The code structure remains nearly identical, just switching what you’re comparing:
import pandas as pd
from sklearn.metrics.pairwise import cosine_similarity
# Create user-item matrix (users as rows, items as columns, values are ratings)
ratings = pd.DataFrame({
    'user': [1, 1, 1, 2, 2, 2, 3, 3, 3, 4, 4],
    'item': ['A', 'B', 'C', 'A', 'B', 'D', 'A', 'C', 'D', 'B', 'C'],
    'rating': [5, 4, 3, 5, 4, 5, 2, 4, 5, 4, 5]
})
# Pivot to create user-item matrix
user_item_matrix = ratings.pivot(index='user', columns='item', values='rating').fillna(0)
# Calculate user similarity
user_similarity = cosine_similarity(user_item_matrix)
user_similarity_df = pd.DataFrame(user_similarity,
                                  index=user_item_matrix.index,
                                  columns=user_item_matrix.index)
def get_user_recommendations(user_id, n=3):
    # Get the 3 most similar users, skipping position 0 (the user themself)
    similar_users = user_similarity_df[user_id].sort_values(ascending=False)[1:4]
    # Get items the user hasn't rated
    user_items = user_item_matrix.loc[user_id]
    unrated_items = user_items[user_items == 0].index
    # Score items based on similar users' ratings, weighted by similarity
    item_scores = {}
    for item in unrated_items:
        score = 0
        for similar_user, similarity in similar_users.items():
            score += user_item_matrix.loc[similar_user, item] * similarity
        item_scores[item] = score
    # Return top items
    recommendations = sorted(item_scores.items(), key=lambda x: x[1], reverse=True)[:n]
    return [item for item, score in recommendations]
print(get_user_recommendations(1))
Item-based collaborative filtering flips this approach: instead of finding similar users, find similar items based on which users liked them. Items that are frequently liked by the same users are considered similar. This approach often performs better because item similarities tend to be more stable than user similarities—a movie’s characteristics don’t change, but a user’s preferences might evolve.
The beauty of item-based collaborative filtering is that you can precompute item similarities once and reuse them for all recommendation requests. This makes it highly efficient for production systems. User-based filtering requires recalculating similarities for each user, while item-based filtering does heavy computation once during a nightly batch process, then uses simple lookups during user requests.
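As a minimal sketch of the item-based variant, the snippet below reuses the user_item_matrix from the example above, transposes it so items become the rows, and scores each unseen item by its similarity to the items the user has already rated. The rating-weighted sum is one common scoring choice, not the only one:

# Item-based variant: compare item columns instead of user rows
item_similarity = cosine_similarity(user_item_matrix.T)
item_similarity_df = pd.DataFrame(item_similarity,
                                  index=user_item_matrix.columns,
                                  columns=user_item_matrix.columns)

def get_item_based_recommendations(user_id, n=3):
    user_ratings = user_item_matrix.loc[user_id]
    rated = user_ratings[user_ratings > 0]
    unrated = user_ratings[user_ratings == 0].index
    scores = {}
    for item in unrated:
        # Score each unseen item by its similarity to the user's rated items,
        # weighted by how highly the user rated them
        sims = item_similarity_df.loc[item, rated.index]
        scores[item] = (sims * rated).sum()
    return sorted(scores, key=scores.get, reverse=True)[:n]

print(get_item_based_recommendations(1))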
Handling Cold Start and Sparse Data
The biggest challenge with minimal-code recommendation systems is handling cold start problems—new users with no history and new items with no interactions. Sophisticated systems use complex strategies, but simple approaches can be surprisingly effective with minimal additional code.
For new users, start with popularity-based recommendations. Show them the most popular or highest-rated items overall. This provides immediate value while you collect their preferences. After just a few interactions, you can switch to personalized recommendations. Implementation requires just counting interaction frequencies or averaging ratings across items.
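A minimal sketch of that fallback, reusing the ratings DataFrame from the collaborative filtering example (the minimum-count threshold of 2 is an arbitrary choice for this toy data):

# Popularity fallback for brand-new users: rank items by average rating,
# requiring a minimum number of ratings so a single 5-star vote can't dominate
item_stats = ratings.groupby('item')['rating'].agg(['mean', 'count'])
popular_items = item_stats[item_stats['count'] >= 2].sort_values('mean', ascending=False)
print(popular_items.head(3).index.tolist())  # Serve these to users with no history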
For new items, content-based features provide immediate recommendations even with zero interaction history. If a new movie enters your system, you can recommend it to users who liked similar movies based on genre, cast, and description—no user ratings required. This is why hybrid approaches handle cold start better than pure collaborative filtering.
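To make that concrete, here is a minimal sketch that continues the content-based example: a made-up new title with zero ratings is vectorized with the already-fitted TF-IDF model and matched against the existing catalog:

# A brand-new item can be positioned immediately from its description alone
new_description = ['Robot left alone on earth cleans up and finds adventure']
new_vector = tfidf.transform(new_description)  # reuse the fitted vectorizer
scores = cosine_similarity(new_vector, tfidf_matrix).flatten()
closest = movies['title'].iloc[scores.argsort()[::-1][:2]].tolist()
print(closest)  # Users who liked these titles are candidates for the new one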
Sparse data—when most users have interacted with very few items—degrades recommendation quality. Simple strategies help: set minimum thresholds for considering users or items (ignore users with fewer than 5 ratings), use dimensionality reduction techniques like SVD to find latent factors that explain sparse data, or incorporate content features to supplement sparse interaction data.
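The thresholding strategy takes only a couple of lines of pandas, as in this sketch (the cutoff of 5 ratings is the rule of thumb mentioned above, not a universal constant):

# Drop users with fewer than 5 ratings before building the user-item matrix
rating_counts = ratings.groupby('user')['rating'].count()
active_users = rating_counts[rating_counts >= 5].index
dense_ratings = ratings[ratings['user'].isin(active_users)]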
For minimal code implementations, the pragmatic approach is setting reasonable expectations. A recommendation system with 100 users and 50 items won’t perform miracles. But it will provide value by surfacing relevant content users might otherwise miss. As you collect more data, the same simple algorithms naturally improve in quality without code changes.
Matrix Factorization for Better Results
If you want to step beyond basic similarity approaches while maintaining minimal code, matrix factorization provides significant quality improvements with only modest code additions. The idea is decomposing your sparse user-item matrix into two smaller matrices that capture latent factors—hidden patterns explaining why users like certain items.
The surprise library (installed as scikit-surprise) makes implementing matrix factorization almost trivially simple. It provides SVD (Singular Value Decomposition) and other factorization algorithms that handle sparse data well and require minimal configuration. You load your ratings data, train a model in one line, and generate predictions in another:
from surprise import SVD, Dataset, Reader
import pandas as pd
# Prepare data
ratings = pd.DataFrame({
    'user': [1, 1, 1, 2, 2, 2, 3, 3, 3],
    'item': ['A', 'B', 'C', 'A', 'B', 'D', 'A', 'C', 'D'],
    'rating': [5, 4, 3, 5, 4, 5, 2, 4, 5]
})
# Configure and load
reader = Reader(rating_scale=(1, 5))
data = Dataset.load_from_df(ratings[['user', 'item', 'rating']], reader)
# Train model
trainset = data.build_full_trainset()
model = SVD()
model.fit(trainset)
# Get predictions for a user
user_id = 1
all_items = ratings['item'].unique()
user_items = ratings[ratings['user'] == user_id]['item'].values
# Predict ratings for items the user hasn't seen
predictions = []
for item in all_items:
    if item not in user_items:
        pred = model.predict(user_id, item)
        predictions.append((item, pred.est))
# Sort and return top recommendations
recommendations = sorted(predictions, key=lambda x: x[1], reverse=True)[:3]
print([item for item, score in recommendations])
Matrix factorization discovers that movies cluster into latent factors—perhaps one factor represents action-adventure preference, another represents preference for complex plots, another represents tolerance for slower pacing. Users also get scores on these factors. Recommendations come from matching users to items with similar factor patterns.
The advantage over simple similarity is that matrix factorization captures complex patterns that don’t reduce to simple “users who liked X also liked Y” rules. It handles sparse data better because it infers missing ratings based on learned patterns. The training process takes longer than computing similarities, but once trained, generating recommendations is very fast.
Making Your System Production-Ready
A minimal-code recommendation system can absolutely run in production with just a few practical considerations. The code examples above work fine for hundreds or even thousands of users and items, but you need to think about how to integrate them into an actual application and how to keep recommendations fresh as data changes.
Precomputation is your friend for production systems. Computing similarities on-demand for every recommendation request wastes resources. Instead, run a nightly job that computes all necessary similarities, saves them to a database or cache, and serves recommendations from these precomputed values. This makes recommendation requests fast regardless of how complex your similarity calculations are.
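A minimal sketch of that pattern, continuing the content-based example above and using joblib for persistence (the file path and the choice of joblib over a database or cache are assumptions):

import joblib

# --- nightly batch job: compute once and persist ---
similarity_matrix = cosine_similarity(tfidf_matrix)
joblib.dump(similarity_matrix, 'similarity_matrix.joblib')

# --- request time: load the precomputed matrix and do cheap lookups ---
similarity_matrix = joblib.load('similarity_matrix.joblib')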
Caching recommendations improves performance dramatically. For most users, recommendation lists don’t need to update in real-time—if someone watches a movie today, showing updated recommendations tomorrow is perfectly acceptable. Cache recommendation lists per user for 6-24 hours. This reduces computation load while maintaining freshness. Only invalidate caches when users explicitly interact with items.
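Here is a minimal in-process sketch of that policy; a real deployment might use Redis or memcached instead, and the 6-hour TTL is just one point in the 6-24 hour range suggested above:

import time

CACHE_TTL_SECONDS = 6 * 60 * 60  # refresh each user's list every 6 hours
_recommendation_cache = {}

def cached_recommendations(user_id):
    entry = _recommendation_cache.get(user_id)
    if entry and time.time() - entry['computed_at'] < CACHE_TTL_SECONDS:
        return entry['recs']
    recs = get_user_recommendations(user_id)  # from the earlier example
    _recommendation_cache[user_id] = {'recs': recs, 'computed_at': time.time()}
    return recs

def invalidate_cache(user_id):
    # Call this when the user explicitly interacts with an item
    _recommendation_cache.pop(user_id, None)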
Data updates need attention in production. As users rate items, as new items are added, and as popularity shifts, your recommendations should evolve. The pragmatic approach is batch updates: rebuild similarity matrices nightly or weekly depending on your data volume and rate of change. Real-time systems are possible but add significant complexity—save that for when simple batch updates prove insufficient.
Monitoring recommendation quality in production is crucial but simple. Track basic metrics: the percentage of users who click on recommendations, how ratings of recommended items compare to overall ratings, and whether recommendations are appropriately diverse or too similar to each other. These metrics reveal whether your system provides value without requiring sophisticated machine learning operations infrastructure.
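For instance, click-through rate takes one line of pandas once you log impressions; the log schema below is a made-up example:

import pandas as pd

# Hypothetical log: one row per recommendation shown, with a click flag
rec_log = pd.DataFrame({
    'user': [1, 1, 2, 2, 3],
    'item': ['A', 'B', 'C', 'D', 'A'],
    'clicked': [1, 0, 1, 0, 0]
})
ctr = rec_log['clicked'].mean()
print(f"Recommendation click-through rate: {ctr:.1%}")  # 40.0% for this toy log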
The beauty of starting with minimal code is that you can iterate based on real usage. Deploy a simple content-based recommender, see how users respond, gather data, then enhance with collaborative filtering. This incremental approach avoids over-engineering while ensuring every addition addresses actual needs rather than theoretical improvements.
Conclusion
Building a recommendation system with minimal code is entirely feasible and often sufficient for small to medium-scale applications. By leveraging libraries like scikit-learn for similarity calculations, surprise for matrix factorization, and pandas for data manipulation, you can implement functional recommenders in under 100 lines that deliver genuine value. The key is understanding fundamental approaches—content-based, collaborative, and hybrid—and choosing based on your available data rather than chasing algorithmic sophistication.
The minimal-code approach is not just about quick implementation but about pragmatic engineering. Start simple, deploy quickly, learn from real usage, and iterate based on actual needs. A simple recommender providing modest value today beats a perfect system that never ships. As your data grows and requirements evolve, the same foundational code can be enhanced incrementally without architectural rewrites. Focus on solving your users’ actual problems rather than building impressive but unnecessary complexity.