Size Recommendations

Zalando

A Hierarchical Bayesian Model for Size Recommendation in Fashion

  • Represent articles as a combination of brand, usage, size, and fit
  • A neural network is then trained to learn a latent vector describing each article defined as the combination of features mentioned before
  • Customer vector representation is obtained by aggregating over purchased articles
  • Finally, a gradient boosted classifier predicts the fit of an article to a customer: it learns the joint probability of a customer purchasing a given product size and the resulting return status being either too small, too big or no return

Size recommendations

Asos Approach

Neural Collaborative Filtering approach

Collaborative filtering uses customer-product interactions and is based on the assumption that customers buying similar products have similar tastes. This principle naturally translates into the size and fit domain as "customers with similar body shapes tend to buy clothes in similar sizes".

  • learn a latent space at a product size level instead of at a product level i.e. we have a different vector for every possible size of a product
  • asymmetric framework so that users are not represented explicitly, but as the aggregate of the product vectors with which they have interacted
  • train different models for each product category (tops, bottoms or shoes), so all trained embeddings belong to the same category and the learned latent space represents the same body part

Size recommendations

Asos Approach

Transfer from Brand to Product

Model of product-size combinations instead of just products => product-size interaction matrix ~10 times sparser than for product recommendations

  • Assume that each brand has consistent sizes
  • Learn latent representations Vbs for every combination of brand b and size s
  • Transfer this to a product level: Vps = Vbs , ∀ps ∈ bs

Impacts

  • Improves generalisation
  • Boosts performance
  • Leads to faster convergence

Multiple Personas detection

Goal: Reduce noise in recommendations

State of the Art

  • Explicitly use different user profiles (Netflix)
  • Use of empirically determined thresholds on the range of purchased sizes
  • Statistical models
  • Filter out users where the mean and std deviation of the purchased sizes exceeds a category level threshold
  • Hierarchical clustering method where clusters are iteratively merged as long as the std deviation does not exceed an empirical threshold
  • Persona distribution learned from a dirichlet distribution
  • Gaussian kernel density estimation approach refined to a gaussian mixture model

Multiple Personas detection

Asos approach

  • Gaussian Mixture Model
  • Assumes that one user is centred around one core size
  • Each persona is represented by a Gaussian distribution, whose mean µ corresponds to the persona’s core size

Multiple Personas detection

Asos approach

Process

  • Run the GMM to detect λ personas within Hu

    • The parameter λ (number of personas) is iteratively increased as long as
      • sλ is higher than sλ-1
      • The core size of each mixture component differs by at least 1 size unit
  • Calculating the silhouette score sλ associated with that mixture

When the iterative process is finished, if λ > 1 that customer is identified as buying for multiple personas.

Accounts with multiple personas are removed from training and testing sets for size recommendations.

Multiple Personas detection

Asos approach

UK3, UK3, UK3.5, UK4, UK4                         ➞ 1 persona

{UK2, UK2}, {UK5, UK5, UK6}                       ➞ 2 personas

{UK2, UK3, UK3, UK3, UK4, UK4}, {UK6, UK6}, {UK9} ➞ 3 personas

UK2, UK3, UK4, UK5, UK6, UK6, UK7, UK8, UK9       ➞ reseller