• Graduate program
  • Research
  • Summer School
  • Events
    • Summer School
      • Applied Public Policy Evaluation
      • Deep Learning
      • Economics of Blockchain and Digital Currencies
      • Economics of Climate Change
      • Foundations of Machine Learning with Applications in Python
      • From Preference to Choice: The Economic Theory of Decision-Making
      • Gender in Society
      • Machine Learning for Business
      • Sustainable Finance
      • Tuition Fees and Payment
      • Business Data Science Summer School Program
    • Events Calendar
    • Events Archive
    • Tinbergen Institute Lectures
    • 16th Tinbergen Institute Annual Conference
    • Annual Tinbergen Institute Conference
  • News
  • Alumni
  • Magazine
Home | Events Archive | A Contextual Bandit Algorithm for Linear Mixed Effects Models
Research Master Pre-Defense

A Contextual Bandit Algorithm for Linear Mixed Effects Models


  • Series
    Research Master Defense
  • Speaker
    Hong Deng
  • Location
    Online
  • Date and time

    August 28, 2020
    15:00 - 16:00

The thesis generalizes the linear contextual bandit problems for potentially individual-clustered data. Upper confidence bound-typed bandit algorithms are widely used for contextually dependent decisions, such as customized recommender systems; however, the correlations of observations within individuals are rarely discussed in prior work. To allow for the presence of individual heterogeneity, linear mixed effects models are imposed for the reward generation, and a learning algorithm taking into account individual heterogeneity, called LIME-UCB, is proposed. The algorithm constructs the confidence interval by combing information across and within individuals, and achieves efficient learning for data with high level of individual heterogeneity.