R: Implementing Elo ratings for team game; assigning values to multiple variables from within a loop

Question

I have data that looks like this:

  a1   a2   a3   a4   a5   h1   h2   h3   h4   h5 a.evt.score   h.evt.score
3311 4003 2737 3784 4177 2632  726  633  438 5444           0             1
1696  371 4471 2119  274 1947 5745 3622  438 5444           1             0           
1696  371 4471 1199 2230 1947 5745 3622 5034 4166           1             0 
3191 4471 2737  274 2230 3598  633 5034 5444 3485           1             0
3191 3685 3486 3784 4177 2632  726  633  438 5444           0             1 
127  713 1609 5444 4166 3311  371 4471 1199 2230           1             0
127  713 1609 2345 3485 1696 4003 2737 1199 2230           1             0
127  713 1609 2345 3485 1696 4003 2737 1199 2230           1             0
1947 5745 3622  438 5444 3311  371 4471 3784 4177           1             0
2632  726  633 5444 4166 3191 3685 3486  274 2230           0             1
2632  726  633  438 5444 3191 3685 3486 3784 4177           0             1
5745 3598 5198 4166 3485 1696 4003 2737  274 2230           0             1
2632  726  633 2345 5034 3311  371 4471 3784 4177           1             0
127 3859  726  438 5444 1696 4003 2737 2119  274           1             0
2632  713  633 5034 4166 3191 3685 3486 3784 4177           1             0

The numbers in the a1, a2, a3..., h4, h5 columns are unique ids of players. (a1, ... , a5) play on the "away" team, and (h1, ..., h5) are their opponents.

Each row is an event in the game.

"a.evt.score" indicates whether or not the away team "won" the event.

I would like to, for each player, calculate his Elo rating after every event (row) in the data.

The formula used to calculate a players' rating is:

R_new = R_old + k*(Score - Expected)

Where "Score" is 1 if the team wins the event, and 0 if not.

Let k be 30 (tells how much each event influences the overall rating).

And have every player start with an R_old of 2200.

"Expected", I calculate with the formula (say we are looking at player 1 on the away team):

h.R <- c(h1.R, h2.R, h3.R, h4.R, h5.R)
a1.E <- sum(1/(1+10^((h.R - a1.R)/400)))/5

So, a1's new rating would be:

a1.R <- a1.R + 30*(a.evt.score - a1.E)

I would like my end result to be a vector, for every player, of their history of Elo ratings.

So, for every row in the data, I would like to:

Get the most recent Elo for every player involved. Set this to R_old.
For each player, calculate a new Elo based on the result of the event.
Append this new rating (R_new) to the start of each players' history vector.

The issue I'm running into is that I can't figure out how to pull a value (R_old) from a named variable (a given player's Elo history vector) when I'm inside a loop/apply function, or how to append the calculated rating to the variable.

How can I go about doing the above?

second last row in the example has both a.evt.score and h.evt.score as 1. How do I interpret that? — Ricky
Also I presume you will need a starting rating for this to be meaningful? i.e. the very first R_old for all players? Or do we just assume everyone start at 0 rating (in which case you'll see everyone in the team having the same ratings after round 1)? It would help if you provide a sample initial R_old vector giving vectors for all unique ids in the table. — Ricky
Thanks for catching that Ricky. And every skater starts with a 2200 rating. — Colin

Tensibai Tensibai · Accepted Answer · 2015-12-16T16:01:28

My best bet, there's probably room for improvement.

The main idea is to build a list of players, with one entry by player id to store the player score history.

The new score calculation is done in a separate function, maybe I didn't get exactly what you're wishing to do. I hope I commented enough to explain what's going on.

k<-30
ateam<-paste0("a",1:5)
hteam<-paste0("h",1:5)
playersid <- unique(unname( unlist( datas[, c(ateam,hteam) ] ) ))
scores=as.list(rep(2200,length(playersid)))
names(scores)<-playersid

getPlayerScore <- function(player,team_score,opponents_scores) {
  old_score <- scores[[as.character(player)]][1]
  expect <- sum(1/10^((opponents_scores - old_score)/400))/5
  return(old_score + k*(team_score - expect))
}

updateTeamPlayersScore<-function(row,team) {
  opteam<-ifelse(team=="a","h","a") # get the team we're against
  players <- unlist(row[get(paste0(team,"team"))]) # get the players list
  opponents <- unlist(row[get(paste0(opteam,"team"))]) # get the oppenents list
  # Get the oppents scores 
  opponents_score <- sapply(scores[as.character(opponents)],function(x) { x[[1]] } ) 
  # loop over the players and return the list of updated scores
  r<-lapply(players,function(x) {
    new_score <- getPlayerScore(x,as.numeric(row[paste0(team,".evt.score")]),opponents_score)
    c(new_score,scores[[as.character(x)]])
  })
  # Update the list names
  names(r) <- as.character(opponents)
  r # return the new scores list
}

# loop over the rows.
# The update is done after calculation to avoid side-effect on h scores with updated a scores
for (i in 1:nrow(datas)) {
  row <- datas[i,]
  # Get updated scores for team a
  new_a <- updateTeamPlayersScore(row,"a")
  # Get updated scores for team h
  new_h <- updateTeamPlayersScore(row,"h")
  # update team 'a' scores
  scores[names(new_a)] <- new_a
  # update team 'h' scores
  scores[names(new_h)] <- new_h
}

Result

> head(scores)
$`3311`
[1] 2124.757 2119.203 2111.189 2136.164 2165.133 2200.000

$`1696`
[1] 2135.691 2135.032 2170.030 2168.635 2200.000 2200.000

$`3191`
[1] 2142.342 2141.330 2176.560 2174.560 2170.000 2200.000

$`127`
[1] 2098.406 2123.018 2158.292 2193.603 2200.000

$`1947`
[1] 2158.292 2193.603 2200.000

$`2632`
[1] 2100.837 2132.849 2168.509 2173.636 2170.000 2200.000

Data used:

datas<-read.table(text="  a1   a2   a3   a4   a5   h1   h2   h3   h4   h5 a.evt.score   h.evt.score
    3311 4003 2737 3784 4177 2632  726  633  438 5444           0             1
    1696  371 4471 2119  274 1947 5745 3622  438 5444           1             0           
    1696  371 4471 1199 2230 1947 5745 3622 5034 4166           1             0 
    3191 4471 2737  274 2230 3598  633 5034 5444 3485           1             0
    3191 3685 3486 3784 4177 2632  726  633  438 5444           0             1 
    127  713 1609 5444 4166 3311  371 4471 1199 2230           1             0
    127  713 1609 2345 3485 1696 4003 2737 1199 2230           1             0
    127  713 1609 2345 3485 1696 4003 2737 1199 2230           1             0
    1947 5745 3622  438 5444 3311  371 4471 3784 4177           1             0
    2632  726  633 5444 4166 3191 3685 3486  274 2230           0             1
    2632  726  633  438 5444 3191 3685 3486 3784 4177           0             1
    5745 3598 5198 4166 3485 1696 4003 2737  274 2230           0             1
    2632  726  633 2345 5034 3311  371 4471 3784 4177           1             0
    127 3859  726  438 5444 1696 4003 2737 2119  274           1             0
    2632  713  633 5034 4166 3191 3685 3486 3784 4177           1             0",header=T)

R: Implementing Elo ratings for team game; assigning values to multiple variables from within a loop

2 Answers