Using R, I am trying to extract data from a raster layer using a polygon layer. The polygons are much smaller than the raster cells:
Now I call extract()
from raster
library:
a <- extract(raster, polygons, weights = TRUE, small = TRUE)
a
# ...
# [[1551]]
# value weight
# 209 0.03 # top left cell - more than 50% of the polygon area
There are two problems - the weight is the proportion of the cell area covered by the polygon, and the weights are rounded to 1/100. In my case, only the top left cell is present in the output (value 209) - the weight of 3 other cells was rounded to zero and they were excluded. However, the bottom left cell covers significant proportion of the polygon and should be included also!
I need a proper weighted mean. Can this be done somehow else using extract()
? Or any other way?
PS: note aside: I think the weights in extract()
are not designed very well - the weight should be the proportion of polygon area covered by the particular cell, not vice versa. Then, the weighted mean for the polygon would be also easier to compute (just multiply the two numbers in each row and sum up), and rounding to 1/100 wouldn't be a big problem.
Reproducible example - (download the files - simplified version, actual data are much bigger):
require(raster)
rast <- raster("my.tif")
poly <- readOGR(".", "socc_buff_Rx")
a <- extract(rast, poly, weights = TRUE, small = TRUE)
a
extract
, 4) increase the raster resolution by further splitting the cells (your idea). So it depends which solution is in fact easiest and also computationally more efficient. And I don't know yet how to implement these solutions... – Tomas