Statistical model for the distribution of field size:
In order to have a distribution of the oil field sizes, I checked the validity of lognormal model:
Figure 9
The blue points are Simmons data for the 6 oil field categories he defined (see Figure 8 ). The estimated sigma value is 4.2 when oil production is expressed in thousands of barrels per day. The fit is rather good.
Statistical model for the genration of random discovery data:
As discussed before, I'm missing a pdf for the small field discoveries. So I took the pdf of next category (Cat. 5) and rescaled it by the expected total numbers of discovery for small fields (4,000 up to the year 2000).
The algorithm to generate random discovery data is the following (this process is repeated N times):
$this->bbcode_second_pass_code('', '
N= total number of oil fields
D= random variable associated to the decade of the discovery (we cosnider 8 decades from the 40s to 2010s).
Category= random variable associated with the field size (from supergiants (Cat. 1) to small fields (Cat. 6)).
P= random variable associated with the mean daily production output of an oil field (in thousands of barrels per day).
1) Randomly choose a decade d according to the following distribution:
prob(D) = [0.0623 0.0970 0.1577 0.1572 0.1317 0.1314 0.1314 0.1314]
2) According to the value of D=d, randomly choose a field category:
prob(Category | D= d) =[
0.0057 0.0018 0 0.0011 0 0 0 0
0.0057 0.0055 0.0034 0.0011 0.0014 0 0 0
0.0086 0.0018 0.0068 0.0011 0.0014 0 0 0
0.0230 0.0074 0.0068 0.0103 0.0014 0.0014 0.0014 0.0014
0.0144 0.0148 0.0148 0.0148 0.0150 0.0150 0.0150 0.0150
0.9425 0.9686 0.9682 0.9715 0.9810 0.9836 0.9836 0.9836]
2) According to the value of D=d and Category= c, randomly choose a field production output:
prob(P= p | D= d, Category= c)= ModifiedLognormal(p|sigma)
3) Derive an URR from the value of p:
beta ~ Unform([50, 110[)
URR= beta x p / (365x1e6)
')
Remarks:- The first decade (40s) is in fact covering the period 1900-1940.
- The last step (3) has to be more justified because I don't know how to relates a field average production output and its URR.
The function called ModifiedLognormal() is a resampling of the original learn lognormal distribution on Figure 1 (a technique called importance sampling I believe). This technique produces a modified lognormal distribution based on the different pattern of discoveries observed for each decades.
$this->bbcode_second_pass_code('', '
According to the value c of the realization of the random variable Category | D (ranging from 1 to 6) we do the following:
ModifiedLognormal:
1) beta ~ Uniform([minp(c) maxp(c)[)
2) We find the production p such as:
LogNormalCDF(p) >= beta
[minp(c) maxp(c)[ is the production interval defined for the category c. For instance, if c= 2 we have [minp(c) maxp(c)[= [500 1,000[ tbpd.
')
Results:
Here are the results from a small simulation from 40 runs with N= 5,500 fields.
Figure 10
Figure 11
Figure 12
We can see that small fields have a large contribution The average peak position is:
2013.95 +/- 2 years at 30.55 +/ 1.0 Gb
URR= 2.664 +/- 0.11 Tb
Figure 13
Conclusions:
1. This an optimistic scenario because I repeated the discovery pattern from the 90s to the following decades (2000s and 2010s). Clearly, it's not obvious that we will have the same pattern.
2. A quick simulation seems to replicate expected proportions between the different field categories (supergiants and small fields produced 20% and 53% respectively of the world total production in 2000). There is a peak for small fields production only in 2030 as well as for Cat. 5 fields. Maybe this kind of fields are being overlooked.
3. the statistical relationship between a field mean production and its URR is not trivial.
4. The choice of the number N of oil fields influence greatly the result.
5. The statistical model to simulate worldwide discoveries (Figure 10) seems to replicate well the data from Simmons (Figure 8 ).