by khebab » Thu 12 Jan 2006, 02:01:33
Motivation:
There is a lot of debate about why a sum of individual oil field productions should produce a symmetric curve. Intuitively, we feel that there is some degree of connection with the
Central Limit Theorem:
$this->bbcode_second_pass_quote('', 'a')ny sum of many independent identically distributed random variables will tend to be distributed according to a particular "attractor distribution". The most important and famous result is called simply The Central Limit Theorem which states that if the sum of the variables has a finite variance, then it will be approximately normally distributed
However, individual oil field productions are not random variables and are rather deterministic therefore the Central Limit Theorem cannot be invoked directly. The objectives are the following:
1- look at the conditions required to converge toward a particular "attractor distribution"
2- test the validity of the logisitic and gaussian distributions
Methodology:I made the following assumptions:
- production profiles of individual fields follow a triangular distribution function
- the URR, the left side slope (angle beta) , the right side slope (angle gamma) and the time of production start are random variables
three relevant properties for an oil field are the surface of the triangle (URR) and the slopes on the left and right side of the peak (beta and gamma angles) which represents the production growth an decline rates respectively.
[align=center]

[/align]
I use basic properties of triangles to derive a, b and c from beta, gamma and the URR: The law of sines and cosines:
[align=center]


[/align]
By manipulating this two equations we get:
$this->bbcode_second_pass_code('', 'K1= sin(beta)/sin(gamma)
K2= 2*URR/sin(beta)
A=1-K1^2
B= 2*K1*K2*cos(gamma)
C= -K2^2
Delta= B^2 - 4.0*A*C
c= sqrt((-B+sqrt(Delta))/(2*A))
a= K2 / c
b= c * K1')
Consequently, the oil field production profile is the following:
$this->bbcode_second_pass_code('', 'f(x|alpha,beta,t,URR)= (x-t)/(a*c*cos(beta)), if x >= t and x <= t + c*cos(beta)
f(x|alpha,beta,t,URR)= (t + a - x)/(a*(a-c*cos(beta))), if x > t + c*cos(beta) and x <= t + a')
For now, I assume uniform distributions for the different random variables:
- alpha and beta are distributed within an angle domain such as the resulting slopes are between 2% and 15%,
- the URR is uniformly distributed between 5 and 20,
- the starting year t between 1 and 21
Results:Erratum: for the subplot at the bottom left corner of each figure, the y axis label should be "Prod. / Cum. Prod." instead of "Prod."[align=center]

Result with 10 oil fields
[/align]
[align=center]

Result with 100 oil fields
[/align]
[align=center]

Result with 1,000 oil fields[/align]
[align=center]

Result with 10,000 oil fields[/align]
Discussion:1- the total production seems not to converge completely toward a gaussian or a logistic distribution in particular on the tails. However, these two models seem valid
2- the resulting curve is slightly asymmetric (skewness= 0.43) and a gamma function is maybe more appropriate
3- this is a simple model and there are many improvements possible in particular on the probability distribution function of the different variables
Edit (01/12/2006): corrections of a few english mistakes! sorry about that, english is not my first language (especially late at night).