Clusters of Quakes and Wells
I decided to look at the spatial correlations of clusters of quake and well data in Oklahoma after the previous blogs showing a strong time-series correlation. The main conclusion of this analysis is that every major cluster of earthquakes is associated with a cluster of injection wells.
Data cleaning
I took the same data sets as in the previous blogs. I updated the cleaning of well data to include only wells with a MaxPress listed above 1000 (presumably psi) and those listed as "active". The result is a list of 3200 wells active high pressure wells.
Establishing Clusters
A key element of cluster analysis is the choice of the distance function. I played around with various functions and ended up using the geographical distance squared. If you imagine that planar diffusion spreads as a circle this makes sense.
To speed up calculation I wrote my own Mathematica function for the geographic distance between points
mapsqDistance[x_List, y_List] :=
Block[{dist},
dist = (0.663*(x[[1]] - y[[1]])^2 + (x[[2]] - y[[2]])^2)]
It accounts for the latitude of Oklahoma (about 35.5 degrees). Results were accurate to 0.25% between individual points about 70 miles apart.
Mapping the results
I experimented with different numbers of clusters of wells and quakes and generally found about 7-8 was the right range. Fewer and ajor groups were missed. Many more and there was a lot of overlap between clusters.
A result of a Well cluster run is shown below for NCluster = 7. I did several of these with different random orders of sequence of data points. The clusters were very stable with only minor differences between runs.
![]() |
| Clusters of Injection Wells in Oklahoma |
The locations of individual wells as shown, with points color coded for the cluster. The large circles are centered at the median of the cluster data points and the radius of the circle is representative of the number of points in the cluster and their distance apart.
Radius = 0.004*Sqrt[(WCluster[[i]] // Length)/Norm[StandardDeviation[WCluster[[i]]]]]]
The square root ensures the area of the circle is proportional to the number of points. A large standard deviation is indicated by a smaller circle (less tightly clustered).
Here are the clusters for the earthquakes with NCluster = 8. The tight clusters of earthquakes in the Tulsa area are evident as are less concentrated clusters in the South and Southeast of the State (as revealed by the small circles).
![]() |
| Clusters of Earthquakes in Oklahoma |
While the graphs are visually appealing the standard deviation is confounded with the number of points in the cluster. A cluster with a large number of points spread over a large area would be hard to discern from a small tight cluster.
Going 3D
Making the circle into cylinders solves the problem of confounding. In the plot below the radius of the cylinder is proportional to the square root of the standard deviation (so the area of the cylinder is related to the spatial distribution of points). And I have made the volume of the cylinder equal to the number of points in the cluster (this means the height is equal to the number of elements divided by the standard deviation).
This analysis shows quite plainly the correlation between wells and earthquakes. While the overlaps are not perfect, one can make an intuitive connection between well clusters and earthquake clusters in an almost 1:1 fashion. Every major cluster of earthquakes is closely associated with a cluster of wells.
The converse isn't true; the well cluster in the panhandle has no earthquake associated with it. However, I discovered quite by accident (forgot to turn on the spatial filtering in the earthquake data cleaning at one point) there is an earthquake clusters in the Texas panhandle just to the south of it. I don't have time this morning but I'll post that analysis later on.
It's interesting that the overall numbers of wells don't correlate well to the number of earthquakes. This might be related to further details (like magnitude of the quake or injection volume) that I haven't yet explored.








