This is another article in our "Solar Energy Data Analysis" series, which analyzes data available from the Open PV project website. The full dataset is huge, so we decided to work with a subset of the data covering California. If you're not yet familiar with the Open PV project, it's a collaborative initiative between government, industry and the public sector that compiles PV installation data and makes it publicly available. The data used in this analysis is available at this link.
The subset covers solar PV system installations in California from July 1998 to January 2016. The CSV dataset contains many columns, but to keep the task manageable we used only cost_per_watt, which is the total cost of the installation divided by the number of watts the system is capable of producing. The analysis was done in a Jupyter notebook using IPython, so the Python code used to make the graph with plot.ly is also included.
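
To make the cost_per_watt definition concrete, here is a minimal sketch of how that ratio could be computed from a total cost and a system size. The column names `cost` and `size_kw` and the helper functions are assumptions for illustration only, and the sample rows are invented rather than taken from the Open PV data (where cost_per_watt is already provided as a column).

```python
import csv
import io

# Hypothetical sample rows: invented for illustration, NOT real Open PV records.
# The column names "cost" (USD) and "size_kw" (kilowatts) are assumptions.
SAMPLE_CSV = """cost,size_kw
20000,5.0
36000,8.0
,4.0
"""


def cost_per_watt(total_cost, size_kw):
    """Return total installed cost divided by system capacity in watts."""
    watts = size_kw * 1000.0
    if watts <= 0:
        raise ValueError("system size must be positive")
    return total_cost / watts


def load_cost_per_watt(csv_file):
    """Compute cost per watt for each row, skipping rows with missing or bad values."""
    values = []
    for row in csv.DictReader(csv_file):
        try:
            values.append(cost_per_watt(float(row["cost"]), float(row["size_kw"])))
        except ValueError:
            # Empty or non-numeric fields (or a non-positive size) are skipped.
            continue
    return values


values = load_cost_per_watt(io.StringIO(SAMPLE_CSV))
print(values)
```

With the invented rows above, the third row is skipped because its cost is missing, which mirrors the kind of cleaning a real installation dataset usually needs before plotting.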