As of 2023, the HPWREN website (http://hpwren.ucsd.edu) was the locally
maintained culmination of many years' efforts by a very small support
staff. This article presents an overview of its transition to a more
automated AWS based cloud service (which today is in its initial
prototype form and remains a work in progress).
Key take-aways from this article include:
Legacy HPWREN image workflow and website system challenges
Limited operational improvements over time to the legacy system with attempts to
Anticipated AWS longer term benefits
This article describes the recent (and ongoing) transition of the HPWREN camera image and website data management infrastructure from traditional in house servers and processes to AWS cloud based services. It provides an overview of previous legacy data flows, the recent (Phase I) transition to AWS, and anticipated future (Phase II) AWS enhancements. In short, it summarizes how we got here, where we are currently, and where we are going. See Appendices for diagrams illustrating legacy as well as emerging AWS based workflows.
In support of its Public Safety and Research and Education missions, HPWREN currently collects camera images from many hundreds of cameras throughout Southern California at a rate of (at least) 1 image per minute, and has done so since around 2001 starting with a handful of cameras in San Diego. We also build, publish and store videos from the newest images every three hours. Access to this imagery is available through https://hpwren.ucsd.edu/cameras
For over two decades, images have been fetched, processed and stored by HPWREN in-house servers and provided to the public using our website. The system was established and maintained by a small handful of people, and remains today similarly supported by a sparse staff. Given the growth to many hundreds of cameras across many remote Southern California sites, staff support is stretched to its limit. Additionally, the increases in cameras over time have led to the need for multiple independent processes to collect and process camera images and a multitude of servers, with ever increasing storage expansion requirements, for their housing, publication and archival. Until recently, the effort to integrate a new camera into our workflow and [re]build the multitude of website files required the manual editing of dozens of files and scripts across many servers. In 2023 we built a set of comma separated text files and control scripts to help automate the creation of many of those files.
In short, at the start of our third decade of operations, we were maintaining dozens of legacy servers (and VMs) with the prospect of needing at least a half dozen or more for further required storage expansion. By this time we had reached the limits of storage expansion or our existing servers, had started backing up storage to shelved disks, had initiated manual cloud backups of older data, had begun efforts to decimate past image data to reduce storage requirements, and were on the verge of adopting a self managed distributed storage system in the form of a CEPH cluster (the cluster itself depending on half a dozen or so new servers). The feasibility of establishing and managing the CEPH cluster, along with all the rest of our legacy systems, came into question.
For the previous few years, HPWREN had been investigating cloud based alternatives, and in particular the costs and benefits of AWS S3 based storage. S3, EdgeFS and CEPH storage experiments had also taken place in conjunction with the our participation in the Pacific Research Platform (https://nationalresearchplatform.org/media/pacific-research-platform-video). We also used AlertCEPH (a storage system maintained by https://alertcalifornia.org/) in early 2022 and 2023. In 2023, at the invitation of our UCSD campus partner, Qualcomm Institute (https://qi.ucsd.edu), HPWREN participated in an AWS proof of concept, working with Xpertec Solutions, which ultimately led to the current Phase I AWS project. This Phase I project recently transitioned to production on 4/1/24 with the new websire at https://www.hpwren.ucsd.edu .
The somewhat oversimplified, but general HPWREN camera workflow, prior to AWS adoption, can be summarized as follows:
To add a new
site's cameras to the system, prior to AWS adoption, the following
actions took place (these actions are applied to up to 12 camera imagers
for each new site)*
*Some of the initial steps above were automated in 2023, driven by newly created scripts referencing updated csv and config files which maintained all needed camera information and status
The initial goals (Phase I) of the AWS proof of concept project were one or more of the following
A collection of AWS based tools were deployed initially to replicate the
HPWREN website and camera image workflows for developing a new AWS
hosted website and archival data access mechanism. These goals have
generally been accomplished (by the release to production on 4/1/24),
though various implementation idiosyncrasies identified as HPWREN staff
are now using those tools, are still being worked through.
At the completion of Phase I, the general process for website additions
included:
To make general website changes:
Around 2015
Around 2020
Around 2022
Overall 2024 architecture
Updating the Website