On-demand webinar
The Hotel NERSC Data Collect: Where Data Checks In, But Never Checks Out
Hosted by:
Thomas Davis
Cary Whitney
Overview
The NERSC data collect system is designed to provide access to 30TB of logs and time-series data generated by the supercomputers at Berkeley Lab. This talk will cover the life of an index inside the cluster, from initial tagging, node routing, snapshot/restore, use of aliases to combine indexes, and archiving on high disk capacity nodes using generic hardware. Additionally, Thomas and Cary will highlight several aspects of using Elasticsearch as a large, long term data storage engine, including index allocation tagging, use of index aliases, Curator and scripts to generate snapshots, long term archiving of these snapshots, and restoration.
![Video thumbnail](https://play.vidyard.com/Y21CNQeCB1Atg62fouUnNv.jpg)
View next
![](/static-res/images/video-thumbnail-elastic-logo-monitor.png)
![](https://play.vidyard.com/i77AyvNf78wwtNSzmn98gk.jpg)
![](/static-res/images/video-thumbnail-elastic-logo-monitor.png)
![](https://play.vidyard.com/7JSnbxHEL4sBC44rSFFK5n.jpg)
![](https://play.vidyard.com/d32UNVVq4UV9dMsZ9qbmhA.jpg)