Welcome to the CERN Open Data discussion forum!
The CERN Open Data portal manages several petabytes of open data from particle physics.
The data are released by the LHC collaborations in periodic batches after an embargo period. The portal contains raw data samples, experimental collision datasets and simulated datasets suitable for physics research (in ROOT format); dedicated data samples for designated communities such as Machine Learning (in H5 format); and simplified derived datasets and event display files suitable for education (in CSV and JSON formats).
The data are accompanied by detailed provenance information, configuration files and associated documentation. Their usage is demonstrated in several analysis examples that can be run with the provided Virtual Machine images and Docker containers, or in Jupyter notebooks.