I am wondering if there is anywhere to seek help with a published dataset. The dataset which is listed here (CERN Open Data Portal) shows that there should only be certain values in column 10, but there are tons of files which have values outside of the given range…
For one concrete example, in the following filename there values outside of (-99, -11, 11, 0) which are listed in the above link as the possible values in that column. There are many such files which prevents any model from learning correctly. I think the model trained in the paper on the above linked page must have used a correct version and there were probably errors somewhere when it was uploaded to the portal.
Is there anyway I can get help on this dataset?