<?xml version='1.0' encoding='UTF-8'?>
<?xml-stylesheet type="text/xsl" href="/static/xsl/oai2.xsl"?>
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd">
  <responseDate>2026-04-04T18:19:18Z</responseDate>
  <request metadataPrefix="oai_datacite" identifier="oai:rodare.hzdr.de:4526" verb="GetRecord">https://rodare.hzdr.de/oai2d</request>
  <GetRecord>
    <record>
      <header>
        <identifier>oai:rodare.hzdr.de:4526</identifier>
        <datestamp>2026-03-13T10:17:14Z</datestamp>
        <setSpec>openaire_data</setSpec>
        <setSpec>user-novo</setSpec>
        <setSpec>user-rodare</setSpec>
      </header>
      <metadata>
        <oai_datacite xmlns="http://schema.datacite.org/oai/oai-1.0/" xsi:schemaLocation="http://schema.datacite.org/oai/oai-1.0/ oai_datacite.xsd">
          <isReferenceQuality>true</isReferenceQuality>
          <schemaVersion>3.1</schemaVersion>
          <datacentreSymbol>HZDR.RODARE</datacentreSymbol>
          <payload>
            <resource xmlns="http://datacite.org/schema/kernel-3" xsi:schemaLocation="http://datacite.org/schema/kernel-3 http://schema.datacite.org/meta/kernel-3/metadata.xsd">
              <identifier identifierType="DOI">10.14278/rodare.4526</identifier>
              <creators>
                <creator>
                  <creatorName>Ratliff, Hunter</creatorName>
                  <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0003-3761-5415</nameIdentifier>
                  <affiliation>Department of Computer science, Electrical engineering and Mathematical sciences, Western Norway University of Applied Sciences, Inndalsveien 28, Bergen, 5063, Vestland, Norway</affiliation>
                </creator>
                <creator>
                  <creatorName>Blangiardi, Francesco</creatorName>
                  <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0009-0006-7725-6751</nameIdentifier>
                  <affiliation>Technology Methods and Systems Data Based Methods, Fraunhofer ENAS, Technologie Campus 3, Chemnitz, 09126, Saxony, Germany</affiliation>
                </creator>
                <creator>
                  <creatorName>Kögler, Toni</creatorName>
                  <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-9501-0898</nameIdentifier>
                  <affiliation>Helmholtz-Zentrum Dresden - Rossendorf</affiliation>
                </creator>
              </creators>
              <titles>
                <title>PHITS simulations of neutron and gamma-ray production from and transport of 70–250 MeV protons in heterogeneous 1D tissue phantoms</title>
              </titles>
              <publisher>Rodare</publisher>
              <publicationYear>2025</publicationYear>
              <subjects>
                <subject>proton therapy</subject>
                <subject>treatment verification</subject>
                <subject>particle transport calculations</subject>
                <subject>PHITS</subject>
              </subjects>
              <dates>
                <date dateType="Issued">2025-09-24</date>
              </dates>
              <language>en</language>
              <resourceType resourceTypeGeneral="Dataset"/>
              <alternateIdentifiers>
                <alternateIdentifier alternateIdentifierType="url">https://rodare.hzdr.de/record/4526</alternateIdentifier>
              </alternateIdentifiers>
              <relatedIdentifiers>
                <relatedIdentifier relatedIdentifierType="URL" relationType="IsIdenticalTo">https://www.hzdr.de/publications/Publ-43014</relatedIdentifier>
                <relatedIdentifier relatedIdentifierType="DOI" relationType="IsPartOf">10.14278/rodare.3996</relatedIdentifier>
                <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://rodare.hzdr.de/communities/novo</relatedIdentifier>
                <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://rodare.hzdr.de/communities/rodare</relatedIdentifier>
              </relatedIdentifiers>
              <version>1.0.0</version>
              <rightsList>
                <rights rightsURI="https://creativecommons.org/licenses/by/4.0/legalcode">Creative Commons Attribution 4.0 International</rights>
                <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
              </rightsList>
              <descriptions>
                <description descriptionType="Abstract">&lt;p&gt;&lt;strong&gt;PHITS simulations of neutron and gamma-ray production from and transport of 70--250 MeV protons in heterogeneous 1D tissue phantoms&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Hunter N. Ratliff&amp;sup1;, Francesco Blangiardi&amp;sup2;, Toni K&amp;ouml;gler&amp;sup3;˒⁴&lt;/p&gt;

&lt;p&gt;&amp;sup1;Department of Computer science, Electrical engineering and Mathematical sciences, Western Norway University of Applied Sciences, Inndalsveien 28, Bergen, 5063, Vestland, Norway &lt;a href="https://orcid.org/0000-0003-3761-5415"&gt;ORCID&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&amp;sup2;Technology Methods and Systems Data Based Methods, Fraunhofer ENAS, Technologie Campus 3, Chemnitz, 09126, Saxony, Germany &lt;a href="https://orcid.org/0009-0006-7725-6751"&gt;ORCID&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&amp;sup3;Helmholtz-Zentrum Dresden &amp;mdash; Rossendorf, Institute of Radiooncology &amp;mdash; OncoRay, Dresden, Germany; ⁴OncoRay &amp;mdash; National Center for Radiation Research in Oncology, Faculty of Medicine and University Hospital Carl Gustav Carus, Technische Universit&amp;auml;t Dresden, Helmholtz-Zentrum Dresden &amp;mdash; Rossendorf, Dresden, Germany &lt;a href="https://orcid.org/0000-0002-9501-0898"&gt;ORCID&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Introduction&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This dataset corresponds to the PHITS simulation data used in &lt;em&gt;&amp;quot;Fast proton transport and neutron production in proton therapy using Fourier neural operators&amp;quot;&lt;/em&gt; [1]. A concise description of the simulation setup is provided here; please refer to the paper for detailed discussion, description, analysis, and further results derived from this dataset, along with additional references.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Description of simulations&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This dataset consists of PHITS [2] simulations for 47 different proton energies from 70 MeV to 250 MeV incident upon different &amp;quot;1D&amp;quot; heterogeneous cylindrical phantoms (varied materials every 0.5 mm in length, uniform radially and rotationally) whose composition (materials and sequence along length) are taken from randomly sampled rays cast through a 3D CT phantom with CT number mapped to material composition and density via the &lt;code&gt;HumanVoxelTable-KumamotoUniv.data&lt;/code&gt; conversion table within the RT-PHITS utility distributed with PHITS. Included tallies score spatial distributions of energy deposition, LET, proton current (with an additional angular dimension), neutron production, gamma-ray production, and a variety of diagnostic tallies. Event-by-event &amp;quot;list-mode&amp;quot; data is scored for neutron and gamma-ray production, called &amp;quot;dump&amp;quot; tallies in PHITS.&lt;/p&gt;

&lt;p&gt;Given the objective of these simulations was for AI model development, the 47 energies are divided into 37 &lt;strong&gt;training&lt;/strong&gt; energies (70 MeV to 250 MeV in 5 MeV steps) and 10 &lt;strong&gt;testing&lt;/strong&gt; energies (73 MeV to 245.8 MeV in 19.2 MeV steps). For each energy, two simulations were ran: (1) a simulation with &lt;strong&gt;1E8&lt;/strong&gt; (one hundred million) protons simulated where all &lt;strong&gt;tallies&lt;/strong&gt; (including &lt;strong&gt;dump&lt;/strong&gt; tallies) were included/enabled and (2) a simulation with &lt;strong&gt;1E9&lt;/strong&gt; (one billion) protons simulated but with only &lt;strong&gt;dump&lt;/strong&gt; tallies enabled (other tallies disabled to reduce memory consumption and increase simulation speed). Furthermore, all of the above was actually performed twice: (1) initially with purely &lt;strong&gt;monoenergetic&lt;/strong&gt; beam energies and with a spatial spread of 2.5 mm and (2) a second &amp;quot;more realistic&amp;quot; set with &lt;strong&gt;Gaussian-distributed&lt;/strong&gt; energies (with energy-dependent FWHM) and slightly wider 4.0 mm beam spread.&lt;/p&gt;

&lt;p&gt;All simulation outputs were automatically processed from the plaintext and binary files produced by PHITS into compressed pickle file objects (NumPy arrays, Pandas DataFrames, dictionaries) using the PHITS Tools [3] Python utility. These Python objects were then utilized in the subsequent analysis of the paper this simulation set was generated for. The corresponding data repository used for AI model development can be found at [4].&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Structure of this repository&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The volume of data present in this repository is quite substantial (~700 GB available here / some TB including files only available upon request). Therefore, the repository has been structured in a way to allow flexibility in only downloading data of interest.&lt;/p&gt;

&lt;p&gt;The root directory of this repository consists of 39 top-level directories whose names indicate their contents; each has been &lt;code&gt;.tar&lt;/code&gt; archived and has either undergone &lt;code&gt;.xz&lt;/code&gt; compression via &lt;code&gt;xz&lt;/code&gt; on the tarball or with Python&amp;#39;s LZMA compression on the tarball&amp;#39;s contents prior to archiving. Within each are two directories: &lt;code&gt;training&lt;/code&gt; and &lt;code&gt;testing&lt;/code&gt;. Within each of these are directories of the format &lt;code&gt;???_MeV&lt;/code&gt;, where &lt;code&gt;???&lt;/code&gt; is replaced by three digits specifying the nominal beam energy in MeV. (This is &lt;code&gt;???p?&lt;/code&gt; for the energies of the testing dataset, with &lt;code&gt;p&lt;/code&gt; in place of a decimal point.) Thus, each &lt;code&gt;training&lt;/code&gt; directory contains 37 subdirectories, and each &lt;code&gt;testing&lt;/code&gt; directory contains 10 subdirectories. (One should note that there are no setup differences between &lt;code&gt;training&lt;/code&gt; and &lt;code&gt;testing&lt;/code&gt; data; they are simply divided here in the same way as in the paper.) Each &lt;code&gt;???_MeV&lt;/code&gt;/&lt;code&gt;???p?_MeV&lt;/code&gt; directory contains simulation input/output and/or PHITS Tools processed output, depending on the top-level directory it is contained within. Input and output file names do not differ between different energies; directory structure is used to keep them distinguished/separated.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;PHITS input information&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;One top-level directory differs from all of the others, and this is &lt;code&gt;common_inputs&lt;/code&gt;. As the name suggests, this directory contains all PHITS input information used in generating all of the simulation outputs.&lt;/p&gt;

&lt;p&gt;The core two PHITS input files used are &lt;code&gt;beam-on-target_phits-input_MonoE.inp&lt;/code&gt; for the monoenergetic beam simulation set and &lt;code&gt;beam-on-target_phits-input_GaussE.inp&lt;/code&gt; for the Gaussian-distributed beam energy simulation set. Within these inputs are lines using the PHITS insert file function &lt;code&gt;infl:{*}&lt;/code&gt;; all inserted files used in the PHITS simulations are also contained within this &lt;code&gt;common_inputs&lt;/code&gt; directory. The single exception to this is &lt;code&gt;PARAMETERS_files-1-and-7.txt&lt;/code&gt;, which is simply the &lt;code&gt;file(1)&lt;/code&gt; and &lt;code&gt;file(7)&lt;/code&gt; PHITS &lt;code&gt;[Parameters]&lt;/code&gt; arguments and will be system-specific paths to PHITS installation/data files. Also note that relative paths are used in the &lt;code&gt;infl:{*}&lt;/code&gt; commands; these relative paths differ to how this repository is structured given the repository has been restructured in post for distribution convenience. File names are still unique and can be found in this &lt;code&gt;common_inputs&lt;/code&gt; directory. The &lt;code&gt;CELL&lt;/code&gt; subdirectory contains the &lt;code&gt;[Cell]&lt;/code&gt; sections used for the varied phantom compositions, and the &lt;code&gt;MAPPINGS_OF_ENERGY_TO_CELL_FILES.csv&lt;/code&gt; file details how these files are paired with the 47 different beam energies.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;PHITS outputs (raw and processed)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The remaining 38 top-level directories contain simulation/processed output. When these simulations were ran, all output was contained in each &lt;code&gt;???_MeV&lt;/code&gt; directory. As detailed earlier, these have been split into various top-level directories here to allow more convenient download of only desired files. Nominally, each of these &lt;code&gt;???_MeV&lt;/code&gt; directories contained the following before being split:&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;a &lt;code&gt;beam-on-target_phits-input.inp&lt;/code&gt; PHITS input file (and a simple &lt;code&gt;phits.in&lt;/code&gt; pointing to this input file, needed for parallel running of PHITS); note that these inputs have all specific source energy information populated within this file&lt;/li&gt;
	&lt;li&gt;a &lt;code&gt;phantom_composition_info.csv&lt;/code&gt; file also detailing the phantom composition used for that beam energy&lt;/li&gt;
	&lt;li&gt;&lt;code&gt;phits*.out&lt;/code&gt; file(s), &lt;strong&gt;raw&lt;/strong&gt; summary output files generated by PHITS&lt;/li&gt;
	&lt;li&gt;&lt;code&gt;*.out&lt;/code&gt; &lt;strong&gt;raw&lt;/strong&gt; plaintext tally output files from PHITS&lt;/li&gt;
	&lt;li&gt;&lt;code&gt;*.eps&lt;/code&gt; &lt;strong&gt;graphical&lt;/strong&gt; visualizations of tally output, generated by PHITS&lt;/li&gt;
	&lt;li&gt;&lt;code&gt;*_dmp.out*&lt;/code&gt; &lt;strong&gt;raw&lt;/strong&gt; binary tally &lt;strong&gt;dump&lt;/strong&gt; files from PHITS&lt;/li&gt;
	&lt;li&gt;&lt;code&gt;*.pickle.xz&lt;/code&gt; &lt;strong&gt;processed&lt;/strong&gt; tally output (and &lt;code&gt;phits.out&lt;/code&gt; metadata) from PHITS Tools, LZMA-compressed pickle files&lt;/li&gt;
	&lt;li&gt;&lt;code&gt;*_dmp_namedtuple_list.pickle.xz&lt;/code&gt; &lt;strong&gt;processed&lt;/strong&gt; tally &lt;strong&gt;dump&lt;/strong&gt; output from PHITS Tools, formatted as a NumPy record array (np.recarray)&lt;/li&gt;
	&lt;li&gt;&lt;code&gt;*_dmp_Pandas_df.pickle.xz&lt;/code&gt; &lt;strong&gt;processed&lt;/strong&gt; tally &lt;strong&gt;dump&lt;/strong&gt; output from PHITS Tools, formatted as a Pandas DataFrame (same numerical data as in NumPy recarray)&lt;/li&gt;
	&lt;li&gt;&lt;code&gt;*.png&lt;/code&gt; and &lt;code&gt;*.pdf&lt;/code&gt; &lt;strong&gt;graphical&lt;/strong&gt; visualizations of tally output, generated by PHITS Tools&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The top-level directories of this repository are named in a way to detail (1) which simulations their contents pertain to and (2) which output files are contained within them. The directories are named using an underscore-delimited pattern whose components have the following names and meanings:&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;Beam type:
	&lt;ul&gt;
		&lt;li&gt;&lt;strong&gt;&lt;code&gt;MonoE&lt;/code&gt;&lt;/strong&gt; refers to simulations with the monoenergetic beams with 2.5 mm spread&lt;/li&gt;
		&lt;li&gt;&lt;strong&gt;&lt;code&gt;GaussE&lt;/code&gt;&lt;/strong&gt; refers to simulations with the Gaussian-distributed energies and 4.0 mm spread&lt;/li&gt;
	&lt;/ul&gt;
	&lt;/li&gt;
	&lt;li&gt;Simulated number of protons:
	&lt;ul&gt;
		&lt;li&gt;&lt;strong&gt;&lt;code&gt;1E8&lt;/code&gt;&lt;/strong&gt; refers to simulations with 10^8 (one hundred million) protons simulated&lt;/li&gt;
		&lt;li&gt;&lt;strong&gt;&lt;code&gt;1E9&lt;/code&gt;&lt;/strong&gt; refers to simulations with 10^9 (one billion) protons simulated&lt;/li&gt;
	&lt;/ul&gt;
	&lt;/li&gt;
	&lt;li&gt;Output source/type:
	&lt;ul&gt;
		&lt;li&gt;&lt;strong&gt;&lt;code&gt;raw&lt;/code&gt;&lt;/strong&gt; refers to the PHITS input and PHITS-generated output&lt;/li&gt;
		&lt;li&gt;&lt;strong&gt;&lt;code&gt;processed&lt;/code&gt;&lt;/strong&gt; refers to the Python-formatted processed output produced by PHITS Tools&lt;/li&gt;
		&lt;li&gt;&lt;strong&gt;&lt;code&gt;plots&lt;/code&gt;&lt;/strong&gt; refers to the &lt;code&gt;*.eps&lt;/code&gt; files produced by PHITS and the &lt;code&gt;*.png&lt;/code&gt; and &lt;code&gt;*.pdf&lt;/code&gt; files produced by PHITS Tools, all containing graphical plots of tally output (only relevant to &lt;strong&gt;&lt;code&gt;1E8&lt;/code&gt;&lt;/strong&gt; simulations)&lt;/li&gt;
	&lt;/ul&gt;
	&lt;/li&gt;
	&lt;li&gt;Other labels:
	&lt;ul&gt;
		&lt;li&gt;&lt;strong&gt;&lt;code&gt;proton-tally&lt;/code&gt;&lt;/strong&gt; refers to output from the huge &lt;code&gt;[T-Cross]&lt;/code&gt; tally used only in &lt;strong&gt;&lt;code&gt;1E8&lt;/code&gt;&lt;/strong&gt; simulations for scoring proton phase space as a function of energy, position, and direction (separated from others owing to its considerable size)&lt;/li&gt;
		&lt;li&gt;&lt;strong&gt;&lt;code&gt;neutron-dump&lt;/code&gt;&lt;/strong&gt; refers to the event-by-event neutron production data scored by a &lt;code&gt;[T-Product]&lt;/code&gt; tally&amp;#39;s &amp;quot;dump&amp;quot; option
		&lt;ul&gt;
			&lt;li&gt;&lt;strong&gt;&lt;code&gt;NumPy&lt;/code&gt;&lt;/strong&gt; and &lt;strong&gt;&lt;code&gt;Pandas&lt;/code&gt;&lt;/strong&gt; to denote if &lt;strong&gt;&lt;code&gt;processed&lt;/code&gt;&lt;/strong&gt; contents are formatted as NumPy record arrays or Pandas Dataframes&lt;/li&gt;
		&lt;/ul&gt;
		&lt;/li&gt;
		&lt;li&gt;&lt;strong&gt;&lt;code&gt;gamma-dump&lt;/code&gt;&lt;/strong&gt; refers to the event-by-event gamma-ray production data scored by a &lt;code&gt;[T-Product]&lt;/code&gt; tally&amp;#39;s &amp;quot;dump&amp;quot; option
		&lt;ul&gt;
			&lt;li&gt;&lt;strong&gt;&lt;code&gt;NumPy&lt;/code&gt;&lt;/strong&gt; and &lt;strong&gt;&lt;code&gt;Pandas&lt;/code&gt;&lt;/strong&gt; to denote if &lt;strong&gt;&lt;code&gt;processed&lt;/code&gt;&lt;/strong&gt; contents are formatted as NumPy record arrays or Pandas Dataframes&lt;/li&gt;
		&lt;/ul&gt;
		&lt;/li&gt;
		&lt;li&gt;&lt;strong&gt;&lt;code&gt;other&lt;/code&gt;&lt;/strong&gt; refers to output from all other tallies aside from the above three (energy deposition, LET, diagnostic tallies, etc.; only relevant to &lt;strong&gt;&lt;code&gt;1E8&lt;/code&gt;&lt;/strong&gt; simulations given all tallies except dump tallies were disabled for &lt;strong&gt;&lt;code&gt;1E9&lt;/code&gt;&lt;/strong&gt; simulations) along with (for &lt;strong&gt;&lt;code&gt;raw&lt;/code&gt;&lt;/strong&gt; directories) PHITS input-related files and &lt;code&gt;phits*.out&lt;/code&gt; file(s).&lt;/li&gt;
	&lt;/ul&gt;
	&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;em&gt;For clarity, the dataset notation here corresponds to that used in [1] as follows: &lt;code&gt;GaussE_1E8&lt;/code&gt; = &lt;code&gt;ES8&lt;/code&gt; and &lt;code&gt;GaussE_1E9&lt;/code&gt; = &lt;code&gt;ES9&lt;/code&gt;. (The paper did not use &lt;code&gt;MonoE_1E8&lt;/code&gt; and &lt;code&gt;MonoE_1E9&lt;/code&gt;, but if it had they would&amp;#39;ve been designated with &lt;code&gt;NES8&lt;/code&gt; and &lt;code&gt;NES9&lt;/code&gt;, respectively.)&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;All put together, this results in the following top-level directories contained in this repository:&lt;/p&gt;

&lt;p&gt;&lt;span class="math-tex"&gt;\(\begin{array}{lrrc} \textbf{Directory} &amp;amp; \textbf{Files} &amp;amp; \textbf{Uncompressed size (GB)} &amp;amp; \textbf{Available upon request} \\ \hline \texttt{common_inputs} &amp;amp; 54 &amp;amp; 0.002 &amp;amp; \\ \texttt{GaussE_1E8_raw_proton-tally} &amp;amp; 564 &amp;amp; 361.30 &amp;amp; \\ \texttt{GaussE_1E8_raw_neutron-dump} &amp;amp; 611 &amp;amp; 41.24 &amp;amp; \\ \texttt{GaussE_1E8_raw_gamma-dump} &amp;amp; 611 &amp;amp; 85.82 &amp;amp; \\ \texttt{GaussE_1E8_raw_other} &amp;amp; 1927 &amp;amp; 22.73 &amp;amp; \\ \texttt{GaussE_1E8_processed_proton-tally} &amp;amp; 564 &amp;amp; 17.19 &amp;amp; \\ \texttt{GaussE_1E8_processed_neutron-dump_NumPy} &amp;amp; 611 &amp;amp; 37.25 &amp;amp; \\ \texttt{GaussE_1E8_processed_neutron-dump_Pandas} &amp;amp; 611 &amp;amp; 45.77 &amp;amp; \\ \texttt{GaussE_1E8_processed_gamma-dump_NumPy} &amp;amp; 611 &amp;amp; 73.81 &amp;amp; \\ \texttt{GaussE_1E8_processed_gamma-dump_Pandas} &amp;amp; 611 &amp;amp; 90.16 &amp;amp; \\ \texttt{GaussE_1E8_processed_other} &amp;amp; 1551 &amp;amp; 4.35 &amp;amp; \\ \texttt{GaussE_1E8_plots} &amp;amp; 3525 &amp;amp; 7.60 &amp;amp; \\ \texttt{GaussE_1E9_raw_neutron-dump} &amp;amp; 1081 &amp;amp; 408.81 &amp;amp; \times \\ \texttt{GaussE_1E9_raw_gamma-dump} &amp;amp; 1081 &amp;amp; 854.58 &amp;amp; \times \\ \texttt{GaussE_1E9_raw_other} &amp;amp; 1316 &amp;amp; 0.59 &amp;amp; \\ \texttt{GaussE_1E9_processed_neutron-dump_NumPy} &amp;amp; 1081 &amp;amp; 372.46 &amp;amp; \times \\ \texttt{GaussE_1E9_processed_neutron-dump_Pandas} &amp;amp; 1081 &amp;amp; 457.42 &amp;amp; \times \\ \texttt{GaussE_1E9_processed_gamma-dump_NumPy} &amp;amp; 1081 &amp;amp; 738.00 &amp;amp; \times \\ \texttt{GaussE_1E9_processed_gamma-dump_Pandas} &amp;amp; 1081 &amp;amp; 901.41 &amp;amp; \times \\ \texttt{GaussE_1E9_processed_other} &amp;amp; 1175 &amp;amp; 0.02 &amp;amp; \\ \texttt{MonoE_1E8_raw_proton-tally} &amp;amp; 94 &amp;amp; 360.92 &amp;amp; \\ \texttt{MonoE_1E8_raw_neutron-dump} &amp;amp; 282 &amp;amp; 40.83 &amp;amp; \\ \texttt{MonoE_1E8_raw_gamma-dump} &amp;amp; 282 &amp;amp; 83.69 &amp;amp; \\ \texttt{MonoE_1E8_raw_other} &amp;amp; 1222 &amp;amp; 30.17 &amp;amp; \\ \texttt{MonoE_1E8_processed_proton-tally} &amp;amp; 47 &amp;amp; 13.69 &amp;amp; \\ \texttt{MonoE_1E8_processed_neutron-dump_NumPy} &amp;amp; 94 &amp;amp; 37.23 &amp;amp; \\ \texttt{MonoE_1E8_processed_neutron-dump_Pandas} &amp;amp; 94 &amp;amp; 45.77 &amp;amp; \\ \texttt{MonoE_1E8_processed_gamma-dump_NumPy} &amp;amp; 94 &amp;amp; 72.23 &amp;amp; \\ \texttt{MonoE_1E8_processed_gamma-dump_Pandas} &amp;amp; 94 &amp;amp; 88.29 &amp;amp; \\ \texttt{MonoE_1E8_processed_other} &amp;amp; 846 &amp;amp; 2.18 &amp;amp; \\ \texttt{MonoE_1E8_plots} &amp;amp; 799 &amp;amp; 2.25 &amp;amp; \\ \texttt{MonoE_1E9_raw_neutron-dump} &amp;amp; 2364 &amp;amp; 407.99 &amp;amp; \times \\ \texttt{MonoE_1E9_raw_gamma-dump} &amp;amp; 2364 &amp;amp; 836.55 &amp;amp; \times \\ \texttt{MonoE_1E9_raw_other} &amp;amp; 329 &amp;amp; 0.04 &amp;amp; \\ \texttt{MonoE_1E9_processed_neutron-dump_NumPy} &amp;amp; 94 &amp;amp; 371.30 &amp;amp; \times \\ \texttt{MonoE_1E9_processed_neutron-dump_Pandas} &amp;amp; 94 &amp;amp; 455.94 &amp;amp; \times \\ \texttt{MonoE_1E9_processed_gamma-dump_NumPy} &amp;amp; 94 &amp;amp; 721.08 &amp;amp; \times \\ \texttt{MonoE_1E9_processed_gamma-dump_Pandas} &amp;amp; 94 &amp;amp; 879.13 &amp;amp; \times \\ \texttt{MonoE_1E9_processed_other} &amp;amp; 188 &amp;amp; 0.01 &amp;amp; \\ \hline \textbf{TOTAL} &amp;amp; \textbf{30397} &amp;amp; \textbf{8969.84} &amp;amp; \end{array}\)&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;(Data marked as &amp;quot;Available upon request&amp;quot; is only available upon additional specific request.)&lt;/p&gt;

&lt;p&gt;And, as stated earlier, each of these top-level directories is divided into a &lt;code&gt;training&lt;/code&gt; subdirectory (containing 37 &lt;code&gt;???_MeV&lt;/code&gt; directories) and a &lt;code&gt;testing&lt;/code&gt; subdirectory (containing 10 &lt;code&gt;???p?_MeV&lt;/code&gt; directories), where the &lt;code&gt;???[p?]_MeV&lt;/code&gt; directories only (1) contain particular files (2) relevant to certain simulations&amp;mdash;as specified by the top-level directory&amp;#39;s name.&lt;/p&gt;

&lt;p&gt;As a note to anyone surveying the &lt;code&gt;raw&lt;/code&gt; files, all &lt;code&gt;GaussE&lt;/code&gt; simulations were ran with OpenMP parallelization with 10 processes. For &lt;code&gt;1E8&lt;/code&gt; simulations, this was conducted as ten PHITS runs of 1E7 protons each; for &lt;code&gt;1E9&lt;/code&gt; simulations, this was conducted as twenty runs of 5E7 protons each. (PHITS runs can be &amp;quot;chained&amp;quot; as &amp;quot;restart calculations&amp;quot;, where one run can resume from where a previous run ended.) In these simulations, the generated &lt;code&gt;phits.out&lt;/code&gt; files from each run were renamed to &lt;code&gt;phits-#A-#B.out&lt;/code&gt; (where &lt;code&gt;#A&lt;/code&gt; was an internal number &lt;code&gt;1&lt;/code&gt; to &lt;code&gt;47&lt;/code&gt; pairing with each simulated beam energy, and &lt;code&gt;#B&lt;/code&gt; is the run number, &lt;code&gt;0&lt;/code&gt; to &lt;code&gt;19&lt;/code&gt;) and moved into a &lt;code&gt;phitsout&lt;/code&gt; subdirectory after each run&amp;#39;s completion. However, this was less uniform for the &lt;code&gt;MonoE&lt;/code&gt; simulations; for those, the strategy was to complete each simulation in a single run of PHITS. This generally involved using a hybrid OpenMP + MPI parallelization with anywhere from 80 to 160 processes each, split between OMP and MPI (noting that some &lt;code&gt;1E9&lt;/code&gt; runs were conducted with only MPI parallelization). None of this influences the output format of the standard tally outputs. However, the number of dump files produced is equal to the number of MPI processes utilized. This means that each &lt;code&gt;GaussE&lt;/code&gt; simulation only has one dump file per dump tally owing to only using OpenMP parallelization (which merges its dump files at the end of calculation) while the &lt;code&gt;MonoE&lt;/code&gt; simulations contain a varied number of dump files per dump tally owing to varriations in parallelization strategies employed in those simulations. PHITS Tools ultimately merges all dump outputs back together in its processing, meaning if looking at the &lt;code&gt;processed&lt;/code&gt; output this quirk of how simulations were conducted should not be apparent at all.&lt;/p&gt;

&lt;p&gt;Given PHITS Tools was under ongoing development as this dataset was being produced, the &lt;code&gt;GaussE&lt;/code&gt; directories contain some extra output not present in the &lt;code&gt;MonoE&lt;/code&gt; directories. Most notably, only for the &lt;code&gt;GaussE&lt;/code&gt; simulations do the &lt;code&gt;plot&lt;/code&gt; directories contain PNG and PDF plot files generated by PHITS Tools and the &lt;code&gt;*_processed_*&lt;/code&gt; directories contain dictionary objects of the processed &lt;code&gt;phits*.out&lt;/code&gt; files.&lt;/p&gt;

&lt;p&gt;Note that, for convenience, the &lt;code&gt;phits*.out&lt;/code&gt; file(s) for each simulation are also copied to all &lt;code&gt;*_raw_*&lt;/code&gt; directories. The &lt;code&gt;phits*.out&lt;/code&gt; file(s) contain the full PHITS input echo, among other information about the simulation. For the &lt;code&gt;GaussE&lt;/code&gt; simulations, these are within a further &lt;code&gt;phitsout&lt;/code&gt; subdirectory for each beam energy. Also for all &lt;code&gt;GaussE_*_processed_*&lt;/code&gt; directories, the processed &lt;code&gt;phits*.out&lt;/code&gt; file(s), &lt;code&gt;phits*_out.pickle.xz&lt;/code&gt;, are included too.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;References&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;[1] F. Blangiardi, H.N. Ratliff et al., &lt;em&gt;&amp;quot;Fast proton transport and neutron production in proton therapy using Fourier neural operators&amp;quot;&lt;/em&gt;, &lt;em&gt;(in preparations for submission)&lt;/em&gt; (2025)&lt;/p&gt;

&lt;p&gt;[2] T. Sato, Y. Iwamoto, S. Hashimoto, T. Ogawa, T. Furuta, S. Abe, T. Kai, Y. Matsuya, N. Matsuda, Y. Hirata, T. Sekikawa, L. Yao, P.E. Tsai, H.N. Ratliff, H. Iwase, Y. Sakaki, K. Sugihara, N. Shigyo, L. Sihver and K. Niita, &lt;em&gt;&amp;quot;Recent improvements of the Particle and Heavy Ion Transport code System - PHITS</description>
              </descriptions>
            </resource>
          </payload>
        </oai_datacite>
      </metadata>
    </record>
  </GetRecord>
</OAI-PMH>
