Dataset Open Access

Boron data set for machine learning applications

Fiedler, Lenz; Cangi, Attila


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.14278/rodare.3746</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">Fiedler, Lenz</subfield>
    <subfield code="u">HZDR</subfield>
    <subfield code="0">(orcid)0000-0002-8311-0613</subfield>
  </datafield>
  <controlfield tag="001">3746</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Cangi, Attila</subfield>
    <subfield code="u">HZDR</subfield>
    <subfield code="0">(orcid)0000-0001-9162-262X</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="a">https://www.hzdr.de/publications/Publ-41336</subfield>
    <subfield code="i">isIdenticalTo</subfield>
    <subfield code="n">url</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="a">https://www.hzdr.de/publications/Publ-40059</subfield>
    <subfield code="i">isReferencedBy</subfield>
    <subfield code="n">url</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="a">10.14278/rodare.3745</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="n">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="o">oai:rodare.hzdr.de:3746</subfield>
    <subfield code="p">openaire_data</subfield>
    <subfield code="p">openaire_data</subfield>
    <subfield code="p">openaire_data</subfield>
    <subfield code="p">openaire_data</subfield>
    <subfield code="p">openaire_data</subfield>
    <subfield code="p">user-rodare</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">7689679534</subfield>
    <subfield code="u">https://rodare.hzdr.de/record/3746/files/bispectrum.zip</subfield>
    <subfield code="z">md5:d8eba936cdebe917f19d434bf01a8d09</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">55798</subfield>
    <subfield code="u">https://rodare.hzdr.de/record/3746/files/dft_inputs.zip</subfield>
    <subfield code="z">md5:bc87667bdeebac1559cfaaf8b9b70806</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">84943</subfield>
    <subfield code="u">https://rodare.hzdr.de/record/3746/files/dft_outputs.zip</subfield>
    <subfield code="z">md5:425809ad0482a02d99eaeacd26b1dd8b</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">19895273856</subfield>
    <subfield code="u">https://rodare.hzdr.de/record/3746/files/ldos.zip</subfield>
    <subfield code="z">md5:e7d3fad65ee4881f91c4585e5c1c88c5</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">32423497</subfield>
    <subfield code="u">https://rodare.hzdr.de/record/3746/files/models.zip</subfield>
    <subfield code="z">md5:988242476b02f61228f6833cefc0fb7b</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-rodare</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;&lt;strong&gt;Boron data set for machine learning applications&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This dataset contains DFT inputs, outputs, LDOS data and bispectrum descriptor vectors for an &amp;alpha;-rhombohedral boron cell of 144 atoms at room temperature and ambient mass density. All simulations have been performed at an LDOS converged k-grid of 4x4x4 k-points.&lt;/p&gt;

&lt;p&gt;This dataset contains one .zip file for each of its five type of data (bispectrum descriptors, LDOS, DFT inputs, DFT outputs and trained models).&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Authors:&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;- Fiedler, Lenz (HZDR / CASUS)&lt;br&gt;
- Cangi, Attila (HZDR / CASUS)&lt;/p&gt;

&lt;p&gt;Affiliations&lt;em&gt;:&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;HZDR - Helmholtz-Zentrum Dresden-Rossendorf&lt;br&gt;
CASUS - Center for Advanced Systems Understanding&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Dataset description&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;- Total size: 26 GB&lt;br&gt;
- System: B144&lt;br&gt;
- Temperature(s): 298K&lt;br&gt;
- Mass density(ies): 2.483 gcc&lt;br&gt;
- Crystal Structure: amorphous (material mp-160 in the materials project)&lt;br&gt;
- Number of atomic snapshots: 15&lt;br&gt;
- Contents:&lt;br&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp; - ideal crystal structure: no&lt;br&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp; - MD trajectory: no&lt;br&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp; - Atomic positions: no&lt;br&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp; - DFT inputs: yes&lt;br&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp; - DFT outputs (energies): yes&lt;br&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp; - SNAP vectors: yes&lt;br&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; - dimensions: 108x108x35x94 (last dimension: first three entries are x,y,z coordinates, data size is 91)&lt;br&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; - units: a.u.&lt;br&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp; - LDOS vectors: yes&lt;br&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; - dimensions: 108x108x35x241&lt;br&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; - units: 1/(eV*Angstrom^3)&lt;br&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp; - trained networks: yes&lt;/p&gt;

&lt;p&gt;&lt;br&gt;
&lt;em&gt;Dataset structure&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;A .zip file is included for each for each of its five type of data:&lt;/p&gt;

&lt;p&gt;- ldos.zip: holds the LDOS vectors (one HDF5 file per snapshot)&lt;br&gt;
- bispectrum.zip: holds the bispectrum fingerprint vectors&amp;nbsp; (one HDF5 file per snapshot)&lt;br&gt;
- dft_outputs: holds the outputs from the DFT calculations, i.e. energies and simulation parameters in a .json format (one per snapshot)&lt;br&gt;
- dft_inputs: holds the inputs for the DFT calculations, in the form of a QE input file (one per snapshot)&lt;br&gt;
- models: holds five trained NN models for the data set&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2025-05-14</subfield>
  </datafield>
  <controlfield tag="005">20250515093529.0</controlfield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Boron data set for machine learning applications</subfield>
  </datafield>
</record>
262
41
views
downloads
All versions This version
Views 262262
Downloads 4141
Data volume 213.3 GB213.3 GB
Unique views 238238
Unique downloads 2525

Share

Cite as