Dataset Open Access

LDOS/SNAP data for MALA: Beryllium at 298K

Fiedler, Lenz; Cangi, Attila


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2022-02-18</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">LDOS/SNAP data for MALA: Beryllium at 298K</subfield>
  </datafield>
  <controlfield tag="001">1834</controlfield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">Fiedler, Lenz</subfield>
    <subfield code="u">HZDR / CASUS</subfield>
    <subfield code="0">(orcid)0000-0002-8311-0613</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="a">https://www.hzdr.de/publications/Publ-35016</subfield>
    <subfield code="i">isIdenticalTo</subfield>
    <subfield code="n">url</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="a">10.14278/rodare.1833</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="n">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-rodare</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.14278/rodare.1834</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="o">oai:rodare.hzdr.de:1834</subfield>
    <subfield code="p">openaire_data</subfield>
    <subfield code="p">user-rodare</subfield>
  </datafield>
  <controlfield tag="005">20230127125610.0</controlfield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">214946397</subfield>
    <subfield code="u">https://rodare.hzdr.de/record/1834/files/N1024.zip</subfield>
    <subfield code="z">md5:cfb934835556ca8d77cf18821b468971</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">29186428750</subfield>
    <subfield code="u">https://rodare.hzdr.de/record/1834/files/N128.zip</subfield>
    <subfield code="z">md5:54e49d566ef88d7897e74aaf1f7138b6</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">112066694</subfield>
    <subfield code="u">https://rodare.hzdr.de/record/1834/files/N2048.zip</subfield>
    <subfield code="z">md5:276663b3392d2b4e035af8e6c9264ed7</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">43680626268</subfield>
    <subfield code="u">https://rodare.hzdr.de/record/1834/files/N256.zip</subfield>
    <subfield code="z">md5:e8698b13beb3dbfbbe99f3dc35a4dd3d</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">46934764110</subfield>
    <subfield code="u">https://rodare.hzdr.de/record/1834/files/N512.zip</subfield>
    <subfield code="z">md5:5848cbc3a3ccc52ac07a00aab525c851</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">4310</subfield>
    <subfield code="u">https://rodare.hzdr.de/record/1834/files/README.md</subfield>
    <subfield code="z">md5:5eb3423b53f6e223deda06eacbf13cfd</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">2067</subfield>
    <subfield code="u">https://rodare.hzdr.de/record/1834/files/sample_inputs.zip</subfield>
    <subfield code="z">md5:7882af49ea82d1c715f954e8eb9e42a1</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;pre&gt;&lt;em&gt;&lt;strong&gt;Beryllium data set for Machine Learning applications&lt;/strong&gt;&lt;/em&gt;
&lt;/pre&gt;

&lt;p&gt; &lt;/p&gt;

&lt;p&gt;This dataset contains DFT inputs, outputs, LDOS data and fingerprint vectors for a beryllium cell at ambient conditions and varying sizes. Different levels of k-grid convergence were employed:&lt;br&gt;
 -&amp;nbsp; Gamma point (gamma_point)&lt;br&gt;
 -&amp;nbsp; total energy convergence (k-grid converged to within 1 meV/atom in the total energy difference, total_energy_convergence)&lt;br&gt;
 -&amp;nbsp; LDOS convergence (k-grid converged until the LDOS shows no unphysical oscillations, ldos_convergence)&lt;/p&gt;

&lt;p&gt; &lt;/p&gt;

&lt;p&gt;The data set contains a .zip file for each system size (see below), as well as one .zip file containing sample scripts for recalculation and preprocessing of data.&lt;br&gt;
 The cutoff energy was converged with respect to the energy and held fixed at 40 Ry for all three levels of k-grid convergence. Note that data were not generated for every type of k-grid at every unit cell size.&lt;/p&gt;

&lt;p&gt; &lt;/p&gt;

&lt;pre&gt;&lt;strong&gt;Authors:&lt;/strong&gt;

&lt;em&gt;- &lt;/em&gt;Fiedler, Lenz (HZDR / CASUS)
&lt;em&gt;- &lt;/em&gt;Cangi, Attila (HZDR / CASUS)

&lt;em&gt;Affiliations&lt;/em&gt;&lt;strong&gt;:&lt;/strong&gt;

HZDR - Helmholtz-Zentrum Dresden-Rossendorf

CASUS - Center for Advanced Systems Understanding

&lt;strong&gt;Dataset description&lt;/strong&gt;

&lt;em&gt;- &lt;/em&gt;Total size: 143 GB 
&lt;em&gt;- &lt;/em&gt;System: Be128, Be256, Be512, Be1024, Be2048
&lt;em&gt;- &lt;/em&gt;Temperature(s): 298K
&lt;em&gt;- &lt;/em&gt;Mass density(ies): 1.896 g/cc
&lt;em&gt;- &lt;/em&gt;Crystal Structure: hcp (material mp-87 in the Materials Project)
&lt;em&gt;- &lt;/em&gt;Number of atomic snapshots: 145
  &lt;em&gt;  - &lt;/em&gt;40 (Be128)
  &lt;em&gt;  - &lt;/em&gt;35 (Be256)
   &lt;em&gt;- &lt;/em&gt;30 (Be512)
   &lt;em&gt;- &lt;/em&gt;20 (Be1024)
   &lt;em&gt;- &lt;/em&gt;10 (Be2048)
&lt;em&gt;- &lt;/em&gt;Contents:
   &lt;em&gt;- &lt;/em&gt;ideal crystal structure: yes
  &lt;em&gt;  - &lt;/em&gt;MD trajectory: yes
  &lt;em&gt;  - &lt;/em&gt;Atomic positions: yes
   &lt;em&gt;- &lt;/em&gt;DFT inputs: yes
  &lt;em&gt;  - &lt;/em&gt;DFT outputs (energies): yes
  &lt;em&gt;  - &lt;/em&gt;SNAP vectors: yes (partially, see below)
      &lt;em&gt;  - &lt;/em&gt;dimensions: XxYxZx94 (last dimension: first three entries are x,y,z coordinates, data size is 91), where X, Y, Z are:
         &lt;em&gt;- &lt;/em&gt;Be128: 72x72x120 (size per file: 447MB)
         &lt;em&gt;- &lt;/em&gt;Be256: 144x72x120  (size per file: 893MB)
         &lt;em&gt;- &lt;/em&gt;Be512: 144x144x120 (size per file: 1.8GB)
      &lt;em&gt;  - &lt;/em&gt;units: a.u./Bohr
  &lt;em&gt;  - &lt;/em&gt;LDOS vectors: yes (partially, see below)
      &lt;em&gt;  - &lt;/em&gt;dimensions: XxYxZx250, where X, Y, Z are:
         &lt;em&gt;- &lt;/em&gt;Be128: 72x72x120 (size per file: 1.2GB)
         &lt;em&gt;- &lt;/em&gt;Be256: 144x72x120  (size per file: 2.4GB)
         &lt;em&gt;- &lt;/em&gt;Be512: 144x144x120 (size per file: 4.7GB)
      &lt;em&gt;  - &lt;/em&gt;units: 1/eV
      &lt;em&gt;- &lt;/em&gt;note: LDOS parameters are the same for all sizes of the unit cell
  &lt;em&gt;  - &lt;/em&gt;trained networks: no

&lt;strong&gt;Data generation&lt;/strong&gt;

Ideal crystal structures were obtained using the Materials Project. (https://materialsproject.org/materials/mp-87/)
DFT-MD calculations were performed using either QuantumESPRESSO (https://www.quantum-espresso.org/, QE, for Be128, Be256 and Be512) or the Vienna Ab initio Simulation Package (https://www.vasp.at/, VASP, for Be1024, Be2048). DFT calculations were performed using QuantumESPRESSO. 
For the VASP calculations, the standard VASP pseudopotentials were used. For QuantumESPRESSO, pslibrary was used (https://dalcorso.github.io/pslibrary/).
SNAP vectors were calculated using MALA (https://github.com/mala-project/mala) and its LAMMPS (https://github.com/mala-project/mala) interface. The LDOS was preprocessed using MALA as well.

&lt;strong&gt;Dataset structure&lt;/strong&gt;

The folder called &amp;quot;sample_inputs&amp;quot; is provided to show how MALA preprocessing and LDOS calculation have been performed. 
For each temperature/mass density/number of atoms, the following subfolders exist:

&lt;em&gt;- &lt;/em&gt;md_inputs: Input files for the MD simulations, either as QE or VASP file(s)
&lt;em&gt;- &lt;/em&gt;md_outputs: The MD trajectory plus a numpy array containing the temperatures at the individual time steps
&lt;em&gt;- &lt;/em&gt;gamma_point
&lt;em&gt;- &lt;/em&gt;total_energy_convergence
&lt;em&gt;- &lt;/em&gt;ldos_convergence

Each gamma_point/total_energy_convergence/ldos_convergence contains the following folders:

&lt;em&gt;- &lt;/em&gt;ldos: holds the LDOS vectors
&lt;em&gt;- &lt;/em&gt;fingerprints: holds the SNAP fingerprint vectors
&lt;em&gt;- &lt;/em&gt;snapshots: holds the atomic positions of the atomic snapshots for which DFT and LDOS calculations were performed (as .xyz files)
&lt;em&gt;- &lt;/em&gt;dft_outputs: holds the outputs from the DFT calculations, i.e. energies in the form of a QE output file
&lt;em&gt;- &lt;/em&gt;dft_inputs: holds the inputs for the DFT calculations, in the form of a QE input file

Please note that the numbering of the snapshots is contiguous per temperature/mass density/number of atoms, NOT within the k-grids themselves. 
Also, LDOS and fingerprint files have only been calculated for snapshots in the ldos_convergence 
folders. Therefore, no LDOS and fingerprint files have been calculated for the 1024- and 2048-atom systems.
&lt;/pre&gt;</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Cangi, Attila</subfield>
    <subfield code="u">HZDR / CASUS</subfield>
    <subfield code="0">(orcid)0000-0001-9162-262X</subfield>
  </datafield>
</record>
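
The 856 fields in the record above give, for each file, its URL (subfield u), its size in bytes (subfield s), and an MD5 checksum (subfield z). The following is a minimal Python sketch, using only the standard library, of how these entries could be read from the export and used to check a downloaded file. The names `SAMPLE_RECORD`, `list_files`, and `md5_of_file` are illustrative assumptions and are not part of RODARE or MALA; the sample embeds only two of the record's 856 fields.

```python
import hashlib
import xml.etree.ElementTree as ET

# Namespace used by the MARC21 "slim" export shown above.
MARC_NS = {"marc": "http://www.loc.gov/MARC21/slim"}

# Excerpt with two of the record's 856 fields (values copied from the export).
SAMPLE_RECORD = """<record xmlns="http://www.loc.gov/MARC21/slim">
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">4310</subfield>
    <subfield code="u">https://rodare.hzdr.de/record/1834/files/README.md</subfield>
    <subfield code="z">md5:5eb3423b53f6e223deda06eacbf13cfd</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">2067</subfield>
    <subfield code="u">https://rodare.hzdr.de/record/1834/files/sample_inputs.zip</subfield>
    <subfield code="z">md5:7882af49ea82d1c715f954e8eb9e42a1</subfield>
  </datafield>
</record>"""


def list_files(marc_xml):
    """Return one dict (url, size in bytes, md5 hex digest) per 856 field."""
    root = ET.fromstring(marc_xml)
    files = []
    for field in root.findall("marc:datafield[@tag='856']", MARC_NS):
        # Map subfield codes ("s", "u", "z") to their text content.
        sub = {s.get("code"): (s.text or "")
               for s in field.findall("marc:subfield", MARC_NS)}
        files.append({
            "url": sub["u"],
            "size": int(sub["s"]),
            # Checksums appear as "md5:<hex>"; keep only the hex part.
            "md5": sub["z"].split("md5:", 1)[-1],
        })
    return files


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a local file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


if __name__ == "__main__":
    for entry in list_files(SAMPLE_RECORD):
        print(entry["url"], entry["size"], entry["md5"])
```

After downloading a file, comparing `md5_of_file(local_path)` with the `md5` value listed for it indicates whether the transfer completed intact. Reading in chunks keeps memory use flat, which matters for the multi-gigabyte archives in this record.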