<?xml version='1.0' encoding='utf-8'?> <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:adms="http://www.w3.org/ns/adms#" xmlns:cnt="http://www.w3.org/2011/content#" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dct="http://purl.org/dc/terms/" xmlns:dctype="http://purl.org/dc/dcmitype/" xmlns:dcat="http://www.w3.org/ns/dcat#" xmlns:duv="http://www.w3.org/ns/duv#" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:frapo="http://purl.org/cerif/frapo/" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:gsp="http://www.opengis.net/ont/geosparql#" xmlns:locn="http://www.w3.org/ns/locn#" xmlns:org="http://www.w3.org/ns/org#" xmlns:owl="http://www.w3.org/2002/07/owl#" xmlns:prov="http://www.w3.org/ns/prov#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xmlns:schema="http://schema.org/" xmlns:skos="http://www.w3.org/2004/02/skos/core#" xmlns:vcard="http://www.w3.org/2006/vcard/ns#" xmlns:wdrs="http://www.w3.org/2007/05/powder-s#"> <rdf:Description rdf:about="https://doi.org/10.14278/rodare.1834"> <rdf:type rdf:resource="http://www.w3.org/ns/dcat#Dataset"/> <dct:type rdf:resource="http://purl.org/dc/dcmitype/Dataset"/> <dct:identifier rdf:datatype="http://www.w3.org/2001/XMLSchema#anyURI">https://doi.org/10.14278/rodare.1834</dct:identifier> <foaf:page rdf:resource="https://doi.org/10.14278/rodare.1834"/> <dct:creator> <rdf:Description rdf:about="http://orcid.org/0000-0002-8311-0613"> <rdf:type rdf:resource="http://xmlns.com/foaf/0.1/Agent"/> <foaf:name>Fiedler, Lenz</foaf:name> <foaf:givenName>Lenz</foaf:givenName> <foaf:familyName>Fiedler</foaf:familyName> <org:memberOf> <foaf:Organization> <foaf:name>HZDR / CASUS</foaf:name> </foaf:Organization> </org:memberOf> </rdf:Description> </dct:creator> <dct:creator> <rdf:Description rdf:about="http://orcid.org/0000-0001-9162-262X"> <rdf:type rdf:resource="http://xmlns.com/foaf/0.1/Agent"/> <foaf:name>Cangi, Attila</foaf:name> <foaf:givenName>Attila</foaf:givenName> 
<foaf:familyName>Cangi</foaf:familyName> <org:memberOf> <foaf:Organization> <foaf:name>HZDR / CASUS</foaf:name> </foaf:Organization> </org:memberOf> </rdf:Description> </dct:creator> <dct:title>LDOS/SNAP data for MALA: Beryllium at 298K</dct:title> <dct:publisher> <foaf:Agent> <foaf:name>Rodare</foaf:name> </foaf:Agent> </dct:publisher> <dct:issued rdf:datatype="http://www.w3.org/2001/XMLSchema#gYear">2022</dct:issued> <dct:issued rdf:datatype="http://www.w3.org/2001/XMLSchema#date">2022-02-18</dct:issued> <owl:sameAs rdf:resource="https://rodare.hzdr.de/record/1834"/> <adms:identifier> <adms:Identifier> <skos:notation rdf:datatype="http://www.w3.org/2001/XMLSchema#anyURI">https://rodare.hzdr.de/record/1834</skos:notation> </adms:Identifier> </adms:identifier> <owl:sameAs rdf:resource="https://www.hzdr.de/publications/Publ-35016"/> <dct:isVersionOf rdf:resource="https://doi.org/10.14278/rodare.1833"/> <dct:isPartOf rdf:resource="https://rodare.hzdr.de/communities/rodare"/> <owl:versionInfo>1.1.0</owl:versionInfo> <dct:description><pre><em><strong>Beryllium data set for Machine Learning applications</strong></em> </pre> <p> </p> <p>This dataset contains DFT inputs, outputs, LDOS data and fingerprint vectors for beryllium cells of varying sizes at ambient conditions. Different levels of k-grid convergence were employed:<br> -&nbsp; Gamma point (gamma_point)<br> -&nbsp; total energy convergence (k-grid converged to within 1 meV/atom in the total energy, total_energy_convergence)<br> -&nbsp; LDOS convergence (k-grid converged until the LDOS shows no unphysical oscillations, ldos_convergence)</p> <p> </p> <p>The data set contains a .zip file for each system size (see below), as well as one .zip file containing sample scripts for recalculation and preprocessing of data.<br> The cutoff energy was converged with respect to the total energy and held fixed at 40 Ry for all three levels of k-grid convergence. 
Note that data were not generated for every type of k-grid at every unit cell size.</p> <p> </p> <pre><strong>Authors:</strong> <em>- </em>Fiedler, Lenz (HZDR / CASUS) <em>- </em>Cangi, Attila (HZDR / CASUS) <em>Affiliations</em><strong>:</strong> HZDR - Helmholtz-Zentrum Dresden-Rossendorf CASUS - Center for Advanced Systems Understanding <strong>Dataset description</strong> <em>- </em>Total size: 143 GB <em>- </em>System: Be128, Be256, Be512, Be1024, Be2048 <em>- </em>Temperature(s): 298K <em>- </em>Mass density(ies): 1.896 g/cc <em>- </em>Crystal Structure: hcp (material mp-87 in the Materials Project) <em>- </em>Number of atomic snapshots: 145 <em> - </em>40 (Be128) <em> - </em>35 (Be256) <em>- </em>30 (Be512) <em>- </em>20 (Be1024) <em>- </em>10 (Be2048) <em>- </em>Contents: <em>- </em>ideal crystal structure: yes <em> - </em>MD trajectory: yes <em> - </em>Atomic positions: yes <em>- </em>DFT inputs: yes <em> - </em>DFT outputs (energies): yes <em> - </em>SNAP vectors: yes (partially, see below) <em> - </em>dimensions: XxYxZx94 (in the last dimension, the first three entries are the x, y, z coordinates and the remaining 91 entries are the data), where X, Y, Z are: <em>- </em>Be128: 72x72x120 (size per file: 447MB) <em>- </em>Be256: 144x72x120 (size per file: 893MB) <em>- </em>Be512: 144x144x120 (size per file: 1.8GB) <em> - </em>units: a.u./Bohr <em> - </em>LDOS vectors: yes (partially, see below) <em> - </em>dimensions: XxYxZx250, where X, Y, Z are: <em>- </em>Be128: 72x72x120 (size per file: 1.2GB) <em>- </em>Be256: 144x72x120 (size per file: 2.4GB) <em>- </em>Be512: 144x144x120 (size per file: 4.7GB) <em> - </em>units: 1/eV <em>- </em>note: LDOS parameters are the same for all sizes of the unit cell <em> - </em>trained networks: no <strong>Data generation</strong> Ideal crystal structures were obtained using the Materials Project. 
(https://materialsproject.org/materials/mp-87/) DFT-MD calculations were performed using either QuantumESPRESSO (https://www.quantum-espresso.org/, QE, for Be128, Be256 and Be512) or the Vienna Ab initio Simulation Package (https://www.vasp.at/, VASP, for Be1024, Be2048). DFT calculations were performed using QuantumESPRESSO. For the VASP calculations, the standard VASP pseudopotentials were used. For Quantum Espresso, pslibrary was used (https://dalcorso.github.io/pslibrary/). SNAP vectors were calculated using MALA (https://github.com/mala-project/mala) and its LAMMPS (https://github.com/mala-project/mala) interface. The LDOS was preprocessed using MALA as well. <strong>Dataset structure</strong> The folder called &quot;sample_inputs&quot; is provided to show how MALA preprocessing and LDOS calculation have been performed. For each temperature/mass density/number of atoms, the following subfolders exist: <em>- </em>md_inputs: Input files for the MD simulations, either as QE or VASP file(s) <em>- </em>md_outputs: The MD trajectory plus a numpy array containing the temperatures at the individual time steps <em>- </em>gamma_point <em>- </em>total_energy_convergence <em>- </em>ldos_convergence Each gamma_point/total_energy_convergence/ldos_convergence contains the following folders: <em>- </em>ldos: holds the LDOS vectors <em>- </em>fingerprints: holds the SNAP fingerprint vectors <em>- </em>snapshots: holds the atomic positions of the atomic snapshots for which DFT and LDOS calculations were performed (as .xyz files) <em>- </em>dft_outputs: holds the outputs from the DFT calculations, i.e. energies in the form of a QE output file <em>- </em>dft_inputs: holds the inputs for the DFT calculations, in the form of a QE input file Please note that the numbering of the snapshots is contiguous per temperature/mass density/number of atoms, NOT within the k-grids themselves. 
Also, LDOS and fingerprint files have only been calculated for snapshots in the ldos_convergence folders. Therefore, no LDOS and fingerprint files have been calculated for the 1024 and 2048 atom systems. </pre></dct:description> <dct:accessRights rdf:resource="http://publications.europa.eu/resource/authority/access-right/PUBLIC"/> <dct:accessRights> <dct:RightsStatement rdf:about="info:eu-repo/semantics/openAccess"> <rdfs:label>Open Access</rdfs:label> </dct:RightsStatement> </dct:accessRights> <dcat:distribution> <dcat:Distribution> <dct:rights> <dct:RightsStatement rdf:about="https://creativecommons.org/licenses/by/4.0/legalcode"> <rdfs:label>Creative Commons Attribution 4.0 International</rdfs:label> </dct:RightsStatement> </dct:rights> <dcat:accessURL rdf:resource="https://doi.org/10.14278/rodare.1834"/> </dcat:Distribution> </dcat:distribution> </rdf:Description> </rdf:RDF>
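The dataset description lists grid dimensions and per-file sizes for the SNAP (XxYxZx94) and LDOS (XxYxZx250) arrays. The following minimal Python sketch shows how those shapes translate into raw array sizes and how one 94-entry SNAP vector splits into coordinates and descriptor data. It assumes 8-byte floating-point values, which the record does not state; the names `GRIDS`, `array_bytes`, `to_mib` and `split_snap_vector` are hypothetical helpers introduced for illustration and are not part of MALA.

```python
from math import prod

# Grid dimensions (X, Y, Z) as listed in the dataset description.
GRIDS = {
    "Be128": (72, 72, 120),
    "Be256": (144, 72, 120),
    "Be512": (144, 144, 120),
}

SNAP_CHANNELS = 94   # 3 coordinates (x, y, z) + 91 data entries
LDOS_CHANNELS = 250  # LDOS values per grid point
BYTES_PER_VALUE = 8  # ASSUMPTION: float64 storage, not stated in the record


def array_bytes(grid, channels, itemsize=BYTES_PER_VALUE):
    """Raw size in bytes of an array of shape (X, Y, Z, channels)."""
    return prod(grid) * channels * itemsize


def to_mib(n_bytes):
    """Convert a byte count to mebibytes (2**20 bytes)."""
    return n_bytes / 2**20


def split_snap_vector(vector):
    """Split one 94-entry SNAP vector into (coordinates, descriptor data)."""
    if len(vector) != SNAP_CHANNELS:
        raise ValueError(f"expected {SNAP_CHANNELS} entries, got {len(vector)}")
    return list(vector[:3]), list(vector[3:])


if __name__ == "__main__":
    for name, grid in GRIDS.items():
        snap = to_mib(array_bytes(grid, SNAP_CHANNELS))
        ldos = to_mib(array_bytes(grid, LDOS_CHANNELS))
        print(f"{name}: SNAP {snap:.1f} MiB, LDOS {ldos:.1f} MiB")
```

Under the float64 assumption, the Be128 SNAP array comes to roughly 446 MiB of raw data, which is close to the 447MB per file given in the description; any file header or different storage format would shift these numbers.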
Metric | All versions | This version |
---|---|---|
Views | 1,675 | 1,675 |
Downloads | 5,457 | 5,457 |
Data volume | 135.1 TB | 135.1 TB |
Unique views | 704 | 704 |
Unique downloads | 342 | 342 |