Band Gap Estimator

Predict electronic band gaps from crystal structures

Oxide materials - AnBnOn CIF Machine Learning

This tool provides approximate band gap estimates for uploaded crystal structures (CIF format). Key features:

  • Quick prediction of electronic band gaps
  • Developed for oxide materials
  • Requires complete CIF files with formula units
For research-grade accuracy, consider our premium reports.
Try it!
For More Accurate Results

Our premium reports offer detailed band gap analysis using advanced computational methods. Available in the Pricing section.

OxyHopper

Analyze oxygen migration pathways

Oxygen migration Diffusion analysis Void mapping

This tool visualizes and analyzes oxygen ion migration pathways in crystal structures (CIF format). Key features:

  • Determination of migration pathway dimensionality
  • Calculation of void and channel sizes
  • Identification of potential diffusion bottlenecks
For other ions, see BatteryMaterials.
Try it!
For More Accurate Results

Our premium reports offer detailed oxygen migration pathways analysis. Available in the Pricing section.

File Requirements

Band Gap Estimator
  • CIF file must contain complete crystal structure information
  • Must include number of formula units (_cell_formula_units_Z)
  • Best results for AnBnOn structures (training dataset focus)
OxyHopper
  • CIF file must contain complete crystal structure information
  • Must include number of formula units (_cell_formula_units_Z)
  • Supports only oxide materials

Abstract

The materials-analayzer.info service is designed for the rapid assessment of the electrical conductivity of materials with the composition AnBnOn (where O - oxygen, A and B - metals, and n - any integer), thereby determining the applicability of a potential conductive material in electrochemical devices, such as: solid oxide fuel cells and all-solid-state metal-ion batteries. The assessment of electrical conductivity is based on predicting the band gap using developed machine learning models. The service also offers an OxyHopper tool for analyzing the oxygen ion migration pathways in crystal structures.

1. Description

The band gap (Eg) is a fundamental property that is directly related to a material's potential use in optical, electronic, and energy applications. By controlling a material's composition or structure, its Eg can be tuned, enabling the creation of materials suited for specific applications. Selecting such materials and evaluating their conductive properties is a complex challenge, which has recently been addressed through computational modeling. However, these methods are unsuitable for the large-scale screening of compound databases due to their high computational cost. An optimal solution to this problem is the use of machine learning methods, which can predict physical properties in cases where the target property cannot be determined directly without significant experimental or computational resources.

The developed service predicts Eg values with high accuracy, thereby determining the suitability of a potential conductive material for the aforementioned electrochemical devices. For the rapid assessment of Eg, a machine learning model was used, trained on data from The Materials Project, AFLOW, and OQMD databases for structures with the composition AnBnOn (where O - oxygen, A and B - metals, and n - any integer). The training utilized over 12000 compounds, with a test set comprising more than 1300 compounds.

2. Tools

2.1 Band Gap Estimator

The prediction of Eg is based on computable features and reference data. The calculation of geometric features for crystal structures utilizes Voronoi partitioning. The ionicity degrees of chemical bonds are determined based on the electronegativity values of the elements. Using data on the number of outer-shell electrons, atomic masses of the elements, and the unit cell volume of the crystal, the electron density values for the structures are calculated.

2.2 OxyHopper

In this section, the search for possible interstitial sites for oxygen during its diffusion is performed. This search is also based on partitioning the crystal space into Voronoi polyhedra.

3. Input files

The service can only process a CIF file as input, which must contain information about the crystal structure, including the number of formula units (_cell_formula_units_Z). The Drag-and-Drop technology implemented in the service allows users to easily upload files to the site by dragging and dropping them into a dedicated form.

4. Output files

The Band Gap Estimator tool generates a report in *.pdf format. This report presents the values of descriptors calculated from the user-uploaded structure and the predicted band gap value.

OxyHopper generates a report in *.pdf format, which provides the concentration of interstitial oxygen atoms and the ionicity degrees of the oxygen bond with each metal, as well as a *.cif file. This CIF file contains the original crystal structure information along with additional atoms marked as H, representing the interstitial sites.

5. Errors, limitations, and troubleshooting recommendations

  • Unsupported file format. The file containing crystal structure data must be in *.cif format with the number of formula units specified.
  • Research Objects. The study focuses on ionic structures containing only "cation-anion" bonds.
  • User perceives the Eg estimation as incorrect. Please verify that your structure is not in the primitive P1 symmetry, as symmetry was one of the significant descriptors used in training the model. To increase symmetry, the user can utilize the AFLOW service or Python libraries such as pymatgen or ASE.
  • The structure is disordered. In this case, the service may generate an error and fail to process the compound. It is necessary to create a pseudo-ordered supercell that corresponds to the given composition. This can be done using, for example, the SUPERCELL program.

6. Copyright

Certificate of State Registration of the Computer Program No. 2025680575 dated August 7, 2025. "Materials Analyzer" - assessment of ionic and electronic conductivity in solid-state materials.

Rightsholder: LIMITED LIABILITY COMPANY "ANALIZ MATERIALOV" (RU).

Authors: Morkhova Yelizaveta Aleksandrovna (RU), Smolkov Mikhail Igorevich (RU), Kabanov Artem Anatolyevich (RU), Osipov Vladislav Timofeevich (RU), Blatov Vladislav Anatolyevich (RU).

Theoretical Validation

Our methods show consistent agreement with experimental studies across various material classes:

Published Validations:
Learn More About Our Methods

For detailed information about our theoretical approaches and software packages, please read DOI: 10.1016/B978-0-12-823144-9.00062-5.

Academic Use

When using results from this service in publications or research, please include a citation reference to this page:

Band Gap Estimator
APA (7th Edition)
LLC "Materials Analyzer". (2025). Materials Analyzer - Band Gap Estimation. https://materials-analyzer.info
MLA (9th Edition)
Materials Analyzer - Band Gap Estimation. materials-analyzer.info. Accessed XX Month 2025.
Chicago Style (Notes)
Materials Analyzer - Band Gap Estimation, https://materials-analyzer.info (accessed Month XX, 2025)
Russian ГОСТ style
Анализ Материалов. Предсказание ширины запрещенной зоны [Электронный ресурс]. — Режим доступа: https://materials-analyzer.info (дата обращения: dd.mm.yyyy).
OxyHopper
APA (7th Edition)
LLC "Materials Analyzer". (2025). Materials Analyzer - OxyHopper. https://materials-analyzer.info
MLA (9th Edition)
Materials Analyzer - OxyHopper. materials-analyzer.info. Accessed XX Month 2025.
Chicago Style (Notes)
Materials Analyzer - OxyHopper, https://materials-analyzer.info (accessed Month XX, 2025)
Russian ГОСТ style
Анализ Материалов. Анализ миграций кислорода [Электронный ресурс]. — Режим доступа: https://materials-analyzer.info (дата обращения: dd.mm.yyyy).

We appreciate your support in acknowledging our service.

Dataset used in Band Gap prediction.

Number of materials of each major computational materials databases used to create this dataset.

About Our Machine Learning Model

The prediction model was specifically trained on AnBnOn structures. Results for other material classes may not be accurately predicted.

About the Databases
Materials Project

An open database of computed materials properties designed to accelerate materials innovation by providing researchers with powerful tools for materials design.

Horton, M.K., Huck, P., Yang, R.X. et al. Accelerated data-driven materials science with the Materials Project. Nat. Mater. (2025)

ML models quality metrics on training data (90% of all structures) from the dataset.

Band Gap Estimator for Classification
Accuracy
98.219%

Proportion of correctly predicted instances

Precision
98.251%

Ratio of true positives to all predicted positives

Recall
97.709%

Ratio of true positives to all actual positives

F1-Score
97.979%

Harmonic mean of precision and recall

Band Gap Estimator for Regression
R² Score
0.9507

Proportion of variance explained by the model

RMSE
0.2695

Root Mean Square Error - standard deviation of residuals

MSE
0.0726

Mean Square Error - average squared difference

Model Performance Summary
Classification Model

The Band Gap Estimator for Classification demonstrates excellent performance across all metrics with accuracy at 98.219%. The high precision (98.251%) and recall (97.709%) values indicate a well-balanced model with strong predictive capabilities.

Regression Model

The Band Gap Estimator for Regression shows strong performance with an R² value of 0.9507, indicating that the model explains approximately 95% of the variance in the target variable. The low RMSE (0.2695) and MSE (0.0726) values confirm the model's accuracy in predictions.

Both models are performing exceptionally well.

ML models quality metrics on testing data (10% of all structures) from the dataset.

Band Gap Estimator for Classification
Accuracy
88.579%

Proportion of correctly predicted instances

Precision
86.569%

Ratio of true positives to all predicted positives

Recall
83.794%

Ratio of true positives to all actual positives

F1-Score
85.681%

Harmonic mean of precision and recall

Band Gap Estimator for Regression
R² Score
0.9026

Proportion of variance explained by the model

RMSE
0.37947

Root Mean Square Error - standard deviation of residuals

MSE
0.14724

Mean Square Error - average squared difference

Model Performance Summary
Classification Model

The Band Gap Estimator for Classification demonstrates great performance across all metrics with accuracy at 88.579%. The high precision (86.569%) and recall (83.794%) values indicate a well-balanced model with high predictive capabilities.

Regression Model

The Band Gap Estimator for Regression shows strong performance with an R² value of 0.9026, indicating that the model explains approximately 90% of the variance in the target variable. The low RMSE (0.37947) and MSE (0.014724) values confirm the model's accuracy in predictions.

Both models are performing well.