Close Menu
  • News
  • Industry
  • Solar Panels
  • Commercial
  • Residential
  • Finance
  • Technology
  • Carbon Credit
  • More
    • Policy
    • Energy Storage
    • Utility
    • Cummunity
What's Hot

Lithuanian grid operators can now shut down solar power plants without cybersecurity measures

June 8, 2026

Dutch solar owners asked to switch off during peak periods to ease the distribution crisis

June 7, 2026

The hydrogen flow: Toyota demonstrates its racing prototype on liquid hydrogen

June 7, 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Solar Energy News
Monday, June 8
  • News
  • Industry
  • Solar Panels
  • Commercial
  • Residential
  • Finance
  • Technology
  • Carbon Credit
  • More
    • Policy
    • Energy Storage
    • Utility
    • Cummunity
Solar Energy News
Home - Solar Industry - How to address imbalance datasets in solar panel dust detection
Solar Industry

How to address imbalance datasets in solar panel dust detection

solarenergyBy solarenergyMarch 5, 2026No Comments4 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email

Researchers from South Korea have improved dust detection from solar panels using SMOTE and stable diffusion (SD) augmentation, where SD increases detection accuracy from 76.5% to 98.9% while maintaining spatial realism.

March 4, 2026
Lior Kahana

A research group from South Korea Jeju National University addressed the imbalance of the datasets in photovoltaic modules by applying two complementary augmentation strategies. Specifically, they applied a synthetic minority oversampling technique (SMOTE) at the feature level and stable diffusion (SD) augmentation at the image level.

A dataset imbalance occurs when one class in a dataset contains many more examples than the other, causing models to favor the majority class and potentially miss rarer but important cases.

“Our study shows that diffusion-based augmentation significantly improves the detection of dusty panels in minority classes while maintaining robustness at the implementation level,” said corresponding author Raj Kumar. pv magazine. “Unlike previous works that relied solely on traditional oversampling or GAN-based augmentation, our study systematically compares strategies for reducing imbalances at the feature and image levels,” said Kumar. “We also introduced a two-stage synthetic image validation protocol, including manual screening plus FID, KID and perceptual hashing metrics, in addition to stratified 10-fold cross-validation and imbalance-sensitive metrics such as F1 score, Cohen’s κ and MCC.”

SMOTE is a commonly used method to address class imbalance in datasets. It works by randomly selecting samples from the minority class and generating new synthetic samples through combinations with their nearest neighbors. However, this process can disrupt spatial relationships, which are critical for image-based tasks such as dust detection on solar panels.

See also  Smart Solar Initiative to help agricultural communities

To address this limitation, the team investigated whether stable diffusion (SD) could be more effective. This artificial intelligence (AI) model starts with random noise, which is gradually transformed into an image guided by text prompts. The diffusion model iteratively removes noise until a realistic image of a dusty panel emerges. Generated images are then filtered to remove artifacts and duplicates, and the accepted synthetic images are included in the training dataset.

Stable diffusion

Image: Jeju National University, Energy Reports, CC BY 4.0

Both approaches – SMOTE and SD – were evaluated against each other, as well as against the original imbalanced data set. The team used publicly available materialSolNET dataset, containing polycrystalline silicon modules, labeled as clean or dusty. The dataset includes 842 images, originally unbalanced with 502 clean modules and 340 dusty modules. All images have been resized to 224 x 224 pixels.

Three convolutional neural network (CNN) models – VGG-16, ResNet50 and MobileNetV3 – were tested on both the extended balanced datasets and the unbalanced dataset. In all experiments, the data was split into 80% training and 20% testing, and the models were evaluated using matrices of accuracy, precision, recall, F1 score, and confusion.

“Several findings were particularly noteworthy,” Kumar said. “For example, ResNet50 improved from 76.53% accuracy on the unbalanced dataset to 98.87% using SD augmentation. Minority class detection was nearly perfect, with scores reaching 99%, and both Cohen’s κ and MCC exceeded 0.90.”

Kumar also highlighted that while SMOTE improved model performance, SD consistently produced superior results by maintaining spatial realism. “After training on the balanced datasets, the models maintained up to 98% accuracy even when tested on the original unbalanced dataset,” he noted. “These findings confirm that realistic image-level augmentation is critical for improving the performance of PV dust detection.”

See also  The Bannatyne group, Shawton Energy Partner for Solar on the roof

Looking ahead, the team is working to further improve their imbalance mitigation framework by integrating undersampling with oversampling techniques. “While this study focused on SD and SMOTE, our updated research introduces Tomek-Link undersampling to remove borderline samples of majority classes and reduce class overlap,” Kumar explains. “This hybrid approach improves decision boundary clarity, minimizes noise, and improves minority class detection more effectively than oversampling alone.”

The two methodologies were presented in “Mitigating imbalance in datasets using image-based stable diffusion and feature-level SMOTE for solar panel classification with CNNs”, published in Energy reports.

This content is copyrighted and may not be reused. If you would like to collaborate with us and reuse some of our content, please contact: editors@pv-magazine.com.

Popular content

Source link

address datasets detection dust imbalance panel solar
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
solarenergy
  • Website

Related Posts

Lithuanian grid operators can now shut down solar power plants without cybersecurity measures

June 8, 2026

Dutch solar owners asked to switch off during peak periods to ease the distribution crisis

June 7, 2026

Letter from China’s PV Industry: Arctech wins 2.1 GW solar deal

June 5, 2026
Leave A Reply Cancel Reply

Don't Miss
News

NESO rejects solar energy in the Clean Power 2030 advice

By solarenergyNovember 8, 20240

The report suggests several necessary system, market and infrastructure changes in line with two primary…

PV module production in Texas will exceed 15 GW by 2026

May 9, 2026

Britain to increase industrial energy price exemption to 90%

October 31, 2025

The future of AI with solar-powered synaptic devices

November 29, 2024
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo
Our Picks

Lithuanian grid operators can now shut down solar power plants without cybersecurity measures

June 8, 2026

Dutch solar owners asked to switch off during peak periods to ease the distribution crisis

June 7, 2026

The hydrogen flow: Toyota demonstrates its racing prototype on liquid hydrogen

June 7, 2026

Era of electrification exposing Australia’s weakest link

June 6, 2026
Our Picks

Lithuanian grid operators can now shut down solar power plants without cybersecurity measures

June 8, 2026

Dutch solar owners asked to switch off during peak periods to ease the distribution crisis

June 7, 2026

The hydrogen flow: Toyota demonstrates its racing prototype on liquid hydrogen

June 7, 2026
About
About

Stay updated with the latest in solar energy. Discover innovations, trends, policies, and market insights driving the future of sustainable power worldwide.

Subscribe to Updates

Get the latest creative news and updates about Solar industry directly in your inbox!

Facebook X (Twitter) Instagram Pinterest
  • Contact
  • Privacy Policy
  • Terms & Conditions
© 2026 Tsolarenergynews.co - All rights reserved.

Type above and press Enter to search. Press Esc to cancel.