Close Menu
  • News
  • Industry
  • Solar Panels
  • Commercial
  • Residential
  • Finance
  • Technology
  • Carbon Credit
  • More
    • Policy
    • Energy Storage
    • Utility
    • Cummunity
What's Hot

A deep learning model tracks the status of the EV battery with high precision

March 6, 2026

Mitsubishi Electric Trane announces new heat pump line for hydronic heating – SPE

March 6, 2026

Origis is developing a 413 MW solar portfolio in West Texas

March 6, 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Solar Energy News
Friday, March 6
  • News
  • Industry
  • Solar Panels
  • Commercial
  • Residential
  • Finance
  • Technology
  • Carbon Credit
  • More
    • Policy
    • Energy Storage
    • Utility
    • Cummunity
Solar Energy News
Home - Solar Industry - How to address imbalance datasets in solar panel dust detection
Solar Industry

How to address imbalance datasets in solar panel dust detection

solarenergyBy solarenergyMarch 5, 2026No Comments4 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email

Researchers from South Korea have improved dust detection from solar panels using SMOTE and stable diffusion (SD) augmentation, where SD increases detection accuracy from 76.5% to 98.9% while maintaining spatial realism.

March 4, 2026
Lior Kahana

A research group from South Korea Jeju National University addressed the imbalance of the datasets in photovoltaic modules by applying two complementary augmentation strategies. Specifically, they applied a synthetic minority oversampling technique (SMOTE) at the feature level and stable diffusion (SD) augmentation at the image level.

A dataset imbalance occurs when one class in a dataset contains many more examples than the other, causing models to favor the majority class and potentially miss rarer but important cases.

“Our study shows that diffusion-based augmentation significantly improves the detection of dusty panels in minority classes while maintaining robustness at the implementation level,” said corresponding author Raj Kumar. pv magazine. “Unlike previous works that relied solely on traditional oversampling or GAN-based augmentation, our study systematically compares strategies for reducing imbalances at the feature and image levels,” said Kumar. “We also introduced a two-stage synthetic image validation protocol, including manual screening plus FID, KID and perceptual hashing metrics, in addition to stratified 10-fold cross-validation and imbalance-sensitive metrics such as F1 score, Cohen’s κ and MCC.”

SMOTE is a commonly used method to address class imbalance in datasets. It works by randomly selecting samples from the minority class and generating new synthetic samples through combinations with their nearest neighbors. However, this process can disrupt spatial relationships, which are critical for image-based tasks such as dust detection on solar panels.

See also  The US utility solar market has grown 34% since the second quarter of 2023

To address this limitation, the team investigated whether stable diffusion (SD) could be more effective. This artificial intelligence (AI) model starts with random noise, which is gradually transformed into an image guided by text prompts. The diffusion model iteratively removes noise until a realistic image of a dusty panel emerges. Generated images are then filtered to remove artifacts and duplicates, and the accepted synthetic images are included in the training dataset.

Stable diffusion

Image: Jeju National University, Energy Reports, CC BY 4.0

Both approaches – SMOTE and SD – were evaluated against each other, as well as against the original imbalanced data set. The team used publicly available materialSolNET dataset, containing polycrystalline silicon modules, labeled as clean or dusty. The dataset includes 842 images, originally unbalanced with 502 clean modules and 340 dusty modules. All images have been resized to 224 x 224 pixels.

Three convolutional neural network (CNN) models – VGG-16, ResNet50 and MobileNetV3 – were tested on both the extended balanced datasets and the unbalanced dataset. In all experiments, the data was split into 80% training and 20% testing, and the models were evaluated using matrices of accuracy, precision, recall, F1 score, and confusion.

“Several findings were particularly noteworthy,” Kumar said. “For example, ResNet50 improved from 76.53% accuracy on the unbalanced dataset to 98.87% using SD augmentation. Minority class detection was nearly perfect, with scores reaching 99%, and both Cohen’s κ and MCC exceeded 0.90.”

Kumar also highlighted that while SMOTE improved model performance, SD consistently produced superior results by maintaining spatial realism. “After training on the balanced datasets, the models maintained up to 98% accuracy even when tested on the original unbalanced dataset,” he noted. “These findings confirm that realistic image-level augmentation is critical for improving the performance of PV dust detection.”

See also  Water -based electrically conductive adhesives for ranged solar modules

Looking ahead, the team is working to further improve their imbalance mitigation framework by integrating undersampling with oversampling techniques. “While this study focused on SD and SMOTE, our updated research introduces Tomek-Link undersampling to remove borderline samples of majority classes and reduce class overlap,” Kumar explains. “This hybrid approach improves decision boundary clarity, minimizes noise, and improves minority class detection more effectively than oversampling alone.”

The two methodologies were presented in “Mitigating imbalance in datasets using image-based stable diffusion and feature-level SMOTE for solar panel classification with CNNs”, published in Energy reports.

This content is copyrighted and may not be reused. If you would like to collaborate with us and reuse some of our content, please contact: editors@pv-magazine.com.

Popular content

Source link

address datasets detection dust imbalance panel solar
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
solarenergy
  • Website

Related Posts

Origis is developing a 413 MW solar portfolio in West Texas

March 6, 2026

New Jersey expands state community solar program by 3 GW

March 6, 2026

Oleic acid anti-pollution coating for solar panels – SPE

March 5, 2026
Leave A Reply Cancel Reply

Don't Miss
Cummunity

Encore Renewable Energy is breaking ground on its first solar project in New York

By solarenergySeptember 5, 20240

Encore Renewable Energy has announced that it has begun construction on Constable Solar, the developer’s…

Agrivoltaic installations postpone harvest, improve wine quality – SPE

September 18, 2024

RWE commissions a 100 MW hydrogen electrolyzer – SPE

January 9, 2026

‘True leadership comes from listening’ – SPE

January 17, 2026
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo
Our Picks

A deep learning model tracks the status of the EV battery with high precision

March 6, 2026

Mitsubishi Electric Trane announces new heat pump line for hydronic heating – SPE

March 6, 2026

Origis is developing a 413 MW solar portfolio in West Texas

March 6, 2026

New Jersey expands state community solar program by 3 GW

March 6, 2026
Our Picks

A deep learning model tracks the status of the EV battery with high precision

March 6, 2026

Mitsubishi Electric Trane announces new heat pump line for hydronic heating – SPE

March 6, 2026

Origis is developing a 413 MW solar portfolio in West Texas

March 6, 2026
About
About

Stay updated with the latest in solar energy. Discover innovations, trends, policies, and market insights driving the future of sustainable power worldwide.

Subscribe to Updates

Get the latest creative news and updates about Solar industry directly in your inbox!

Facebook X (Twitter) Instagram Pinterest
  • Contact
  • Privacy Policy
  • Terms & Conditions
© 2026 Tsolarenergynews.co - All rights reserved.

Type above and press Enter to search. Press Esc to cancel.