Abstract
Plankton plays a vital role in sustaining life on Earth as it forms the foundation of the aquatic food chain. Additionally, it is responsible for producing approximately half of the oxygen in the Earth's atmosphere. Understanding the distribution of plankton is crucial in studying climate change and monitoring water quality. This study utilized a dataset comprising 30,336 plankton images belonging to 121 different classes. State-of-the-art deep learning methods were employed to automatically label the plankton species, while alternative training approaches, data augmentation, and preprocessing techniques were compared to assess their impact. The study analyzed the similarities between and within plankton classes, as well as the classes that were frequently misidentified. By training models on networks from two different versions of the Kaggle dataset, the highest success rate achieved was 78% on the larger dataset with 118 classes, and 92% on the smaller dataset with 38 classes using individual models. These results represent a 1% improvement over the previously reported best-performing single model classifiers. Furthermore, by employing an averaging ensemble method, the performance was further enhanced by 2% in the first dataset and 1% in the second dataset. These findings demonstrate the efficacy of the proposed approach in accurately identifying plankton species and highlight the potential for improving classification results through ensemble techniques. Also, we address the unbalance and the varying input sizes within the Kaggle plankton dataset. We propose alternative data balancing methods and investigate three pre-processing techniques. Then we compare several state-of-the-art deep learning algorithms and show the improvement in classification rates with ensemble methods. Our results have significantly improved the best results reported in the literature over the single best models.
| Original language | English |
|---|---|
| Title of host publication | SPA 2023 - Signal Processing |
| Subtitle of host publication | Algorithms, Architectures, Arrangements, and Applications, Conference Proceedings |
| Publisher | IEEE Computer Society |
| Pages | 118-123 |
| Number of pages | 6 |
| ISBN (Electronic) | 9798350304985 |
| DOIs | |
| Publication status | Published - 2023 |
| Event | 26th IEEE Signal Processing: Algorithms, Architectures, Arrangements, and Applications, SPA 2023 - Poznan, Poland Duration: 20 Sept 2023 → 22 Sept 2023 |
Publication series
| Name | Signal Processing - Algorithms, Architectures, Arrangements, and Applications Conference Proceedings, SPA |
|---|---|
| Volume | 2023-September |
| ISSN (Print) | 2326-0262 |
| ISSN (Electronic) | 2326-0319 |
Conference
| Conference | 26th IEEE Signal Processing: Algorithms, Architectures, Arrangements, and Applications, SPA 2023 |
|---|---|
| Country/Territory | Poland |
| City | Poznan |
| Period | 20/09/23 → 22/09/23 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 13 Climate Action
Keywords
- CNN
- Deep Learning
- Plankton Classification
Fingerprint
Dive into the research topics of 'Plankton Classification with Deep Learning'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver