• Home
  • New initiatives
  • Partners
  • Tools
  • About us
  • Blog
  • Opportunities
  • Contact

Computer Vision Resources: Fish Classification Datasets

  • Posted by Ellen Ditria
  • On June 17, 2020

by Sebastian Lopez

Computer vision is transforming the collection and processing of digital imagery for ecology and conservation. In aquatic environments, computer vision tools for automatic fish identification are heavily sought after, but robust and open-access fish datasets are hard to find. Here, I share some of the most used, open-access and updated fish datasets for automatic fish detection and classification.

 

Computer vision

Computer vision is the scientific field that develops and trains computers to understand and interpret objects from digital images or videos. Computer vision can automatically detect, count and even track objects from digital photos.

In ecology and conservation, computer vision is transforming information processing by quickly and accurately analysing the vast amount of digital imagery collected by researchers through previous decades.

 

Computer vision in aquatic ecosystems

In aquatic environments, one of the most sought-after computer vision tools is a platform that can automatically identify and detect fish species from underwater footage. Since cameras are becoming more common to monitor and study fish populations, researchers are developing tools that can reduce the tedious and time-consuming task of manually analysing the footage.

An automatic tool that streamlines the process and produces results more quickly than humans would be an important achievement for the monitoring of aquatic ecosystems.

 

Computer vision tools require data

Developing automatic tools for fish detection require LARGE amounts of data. Many aquatic researchers own enormous fish datasets; however, most of these datasets have

  1. restricted availability
  2. fish have not been manually annotated and labelled

 

What makes an excellent fish computer vision dataset?

A great dataset for a fish computer vision tool requires

  1. sufficient images of all the fish to be identified
  2. the user already knows the IDs of all the fish
  3. the fish have been outlined (mask or bounding box) along with its label
  4. the dataset has a variety of environmental conditions (i.e. high and low visibility footage)
An example image of masks outlining the fish. Image provided by the Moreton Bay Environmental Education Centre

While it is difficult to find datasets with these needs, here are some publically available datasets of fish imagery for computer vision tasks. These datasets have been used in several peer-reviewed computer vision/fish classification studies and can help the development of a fish computer vision tool.

 

Datasets

NOAA Fisheries datasets

  • Extensive datasets from 4 NOAA programs. Include still or video imagery of benthic fish and invertebrates across different locations, depths and backgrounds.

FishDataset

  • The dataset includes 3,960 images collected from 468 species across different backgrounds and illuminations.

OzFish

  • Extensive collection of ~80k fish crops and ~45k bounding box annotations for a wide variety of fish species.

Fish4Knowledge (Fish Detection)

  • A large and well-known ground-truth dataset with 1700 minutes of fish footage.

Fish4Knowledge (Fish Species Recognition)

  • Dataset used in LifeClef 2015 competition.
  • 20 manually annotated videos, 15 fish species to support the learning of fish recognition models.

DIDSON data for fish classification

  • Dual-Frequency Identification Sonar (DIDSON), fishery acoustic observation data of 8 fish species from the USA.

Fish-Pak

  • Fish-Pak: an image dataset of 6 different fish species, captured by a single camera from different pools located nearby the Head Qadirabad, Chenab River in Punjab, Pakistan.

Leave a comment if you know of any other computer vision fish datasets.

Follow the progress of #FishID; an automatic platform for fish species identification and abundance quantification through this blog or @seabassphd.

0 Comments

Recent Posts
  • Smarter monitoring for healthier oceans: How the GLOW team uses FishID
  • Mapping shellfish reefs in southeast Queensland for protection, management and restoration
  • New Paper: Integrating socioeconomic and ecological data into restoration practice
  • Fish AI Consortium Presentation: Rapid improvements in fisheries monitoring with underwater computer vision
  • Co-occurrence of ecosystem services to inform global mangrove conservation planning
Archives
  • May 2025
  • January 2025
  • December 2024
  • April 2024
  • February 2024
  • January 2024
  • December 2023
  • August 2023
  • November 2022
  • March 2022
  • February 2022
  • November 2021
  • August 2021
  • July 2021
  • June 2021
  • May 2021
  • April 2021
  • March 2021
  • January 2021
  • December 2020
  • October 2020
  • September 2020
  • July 2020
  • June 2020
  • May 2020
  • April 2020
  • January 2020
  • November 2019
  • October 2019
  • September 2019
  • July 2019
  • June 2019
  • May 2019
  • April 2019
  • March 2019
  • February 2019
  • January 2019
  • December 2018
Categories
  • Blue Carbon
  • Communication
  • Conference
  • Conservation
  • FishID projects
  • Jobs
  • News
  • Opinion
  • Research
  • Uncategorized

Building a sustainable future for seagrass ecosystems

Previous thumb

Data Science: The new language of ecologists

Next thumb
Scroll
PhD APPLICATIONS OPEN

Range of projects available with up to $15,000 funds for field work and collaborative travel.

Find out more

PARTNERING FOR CHANGE

GLOW is proud to be an active member of the Global Mangrove Alliance.

Check out the GMA website

@2018 Griffith University, CRICOS Provider - 00233E. Images: Tom Rayner, Anusha Rajkaran and via Creative Commons.
  • Home
  • New initiatives
  • Partners
  • Tools
  • About us
  • Blog
  • Opportunities
  • Contact