Hsuan-Tien Lin

Home | MOOCs | AIsk | Courses | Research Group | Awards | Publications | Presentations | Programs/Data


Dataset of Tropical Cyclone for Image-to-intensity Regression (TCIR)

Boyo Chen, BuoFu Chen and Hsuan-Tien Lin

TCIR collects Tropical Cyclone (TC) data from 4 channels of satellite images. TCIR aims to act as a benchmark dataset to help data scientists fairly evaluate the performance of TC intensity prediction models.

illustrate

Data Statistics

Region #TCs #Frames
Atlantic 235 13707
West Pacific 379 20061
East Pacific 247 13615
Central Pacific 19 1479
Indian Ocean 75 3205
Southern Hemisphere 330 18434
Total 1285 70501

Frames

Sources

Satellite observations comprising TCIR are collected from two open sources:

Labels

We used the best-tracks from Joint Typhoon Warning Center (JTWC) for TCs in western North Pacific (WP); the best-tracks from the revised Atlantic hurricane database (HURDAT2) for TCs in eastern North Pacific (EP) and Atlantic Ocean (AL) from 2003 to 2016.

The TC information provided in TCIR includes:

Note that these values are tuned and finalized afterward based on all observation that is available. Thus, they are very different from the real-time estimations in meaning. While the best-track information can be taken as ground truth, they are still some "estimation" in nature and can suffer from some inherent noise.

In addition to the intensity, we also provided another remarkable TC structure parameter, the size. TC size is closely related to the impacts on the economy/society from a TC. We encourage the community to tackle the TC size prediction task

Usage

We provide an HDF5 format file for people to easily access the whole organized dataset.

Example: Loading TCIR dataset with python.

import numpy as np
import pandas as pd
import h5py

data_path = "TCIR.h5"

# load "info" as pandas dataframe
data_info = pd.read_hdf(data_path, key="info", mode='r')

# load "matrix" as numpy ndarray, this could take longer times
with h5py.File(data_path, 'r') as hf:
    data_matrix = hf['matrix'][:]

How to cite TCIR?

Please cite the following paper:

Boyo Chen, Buo-Fu Chen, and Hsuan-Tien Lin. Rotation-blended CNNs on a new open dataset for tropical cyclone image-to-intensity regression. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), August 2018.


Last updated at CST 13:08, October 04, 2023
Please feel free to contact me: htlin.email.png
Valid HTML 4.0!