├── LICENSE.md
└── README.md
/LICENSE.md:
--------------------------------------------------------------------------------
1 |
ABCD Dataset Terms of Use
2 | June 30, 2017
3 |
4 | The users of the ABCD dataset must abide by the following Terms of Use:
5 |
6 | 1. The use of the dataset is limited only to noncommercial purpose.
7 |
8 | 1. When publishing results that use this dataset, the users shall comply with the following matters:
9 | * The users must cite the URL of the ABCD dataset (i.e., https://github.com/gistairc/ABCDdataset/) in their publication.
10 | * The original aerial images are copyrighted by PASCO corporation. Thus, the users must include the text "The copyright of aerial images is retained by PASCO corporation." in their publication to show attribution to the image provider.
11 |
12 | 1. The users of the dataset accept full responsibility for the use of the dataet,
13 | including but not limited to the use of any copies of copyrighted images that they may create from the dataset.
14 |
15 | 1. By downloading the dataset, the users are regarded as having agreed to the Terms of Use.
16 |
17 | ABCD dataset is based on results obtained from a project commissioned
18 | by the New Energy and Industrial Technology Development Organization (NEDO).
19 |
20 | National Institute of Advanced Industrial Science and Technology
21 | Artificial Intelligence Research Center, Japan
22 |
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
1 | # ABCD dataset
2 |
3 | ABCD (AIST Building Change Detection) dataset is a new labeled dataset, specially geared toward constructing and evaluating damage detection systems to identify whether buildings have been washed-away by tsunami.
4 |
5 | The paper:
6 | **Aito Fujita, Ken Sakurada, Tomoyuki Imaizumi, Riho Ito, Shuhei Hikosaka and Ryosuke Nakamura, "Damage Detection from Aerial Images
7 | via Convolutional Neural Networks," IAPR International Conference on Machine Vision Applications (MVA), 2017.** ([pdf](http://www.airc.aist.go.jp/gsrt/fujita_final.pdf))
8 |
9 | ## Synopsis
10 | Each datum in this dataset is a pair of pre- and post-tsunami aerial image patches, and encompasses a target building at the center of the patch.
11 | The below shows eight samples from the dataset, where four pairs are shown for "washed-away" buildings (left column) and "surviving" buildings (right column), respectively. The class label assigned to each patch pair (i.e. "washed-away" or "surviving") represents whether or not a building at the center of the pre-tsunami patch got wahshed-away by tsunami.
12 |
13 |
14 |
15 |
16 |
17 | These pairs were cropped from a hefty number of RGB aerial images of Tohoku region of Japan. These aerial images were taken before or after the Great East Japan earthquake, with the original pixel resolution of 40 cm for pre-quake images and 12 cm for post-qukae images (actually, resampled to 40 cm).
18 |
19 | We prepared the patch pairs for two types of size: **fixed-scale** and **resized**. Fixed-scale patches were cropped from aerial images with the fixed size of 160 x 160 pixels; so they have the same resolution of the original images (40 cm). In contrast, resized patches were cropped depending on the size of each target building (specifically, three times larger than the target building), and then all resized to 128 x 128 pixels; so the spatial scale of the patches varies from building to building.
20 | The resulting ABCD dataset comprised 8,506 pairs for fixed-scale (4,253 washed-away) and 8,444 pairs for resized (4,223 washed-away).
21 |
22 | As source of class labels, we employed the existing, post-quake survey result (http://fukkou.csis.u-tokyo.ac.jp/). This survey result is the outcome of an exhaustive
23 | field investigation which was carried out under the initiative of MLIT (Ministry of Land, Infrastructure, Transport and Tourism) in the wake of the Great East Japan earthquake on March 11, 2011. As a consequence of this survey, over 220,000 buildings in the ravaged areas were assessed, and each building was assigned a label according to the degree of damage.
24 |
25 |
26 |
27 | ## Download
28 | IMPORTANT -- Please read the [Terms of Use](https://github.com/gistairc/ABCDdataset/blob/master/LICENSE.md) before downloading the ABCD dataset.
29 |
30 | The dataset can be downloaded from [here](https://data.airc.aist.go.jp/ABCDdataset/ABCDdataset.zip) (2.1GB).
31 | Or type the following in the terminal:
32 | ```
33 | $ wget https://data.airc.aist.go.jp/ABCDdataset/ABCDdataset.zip
34 | $ unzip ABCDdataset.zip
35 | ```
36 |
37 |
38 | Schematic of the directory configuration in the unzipped file is as follows:
39 | ```
40 | ./ABCDdataset/fixed-scale/
41 | |
42 | |- patch-pairs/
43 | | |
44 | | |- patch-pair_1.tif
45 | | |- patch-pair_2.tif
46 | | :
47 | | |_ patch-pair_8506.tif
48 | |
49 | |_ 5fold-list/
50 | |
51 | |- cv1-train.csv
52 | |- cv1-test.csv
53 | :
54 | |_ cv5-test.csv
55 |
56 | ./ABCDdataset/resized/
57 | |
58 | |- patch-pairs/
59 | | |
60 | | |- patch-pair_1.tif
61 | | |- patch-pair_2.tif
62 | | :
63 | | |_ patch-pair_8444.tif
64 | |
65 | |_ 5fold-list/
66 | |
67 | |- cv1-train.csv
68 | |- cv1-test.csv
69 | :
70 | |_ cv5-test.csv
71 |
72 | ```
73 | The root directory contains two directories, `fixed-scale/` and `resized/`, each corresponding to fixed-scale and resized patch pairs as mentioned above. Each of the directories has two subdirectories, `patch-pairs/` and `5fold-list/`. In `patch-pairs/`, "washed-away" and "surviving" patch pairs are stored in `.tif` format. Each `.tif` file comprises 6 channels, the first three channels for a pre-tsunami RGB patch and the last three channels for a post-tsunami patch. Also, for traceability of our experiment, in `5fold-list/` we prepared csv files that specify file names we actually used for our 5-fold cross validation. For example, `cv1-train.csv` and `cv1-test.csv` are training and test set for one of 5 folds, and so on. These csv files take the following format:
74 |
75 | ```
76 | patch-pair_14.tif,1
77 | patch-pair_600.tif,1
78 | :
79 | patch-pair_34.tif,0
80 | ```
81 | where each record corresponds to each tiff file, the first field is file name and the second field represents class label of the file in the first field ("1" for "washed-away" and "0" for "surviving").
82 |
83 |
84 | ---
85 |
86 | ### Contact
87 | If you have any questions, please contact the following:
88 | Aito Fujita
89 | National Institute of Advanced Industrial Science and Technology (AIST), Japan
90 | Email: fujita.713[at]aist.go.jp
91 |
92 | ### Acknowledgement
93 | This dataset is based on results obtained from a project commissioned by the New Energy and Industrial Technology Development Organization (NEDO).
94 | We also appreciate MLIT (Ministry of Land, Infrastructure, Transport and Tourism) and CSiS (Center for Spatial Information Science, The University of Tokyo) for compiling [the archive of the Great East Japan Earthquake Survey](http://fukkou.csis.u-tokyo.ac.jp/), which we employed as source for groundtruths.
95 |
--------------------------------------------------------------------------------