├── .gitignore
├── README.md
├── domain_adaptation
│   ├── README.md
│   ├── source
│   │   ├── core
│   │   │   ├── bdds.py
│   │   │   ├── cityscapes.py
│   │   │   ├── csvs
│   │   │   │   ├── stratified_GTA.csv
│   │   │   │   ├── stratified_bdds.csv
│   │   │   │   └── stratified_mapillary.csv
│   │   │   ├── gta.py
│   │   │   └── mapillary.py
│   │   └── prep_all.sh
│   └── target
│       ├── semi-supervised
│       │   └── selected_samples.csv
│       └── weakly-supervised
│           ├── train1.zip
│           ├── train2.zip
│           └── val.zip
├── evaluation
│   ├── csHelpers.py
│   ├── evaluate_detection.py
│   ├── evaluate_instance_segmentation.py
│   ├── evaluate_mIoU.py
│   ├── helper_eval_detection.py
│   ├── idd_lite_evaluate_mIoU.py
│   ├── instance.py
│   └── instances2dict.py
├── helpers
│   ├── __pycache__
│   │   ├── annotation.cpython-35.pyc
│   │   ├── annotation.cpython-37.pyc
│   │   ├── anue_labels.cpython-35.pyc
│   │   └── anue_labels.cpython-37.pyc
│   ├── annotation.py
│   └── anue_labels.py
├── preperation
│   ├── __pycache__
│   │   ├── json2instanceImg.cpython-37.pyc
│   │   └── json2labelImg.cpython-37.pyc
│   ├── cityscape_panoptic_gt.py
│   ├── createLabels.py
│   ├── json2instanceImg.py
│   └── json2labelImg.py
├── requirements.txt
└── viewer
    ├── viewer.py
    └── viewer2.py

/.gitignore:
--------------------------------------------------------------------------------
*.pyc
__pycache__
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
# Scene Understanding Challenge for Autonomous Navigation in Unstructured Environments

Code for working with the dataset used for the [Scene Understanding Challenge for Autonomous Navigation in Unstructured Environments](http://cvit.iiit.ac.in/scene-understanding-challenge-2018/). For details on getting the dataset and for updates, see:
- https://cvit.iiit.ac.in/autonue2021/
- https://cvit.iiit.ac.in/autonue2019/
- http://cvit.iiit.ac.in/autonue2018/
- http://cvit.iiit.ac.in/scene-understanding-challenge-2018/


# AutoNUE 2021 (Domain Adaptation and Semantic Segmentation)

This repository contains the datasets related to the domain adaptation and segmentation challenges for AutoNUE 2021, a CVPR workshop. For more details, please visit https://cvit.iiit.ac.in/autonue2021/challenge. For the segmentation challenge, please skip the "Source datasets" section below.


## Source datasets:

Participants are requested to download the datasets from their original websites, listed below for easy reference:
1. https://www.mapillary.com/dataset/vistas?pKey=q0GhQpk20wJm1ba1mfwJmw
2. https://bdd-data.berkeley.edu/ (you might have to click on the Advanced tab, and then click on "proceed to bdd-data.berkeley.edu")
3. https://download.visinf.tu-darmstadt.de/data/from_games/
4. https://www.cityscapes-dataset.com/examples/#fine-annotations (only the fine annotations are to be used)

After downloading all the source datasets, move them to the folder ./domain_adaptation/source/datasets/. Its folder structure should be as follows:
```
datasets
|--mapillary-vistas-dataset_public_v1.1/
|  |--training/
|  |  |--images/
|  |  |--labels/
|  |--validation/
|  |  |--images/
|  |  |--labels/
|  |--testing/
|     |--images/
|--bdd100k/
|  |--seg/
|     |--images/
|     |  |--train/
|     |  |--val/
|     |  |--test/
|     |--labels/
|        |--train/
|        |--val/
|--gta/
|  |--images/
|  |--labels/
|--cityscapes/
   |--gtFine/
   |  |--train/
   |  |--val/
   |  |--test/
   |--leftImg8bit/
      |--train/
      |--val/
      |--test/
```
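Before running the preparation script below, it can help to verify that this layout is in place. A minimal sketch of such a check (not part of this repository; the directory names are taken from the tree above):

```python
# Sketch: sanity-check the expected source-dataset layout before running
# prep_all.sh. Directory names follow the tree shown above.
from pathlib import Path

ROOT = Path("domain_adaptation/source/datasets")
EXPECTED = [
    "mapillary-vistas-dataset_public_v1.1/training/images",
    "mapillary-vistas-dataset_public_v1.1/training/labels",
    "bdd100k/seg/images/train",
    "bdd100k/seg/labels/train",
    "gta/images",
    "gta/labels",
    "cityscapes/gtFine/train",
    "cityscapes/leftImg8bit/train",
]

missing = [p for p in EXPECTED if not (ROOT / p).is_dir()]
for p in missing:
    print("missing:", ROOT / p)
if not missing:
    print("Source dataset layout looks correct.")
```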
Run the following commands:

```
pip3 install -r requirements.txt
chmod +x domain_adaptation/source/prep_all.sh
./domain_adaptation/source/prep_all.sh
```

This will create a folder "domain_adaptation/source/source_datasets_dir/" where you will find the images and annotations of the source dataset to be used for any of the domain adaptation challenges.

## Target datasets:

**Before use, first add helpers/ to $PYTHONPATH:**
```
export PYTHONPATH="${PYTHONPATH}:helpers/"
```

### Dataset Structure

The structure is similar to the Cityscapes dataset. That is:
```
gtFine/{split}/{drive_no}/{img_id}_gtFine_polygons.json for ground truths
leftImg8bit/{split}/{drive_no}/{img_id}_leftImg8bit.png for image frames
```
#### Semantic Segmentation

Furthermore, for training, label masks need to be generated as described below, resulting in the following files:
```
gtFine/{split}/{drive_no}/{img_id}_gtFine_labellevel3Ids.png
gtFine/{split}/{drive_no}/{img_id}_gtFine_instancelevel3Ids.png
```
### Labels

See helpers/anue_labels.py

#### Generate Label Masks (for training/evaluation) (Semantic/Instance/Panoptic Segmentation)
```bash
python preperation/createLabels.py --datadir $ANUE --id-type $IDTYPE --color [True|False] --instance [True|False] --num-workers $C
```

- ANUE is the path to the AutoNUE dataset
- IDTYPE can be id, csId, csTrainId, level3Id, level2Id or level1Id.
- color True generates the color masks
- instance True generates the instance masks with the id given by IDTYPE
- panoptic True generates panoptic masks in a format similar to COCO. See the modified evaluation scripts here: https://github.com/AutoNUE/panopticapi
- C is the number of threads to run in parallel
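Once masks have been generated, a quick way to sanity-check one is to load it and count the label ids it contains. A minimal sketch using Pillow and NumPy (the file path below is illustrative, not a real sample):

```python
# Sketch: inspect a generated level3Id label mask and print the pixel
# frequency of each id it contains. The path is an illustrative example.
import numpy as np
from PIL import Image

mask_path = "gtFine/train/201/frame0001_gtFine_labellevel3Ids.png"  # example path
mask = np.array(Image.open(mask_path))

ids, counts = np.unique(mask, return_counts=True)
for i, c in zip(ids, counts):
    print(f"level3Id {i:3d}: {c} pixels ({100.0 * c / mask.size:.2f}%)")
```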
For the supervised domain adaptation and semantic segmentation tasks, the masks should be generated using an IDTYPE of level3Id and used for training models (similar to trainId in Cityscapes). This can be done with the command:
```bash
python preperation/createLabels.py --datadir $ANUE --id-type level3Id --num-workers $C
```

The following commands generate the target labels for the other domain adaptation tasks:

```
python3 preperation/createLabels.py --datadir $ANUE --id-type level3Id --num-workers $C --semisup_da True
python3 preperation/createLabels.py --datadir $ANUE --id-type level3Id --num-workers $C --weaksup_da True
python3 preperation/createLabels.py --datadir $ANUE --id-type level3Id --num-workers $C --unsup_da True
```
The bounding box labels for weakly supervised domain adaptation can be downloaded from here: https://github.com/AutoNUE/public-code/tree/master/domain_adaptation/target/weakly-supervised


The generated files:

- _gtFine_labellevel3Ids.png will be used for semantic segmentation



# AutoNUE 2019

**Before use, first add helpers/ to $PYTHONPATH:**
```
export PYTHONPATH="${PYTHONPATH}:helpers/"
```

**The code has been tested on Python 3.6.4.**

## Dataset Structure

The structure is similar to the Cityscapes dataset. That is:
```
gtFine/{split}/{drive_no}/{img_id}_gtFine_polygons.json for ground truths
leftImg8bit/{split}/{drive_no}/{img_id}_leftImg8bit.png for image frames
```
### Semantic Segmentation and Instance Segmentation

Furthermore, for training, label masks need to be generated as described below, resulting in the following files:
```
gtFine/{split}/{drive_no}/{img_id}_gtFine_labellevel3Ids.png
gtFine/{split}/{drive_no}/{img_id}_gtFine_instancelevel3Ids.png
```

### Panoptic Challenge

Furthermore, for training, panoptic masks need to be generated as described below, resulting in the following files:
```
gtFine/{split}_panoptic/{drive_no}_{img_id}_gtFine_panopticlevel3Ids.png
gtFine/{split}_panoptic.json
```
### Detection

The structure is similar to the Pascal VOC dataset:
- JPEGImages/{split}/{drive_no}/{img_id}.jpg for images
- Annotations/{split}/{drive_no}/{img_id}.xml for annotations
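As a rough illustration of reading these annotations, here is a sketch that assumes standard Pascal VOC tag names (`object`, `name`, and `bndbox` with `xmin`/`ymin`/`xmax`/`ymax`); the tag names are an assumption, not confirmed by this README, so check them against an actual annotation file first:

```python
# Sketch: read object boxes from a detection annotation XML. Assumes
# Pascal VOC-style tags (an assumption); verify against a real file.
import xml.etree.ElementTree as ET

def read_boxes(xml_path):
    root = ET.parse(xml_path).getroot()
    boxes = []
    for obj in root.iter("object"):
        name = obj.findtext("name")
        bb = obj.find("bndbox")
        xyxy = tuple(int(float(bb.findtext(k))) for k in ("xmin", "ymin", "xmax", "ymax"))
        boxes.append((name, xyxy))
    return boxes

# Example usage (path is illustrative):
# for name, (x1, y1, x2, y2) in read_boxes("Annotations/train/201/frame0001.xml"):
#     print(name, x1, y1, x2, y2)
```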
## Labels

See helpers/anue_labels.py

### Generate Label Masks (for training/evaluation) (Semantic/Instance/Panoptic Segmentation)
```bash
python preperation/createLabels.py --datadir $ANUE --id-type $IDTYPE --color [True|False] --instance [True|False] --num-workers $C
```

- ANUE is the path to the AutoNUE dataset
- IDTYPE can be id, csId, csTrainId, level3Id, level2Id or level1Id.
- color True generates the color masks
- instance True generates the instance masks with the id given by IDTYPE
- panoptic True generates panoptic masks in a format similar to COCO. See the modified evaluation scripts here: https://github.com/AutoNUE/panopticapi
- C is the number of threads to run in parallel

For the semantic segmentation challenge, masks should be generated using an IDTYPE of level3Id and used for training models (similar to trainId in Cityscapes). This can be done with the command:
```bash
python preperation/createLabels.py --datadir $ANUE --id-type level3Id --num-workers $C
```
For the instance segmentation challenge, instance masks should be generated with the following command:
```bash
python preperation/createLabels.py --datadir $ANUE --id-type id --num-workers $C
```

The generated files:

- _gtFine_labellevel3Ids.png will be used for semantic segmentation
- _gtFine_instanceids.png will be used for instance segmentation
- _gtFine_panopticlevel3Ids.png will be used for panoptic segmentation, under the folder gtFine/{split}_panoptic, together with gtFine/{split}_panoptic.json

### Detection

We use a subset of the labels from helpers/anue_labels.py:

| Label | level3Id | Trainable |
|---|---|---|
| person | 4 | True |
| rider | 5 | True |
| car | 9 | True |
| truck | 10 | True |
| bus | 11 | True |
| motorcycle | 6 | True |
| bicycle | 7 | True |
| autorickshaw | 8 | True |
| animal | 4 | True |
| traffic light | 18 | True |
| traffic sign | 19 | True |
| vehicle fallback | 12 | False |
| caravan | 12 | False |
| trailer | 12 | False |
| train | 12 | False |

Note: We train based on level3Ids, using only the labels marked as trainable, and report accuracies on them.


## Viewer

First generate label masks as described above. To view the ground truth / prediction masks at different levels of the hierarchy, use:
```bash
python viewer/viewer.py --datadir $ANUE
```

- ANUE is the folder path to the dataset or to prediction masks with a file/folder structure similar to the dataset.

TODO: Make the color map more sensible.


## Evaluation

### Semantic Segmentation

First generate label masks with level3Ids as described before. Then:
```bash
python evaluation/evaluate_mIoU.py --gts $GT --preds $PRED --num-workers $C
```

- GT is the folder path of ground truths containing {drive_no}/{img_id}_gtFine_labellevel3Ids.png
- PRED is the folder path of predictions with the same folder structure and file names.
- C is the number of threads to run in parallel


### Constrained Semantic Segmentation

First generate label masks with level1Ids as described before. Then:
```bash
python evaluation/idd_lite_evaluate_mIoU.py --gts $GT --preds $PRED --num-workers $C
```

- GT is the folder path of ground truths containing {drive_no}/{img_id}_gtFine_labellevel1Ids.png
- PRED is the folder path of predictions with the same folder structure and file names.
- C is the number of threads to run in parallel


### Instance Segmentation

First generate instance label masks with IDTYPE=id, as described before. Then:
```bash
python evaluation/evaluate_instance_segmentation.py --gts $GT --preds $PRED
```

- GT is the folder path of ground truths containing {drive_no}/{img_id}_gtFine_instanceids.png
- PRED is the folder path of predictions with the same folder structure and file names. The format for predictions is the same as in the Cityscapes dataset. That is, a .txt file where each line is of the form "