`, where `c` is the identifier, `CAM` indicates the camera in which the key object's center point lies, and `x`, `y` are the horizontal and vertical coordinates of the 2D bounding box in that camera's coordinate system. The origin is the upper-left corner, with the positive x-axis pointing right and the positive y-axis pointing down.

Unlike the c tag, the answer to the question "Identify all the traffic elements in the front view" is a list of the form `[(c, s, x1, y1, x2, y2), ...]`. Here, `c` denotes the category, `s` the status, and `x1, y1, x2, y2` the offsets of the top-left and bottom-right corners of the box relative to the center point.
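As a rough illustration of the two output formats described above, the following Python sketch parses c tags and a traffic-element list. It assumes a tag is written like `<c1,CAM_BACK,1088.3,497.5>` and that the list arrives as a string with quoted category and status values. The helper names `parse_c_tags` and `parse_traffic_elements`, and all sample values, are hypothetical and not part of the dataset tooling.

```python
import ast
import re
from typing import List, NamedTuple, Tuple

# Assumed textual form of a c tag: <c1,CAM_BACK,1088.3,497.5>
C_TAG_PATTERN = re.compile(
    r"<\s*(c\d+)\s*,\s*(CAM_[A-Z_]+)\s*,\s*([\d.]+)\s*,\s*([\d.]+)\s*>"
)


class CTag(NamedTuple):
    identifier: str  # c: object identifier
    camera: str      # CAM: camera containing the object's center point
    x: float         # horizontal coordinate, origin at the upper-left corner
    y: float         # vertical coordinate, positive direction is downward


def parse_c_tags(text: str) -> List[CTag]:
    """Return every c tag found in a question or answer string."""
    return [
        CTag(c, cam, float(x), float(y))
        for c, cam, x, y in C_TAG_PATTERN.findall(text)
    ]


def parse_traffic_elements(
    answer: str,
) -> List[Tuple[str, str, float, float, float, float]]:
    """Parse a list formatted as [(c, s, x1, y1, x2, y2), ...].

    Assumes the category and status are quoted so the string is a valid
    Python literal; unquoted values would need a custom parser.
    """
    elements = ast.literal_eval(answer)
    return [
        (str(c), str(s), float(x1), float(y1), float(x2), float(y2))
        for c, s, x1, y1, x2, y2 in elements
    ]


# Hypothetical inputs, for illustration only.
tags = parse_c_tags("What is <c1,CAM_BACK,1088.3,497.5> doing?")
elements = parse_traffic_elements(
    "[('traffic_light', 'green', -12.0, -30.0, 12.0, 30.0)]"
)
```

Keeping the c tag as a named tuple makes the camera and pixel coordinates available by name when matching a question to an object.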
--------------------------------------------------------------------------------
/docs/gvqa.md:
--------------------------------------------------------------------------------
### What is GVQA?
The most exciting aspect of the dataset is that the questions and answers (QA pairs) are connected in a graph-style structure, with each QA pair as a node and potential logical progressions as the edges. This structure suits the autonomous driving (AD) domain because AD tasks are well defined per stage, from raw sensor input to final control action, through perception, prediction, and planning.

Its key difference from prior VQA tasks for AD is the availability of logical dependencies between QAs, which can be used to guide the answering process. Below is a demo video illustrating the idea.

https://github.com/OpenDriveLab/DriveLM/assets/54334254/988472a8-d7b9-4685-b4b8-7a0e77f68265

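To make the graph idea concrete, here is a minimal Python sketch of QA pairs as nodes with dependency edges, answered in an order that respects those edges. The node names, the `qa_parents` layout, and the helpers `answering_order` and `build_context` are hypothetical; the actual dataset schema may differ. The ordering uses Kahn's algorithm, written by hand so that it also runs on Python 3.8.

```python
from collections import deque
from typing import Dict, List

# Hypothetical QA graph: keys are QA-pair ids, values list the QA pairs
# whose answers they logically depend on (the incoming edges).
qa_parents: Dict[str, List[str]] = {
    "perception_q": [],
    "prediction_q": ["perception_q"],
    "planning_q": ["perception_q", "prediction_q"],
}


def answering_order(parents: Dict[str, List[str]]) -> List[str]:
    """Order QA pairs so each comes after everything it depends on."""
    remaining = {node: len(deps) for node, deps in parents.items()}
    children: Dict[str, List[str]] = {node: [] for node in parents}
    for node, deps in parents.items():
        for dep in deps:
            children[dep].append(node)

    # Start with the QA pairs that have no unanswered dependencies.
    ready = deque(node for node, count in remaining.items() if count == 0)
    order: List[str] = []
    while ready:
        node = ready.popleft()
        order.append(node)
        for child in children[node]:
            remaining[child] -= 1
            if remaining[child] == 0:
                ready.append(child)

    if len(order) != len(parents):
        raise ValueError("QA graph contains a cycle")
    return order


def build_context(
    node: str, parents: Dict[str, List[str]], answers: Dict[str, str]
) -> List[str]:
    """Collect the already-answered parent QAs that can guide `node`."""
    return [f"{dep}: {answers[dep]}" for dep in parents[node]]


# Answer each QA pair with its parents' answers available as context.
answers: Dict[str, str] = {}
for node in answering_order(qa_parents):
    context = build_context(node, qa_parents, answers)
    answers[node] = f"answer to {node} given {len(context)} parent QA(s)"
```

In this sketch the planning question is answered last, with both the perception and prediction answers passed in as context, which mirrors the perception, prediction, and planning progression described above.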
--------------------------------------------------------------------------------
/environment.yml:
--------------------------------------------------------------------------------
name: drivelm
channels:
  - omgarcia
  - conda-forge
  - defaults
dependencies:
  - _libgcc_mutex=0.1=main
  - _openmp_mutex=5.1=1_gnu
  - appdirs=1.4.4=pyhd3eb1b0_0
  - blas=1.0=mkl
  - brotlipy=0.7.0=py38h27cfd23_1003
  - ca-certificates=2023.7.22=hbcca054_0
  - certifi=2023.7.22=pyhd8ed1ab_0
  - cffi=1.15.1=py38h5eee18b_3
  - cryptography=41.0.2=py38h22a60cf_0
  - cudatoolkit=11.7.0=hd8887f6_10
  - gcc-6=6.1.0=2
  - gmp=6.1.0=1
  - idna=3.4=py38h06a4308_0
  - intel-openmp=2023.1.0=hdb19cb5_46305
  - isl=0.17.1=0
  - ld_impl_linux-64=2.38=h1181459_1
  - libffi=3.4.4=h6a678d5_0
  - libgcc-ng=11.2.0=h1234567_1
  - libgfortran-ng=11.2.0=h00389a5_1
  - libgfortran5=11.2.0=h1234567_1
  - libgomp=11.2.0=h1234567_1
  - libstdcxx-ng=11.2.0=h1234567_1
  - mkl=2023.1.0=h6d00ec8_46342
  - mkl-service=2.4.0=py38h5eee18b_1
  - mkl_fft=1.3.6=py38h417a72b_1
  - mkl_random=1.2.2=py38h417a72b_1
  - mpc=1.0.1=0
  - mpfr=3.1.2=0
  - ncurses=6.4=h6a678d5_0
  - openssl=3.0.10=h7f8727e_2
  - pip=23.2.1=py38h06a4308_0
  - pooch=1.4.0=pyhd3eb1b0_0
  - pycparser=2.21=pyhd3eb1b0_0
  - pyopenssl=23.2.0=py38h06a4308_0
  - pysocks=1.7.1=py38h06a4308_0
  - python=3.8.17=h955ad1f_0
  - readline=8.2=h5eee18b_0
  - sqlite=3.41.2=h5eee18b_0
  - tbb=2021.8.0=hdb19cb5_0
  - tk=8.6.12=h1ccaba5_0
  - wheel=0.38.4=py38h06a4308_0
  - xz=5.4.2=h5eee18b_0
  - zlib=1.2.13=h5eee18b_0
  - pip:
    - absl-py==1.4.0
    - accelerate==0.21.0
    - addict==2.4.0
    - aiohttp==3.8.5
    - aiosignal==1.3.1
    - aliyun-python-sdk-core==2.13.36
    - aliyun-python-sdk-kms==2.16.1
    - ansi2html==1.8.0
    - antlr4-python3-runtime==4.9.3
    - anyio==3.7.1
    - argon2-cffi==23.1.0
    - argon2-cffi-bindings==21.2.0
    - arrow==1.2.3
    - asttokens==2.2.1
    - async-lru==2.0.4
    - async-timeout==4.0.2
    - attrs==23.1.0
    - babel==2.12.1
    - backcall==0.2.0
    - beautifulsoup4==4.12.2
    - bert-score
    - bitsandbytes==0.41.1
    - black==23.7.0
    - bleach==6.0.0
    - cachetools==5.3.1
    - cchardet==2.1.7
    - chardet==5.2.0
    - charset-normalizer==3.2.0
    - click==8.1.6
    - cmake==3.27.0
    - colorama==0.4.6
    - colorlog==6.7.0
    - comm==0.1.4
    - configargparse==1.7
    - contourpy==1.1.0
    # - cosine-annealing-warmup # no need for torch lightning
    - crcmod==1.7
    - cycler==0.11.0
    - dash==2.13.0
    - dash-core-components==2.0.0
    - dash-html-components==2.0.0
    - dash-table==5.0.0
    - datasets==2.14.3
    - debugpy==1.6.7.post1
    - decorator==5.1.1
    - defusedxml==0.7.1
    - descartes==1.1.0
    - dill==0.3.7
    - docker-pycreds==0.4.0
    - evaluate==0.4.0
    - exceptiongroup==1.1.3
    - executing==1.2.0
    - fastjsonschema==2.18.0
    - filelock==3.12.2
    - fire==0.5.0
    - flake8==6.1.0
    - flask==2.2.5
    - fonttools==4.42.0
    - fqdn==1.5.1
    - frozenlist==1.4.0
    - fsspec==2023.6.0
    - gitdb==4.0.10
    - gitpython==3.1.32
    - google-auth
    - google-auth-oauthlib
    - grpcio==1.56.2
    - huggingface-hub==0.16.4
    - hydra-core==1.3.2
    - imageio==2.31.1
    - importlib-metadata==6.8.0
    - importlib-resources==6.0.0
    - iniconfig==2.0.0
    - inquirerpy==0.3.4
    - ipykernel==6.25.1
    - ipython==8.12.2
    - ipython-genutils==0.2.0
    - ipywidgets==8.1.0
    - isoduration==20.11.0
    - itsdangerous==2.1.2
    - jedi==0.19.0
    - jinja2==3.1.2
    - jmespath==0.10.0
    - joblib==1.3.1
    - json5==0.9.14
    - jsonpointer==2.4
    - jsonschema==4.19.0
    - jsonschema-specifications==2023.7.1
    - jupyter==1.0.0
    - jupyter-client==8.3.1
    - jupyter-console==6.6.3
    - jupyter-core==5.3.1
    - jupyter-events==0.7.0
    - jupyter-lsp==2.2.0
    - jupyter-server==2.7.2
    - jupyter-server-terminals==0.4.4
    - jupyterlab==4.0.5
    - jupyterlab-pygments==0.2.2
    - jupyterlab-server==2.24.0
    - jupyterlab-widgets==3.0.8
    - kiwisolver==1.4.4
    # - language-evaluation # should install separately
    - lazy-loader==0.3
    - lightning-utilities==0.9.0
    - line-profiler==4.0.3
    - lit==16.0.6
    - llvmlite==0.31.0
    - lyft-dataset-sdk==0.0.8
    - markdown==3.4.4
    - markdown-it-py==3.0.0
    - markupsafe==2.1.3
    - matplotlib==3.5.2
    - matplotlib-inline==0.1.6
    - mccabe==0.7.0
    - mdurl==0.1.2
    - mistune==2.0.5
    # - mmcv==1.4.0 # no need for loading drivelm, need for loading nuScenes
    # - mmdet==2.14.0 # no need for loading drivelm, need for loading nuScenes
    # - mmengine==0.8.4 # no need for loading drivelm, need for loading nuScenes
    # - mmsegmentation==0.14.1 # no need for loading drivelm, need for loading nuScenes
    - model-index==0.1.11
    - more-itertools==10.1.0
    - mpmath==1.3.0
    - multidict==6.0.4
    - multiprocess==0.70.15
    - mypy-extensions==1.0.0
    - nbclient==0.8.0
    - nbconvert==7.4.0
    - nbformat==5.5.0
    - nest-asyncio==1.5.7
    - networkx==2.2
    - nltk==3.8.1
    - notebook==7.0.2
    - notebook-shim==0.2.3
    - numba==0.48.0
    - numpy
    - nuscenes-devkit==1.1.10
    - nvidia-cublas-cu11==11.10.3.66
    - nvidia-cuda-cupti-cu11==11.7.101
    - nvidia-cuda-nvrtc-cu11==11.7.99
    - nvidia-cuda-runtime-cu11==11.7.99
    - nvidia-cudnn-cu11==8.5.0.96
    - nvidia-cufft-cu11==10.9.0.58
    - nvidia-curand-cu11==10.2.10.91
    - nvidia-cusolver-cu11==11.4.0.1
    - nvidia-cusparse-cu11==11.7.4.91
    - nvidia-nccl-cu11==2.14.3
    - nvidia-nvtx-cu11==11.7.91
    - oauthlib==3.2.2
    - omegaconf==2.3.0
    # - open3d # visualization package, no need for loading drivelm
    - opencv-python==4.8.0.74
    # - opendatalab==0.0.10 # data download tools, no need for loading drivelm
    # - openmim==0.3.9 # mmlab package manager, no need for loading drivelm
    # - openxlab==0.0.22 # mmlab package, no need for loading drivelm
    - ordered-set==4.1.0
    - oss2==2.17.0
    - overrides==7.4.0
    - packaging==23.1
    - pandas==1.4.4
    - pandocfilters==1.5.0
    - parso==0.8.3
    - pathspec==0.11.2
    - pathtools==0.1.2
    - peft==0.4.0
    - pexpect==4.8.0
    - pfzy==0.3.4
    - pickleshare==0.7.5
    - pillow==10.0.0
    - pkgutil-resolve-name==1.3.10
    - platformdirs==3.10.0
    - plotly==5.16.1
    - pluggy==1.3.0
    - plyfile==1.0.1
    - prettytable==3.8.0
    - prometheus-client==0.17.1
    - prompt-toolkit==3.0.39
    - protobuf==4.23.4
    - psutil==5.9.5
    - ptyprocess==0.7.0
    - pure-eval==0.2.2
    - pyarrow==12.0.1
    - pyasn1==0.5.0
    - pyasn1-modules==0.3.0
    - pycocotools==2.0.7
    - pycodestyle==2.11.0
    - pycryptodome==3.18.0
    - pydeprecate==0.3.2
    - pyflakes==3.1.0
    - pygments==2.16.1
    - pyparsing==3.0.9
    - pyquaternion==0.9.9
    - pytest==7.4.0
    - python-dateutil==2.8.2
    - python-json-logger==2.0.7
    - pytorch-lightning==1.7.0
    - pytz==2023.3
    - pywavelets==1.4.1
    - pyyaml==6.0.1
    - pyzmq==25.1.1
    - qtconsole==5.4.3
    - qtpy==2.4.0
    - referencing==0.30.2
    - regex==2023.6.3
    - requests
    - requests-oauthlib
    - responses==0.18.0
    - retrying==1.3.4
    - rfc3339-validator==0.1.4
    - rfc3986-validator==0.1.1
    - rich==13.4.2
    - rouge-score==0.1.2
    - rpds-py==0.10.0
    - rsa==4.9
    - safetensors==0.3.1
    - scikit-image==0.19.3
    - scikit-learn==1.3.0
    - scipy==1.7.3
    - send2trash==1.8.2
    - sentencepiece==0.1.99
    - sentry-sdk==1.29.2
    - setproctitle==1.3.2
    - setuptools==60.2.0
    - shapely==1.8.5
    - six==1.16.0
    - smmap==5.0.0
    - sniffio==1.3.0
    - soupsieve==2.4.1
    - stack-data==0.6.2
    - sympy==1.12
    - tabulate==0.9.0
    - tenacity==8.2.3
    - tensorboard==2.13.0
    - tensorboard-data-server==0.7.1
    - termcolor==2.3.0
    - terminado==0.17.1
    - terminaltables==3.1.10
    - threadpoolctl==3.2.0
    - tifffile==2023.7.10
    - tinycss2==1.2.1
    - tokenizers==0.13.3
    - tomli==2.0.1
    - torch==2.0.1
    - torchaudio==2.0.2
    - torchmetrics==0.11.1
    - torchvision==0.15.2
    - tornado==6.3.3
    - tqdm==4.65.0
    - traitlets==5.9.0
    - transformers==4.31.0
    - trimesh==2.35.39
    - triton==2.0.0
    - typing-extensions==4.7.1
    - tzdata==2023.3
    - uri-template==1.3.0
    - urllib3==2.0.4
    - wandb==0.15.8
    - wcwidth==0.2.6
    - webcolors==1.13
    - webencodings==0.5.1
    - websocket-client==1.6.2
    - werkzeug==2.2.3
    - widgetsnbextension==4.0.8
    - xxhash==3.3.0
    - yapf==0.40.1
    - yarl==1.9.2
    - zipp==3.16.2
--------------------------------------------------------------------------------
/index.html:
--------------------------------------------------------------------------------
You will be redirected to opendrivelab.com/DriveLM/ soon!
--------------------------------------------------------------------------------