├── .gitignore
├── LICENSE
├── README.md
├── camera_movement_estimator
│   ├── __init__.py
│   └── camera_movement_estimator.py
├── development_and_analysis
│   └── color_assignment.ipynb
├── input_videos
│   ├── NOTE.txt
│   └── match.mp4
├── main.py
├── models
│   └── NOTE.txt
├── output_images
│   └── player_2.jpg
├── output_videos
│   └── NOTE.txt
├── player_ball_assigner
│   ├── __init__.py
│   └── player_ball_assigner.py
├── requirements.txt
├── speed_and_distance_estimator
│   ├── __init__.py
│   └── speed_and_distance_estimator.py
├── stubs
│   ├── camera_movement_stub.pkl
│   └── track_stubs.pkl
├── team_assigner
│   ├── __init__.py
│   └── team_assigner.py
├── trackers
│   ├── __init__.py
│   └── tracker.py
├── training
│   └── football_training_yolo_v5.ipynb
├── utils
│   ├── __init__.py
│   ├── bbox_utils.py
│   └── video_utils.py
├── view_transformer
│   ├── __init__.py
│   └── view_transformer.py
└── yolo_inference.py
/.gitignore:
--------------------------------------------------------------------------------
1 | *__pycache__*
2 | *.pyc
3 | *.avi
4 | *.pt
5 | *football-players-detection-1*
--------------------------------------------------------------------------------
/LICENSE:
--------------------------------------------------------------------------------
1 | MIT License
2 |
3 | Copyright (c) 2024 Rajveer Singh
4 |
5 | Permission is hereby granted, free of charge, to any person obtaining a copy
6 | of this software and associated documentation files (the "Software"), to deal
7 | in the Software without restriction, including without limitation the rights
8 | to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
9 | copies of the Software, and to permit persons to whom the Software is
10 | furnished to do so, subject to the following conditions:
11 |
12 | The above copyright notice and this permission notice shall be included in all
13 | copies or substantial portions of the Software.
14 |
15 | THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16 | IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17 | FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18 | AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19 | LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20 | OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
21 | SOFTWARE.
22 |
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
1 | # Football Analysis using YOLO
2 |
3 | This project employs YOLO (You Only Look Once) object detection to conduct comprehensive analysis of football matches. The goal is to provide detailed insights into player performance, team dynamics, ball possession, and camera movements during a match.
4 |
5 |
6 |
7 | ## Installation
8 |
9 | 1. **Clone the Repository:**
10 |
11 | ```bash
12 | git clone https://github.com/rajveersinghcse/football-analysis-using-yolo.git
13 | cd football-analysis-using-yolo
14 | ```
15 |
16 | 2. **Install Dependencies:**
17 | ```bash
18 | pip install -r requirements.txt
19 | ```
20 |
21 | The following libraries are used in this project:
22 |
23 | - ultralytics
24 | - numpy
25 | - opencv-python
26 | - roboflow
27 | - pandas
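28 | - pickle (Python standard library; no installation needed)
29 | - supervision
30 | - shutil (Python standard library; no installation needed)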
31 | - scikit-learn
32 |
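As a quick sanity check after installation, the snippet below reports which third-party dependencies cannot be imported. It is only a sketch: the `REQUIRED` mapping and the `missing_packages` helper are illustrative names, not part of this repository. The import names (`cv2` for opencv-python, `sklearn` for scikit-learn) are the ones the project's own modules use.

```python
from importlib.util import find_spec

# pip package name -> module name used in `import` statements.
# Hypothetical helper data for illustration; not shipped with the project.
REQUIRED = {
    "ultralytics": "ultralytics",
    "numpy": "numpy",
    "opencv-python": "cv2",
    "roboflow": "roboflow",
    "pandas": "pandas",
    "supervision": "supervision",
    "scikit-learn": "sklearn",
}


def missing_packages(required=REQUIRED):
    """Return the pip names of packages whose module cannot be located."""
    return [pkg for pkg, module in required.items() if find_spec(module) is None]


if __name__ == "__main__":
    print(missing_packages() or "all dependencies found")
```

If the list is not empty, re-run `pip install -r requirements.txt` in the active environment.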
33 | ## Usage
34 |
35 | 1. **Data Preparation:**
36 |
37 | - Place your video footage of the football match in the `input_videos` directory as `match.mp4`, the path that `main.py` reads.
38 |
39 | 2. **Running the Analysis:**
40 |
41 | - Execute the main script `python main.py` to initiate the analysis process.
42 | - The analysis encompasses the following key steps:
43 | - Object tracking using YOLO for players, referees, and the football.
44 | - Estimating camera movements to understand viewpoint changes.
45 | - Calculating player speed, distance traveled, and determining ball possession.
46 | - Visualizing analysis results on the video frames.
47 |
48 | 3. **Output:**
49 | - The annotated video is saved as `output_videos/output_video.avi`.
50 |
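The speed calculation listed in the steps above can be sketched in isolation. This is a minimal illustration rather than the project's implementation: it assumes positions are already expressed in metres (as they are after the view transform) and uses the same defaults as `speed_and_distance_estimator.py` (24 frames per second, a 5-frame window). The function name `speed_kmh` is hypothetical.

```python
import math


def speed_kmh(start, end, frames_elapsed, frame_rate=24):
    """Average speed in km/h between two pitch positions given in metres."""
    if frames_elapsed <= 0:
        # No time has passed, so a speed cannot be computed.
        raise ValueError("frames_elapsed must be positive")
    distance_m = math.dist(start, end)      # Euclidean distance in metres
    seconds = frames_elapsed / frame_rate   # frames -> seconds
    return distance_m / seconds * 3.6       # m/s -> km/h


if __name__ == "__main__":
    # A player who covers 1 m in a 5-frame window at 24 fps.
    print(f"{speed_kmh((0.0, 0.0), (1.0, 0.0), 5):.2f} km/h")
```

The explicit check on `frames_elapsed` matters because the last window of a video can span zero frames, which would otherwise cause a division by zero.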
51 | ## Code Structure
52 |
53 | - **`utils/`**: Contains utility functions for video I/O (`video_utils.py`) and bounding-box geometry (`bbox_utils.py`).
54 | - **`trackers/tracker.py`**: Implements the YOLO-based object tracker and ball-position interpolation.
55 | - **`team_assigner.py`**: Assigns teams to players based on their visual appearance.
56 | - **`player_ball_assigner.py`**: Determines ball possession among players during the match.
57 | - **`camera_movement_estimator.py`**: Estimates camera movements to analyze perspective changes.
58 | - **`view_transformer.py`**: Transforms object positions based on the camera view for accurate analysis.
59 | - **`speed_and_distance_estimator.py`**: Calculates player speeds and distances traveled for performance evaluation.
60 |
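To make the ball possession module listed above more concrete, here is a simplified, dependency-free sketch of the idea in `player_ball_assigner.py`: the ball goes to the player whose bottom bounding-box corner is nearest to the ball centre, provided that distance is under a pixel threshold (70 in the source). The function name `assign_ball` and the plain `{player_id: bbox}` input are assumptions made for this example. The real class reads `player['bbox']` from the track dictionaries.

```python
import math


def assign_ball(players, ball_bbox, max_distance=70):
    """Return the id of the player closest to the ball, or -1 if none is near.

    players   -- dict mapping player id to an (x1, y1, x2, y2) bounding box
    ball_bbox -- (x1, y1, x2, y2) bounding box of the ball
    """
    ball_x = (ball_bbox[0] + ball_bbox[2]) / 2
    ball_y = (ball_bbox[1] + ball_bbox[3]) / 2

    best_id, best_distance = -1, float("inf")
    for player_id, (x1, _, x2, y2) in players.items():
        # Measure from both bottom corners (roughly the feet) and keep the nearer one.
        distance = min(
            math.dist((x1, y2), (ball_x, ball_y)),
            math.dist((x2, y2), (ball_x, ball_y)),
        )
        if distance < max_distance and distance < best_distance:
            best_id, best_distance = player_id, distance
    return best_id


if __name__ == "__main__":
    players = {7: (100, 50, 140, 200), 9: (400, 60, 440, 210)}
    print(assign_ball(players, (150, 190, 160, 200)))  # ball next to player 7's feet
    print(assign_ball(players, (900, 900, 910, 910)))  # ball far from everyone
```

Returning `-1` when nobody is close enough lets the caller carry the previous possession forward, which is what `main.py` does.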
61 | ## Contributing
62 |
63 | Contributions, feedback, and suggestions are highly encouraged! Please feel free to open an issue or submit a pull request with any improvements or new features.
64 |
65 | ## License
66 |
67 | This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
68 |
69 | ## Acknowledgements
70 |
71 | Special thanks to the YOLOv5 team and the contributors of the libraries used in this project for their valuable contributions to the field of object detection and analysis in computer vision.
72 |
--------------------------------------------------------------------------------
/camera_movement_estimator/__init__.py:
--------------------------------------------------------------------------------
1 | from .camera_movement_estimator import CameraMovementEstimator
--------------------------------------------------------------------------------
/camera_movement_estimator/camera_movement_estimator.py:
--------------------------------------------------------------------------------
1 | import pickle
2 | import cv2
3 | import numpy as np
4 | import os
5 | import sys
6 | sys.path.append('../')
7 | from utils import measure_distance,measure_xy_distance
8 |
9 | class CameraMovementEstimator():
10 |     def __init__(self,frame):
11 |         self.minimum_distance = 5
12 |
13 |         self.lk_params = dict(
14 |             winSize = (15,15),
15 |             maxLevel = 2,
16 |             criteria = (cv2.TERM_CRITERIA_EPS | cv2.TERM_CRITERIA_COUNT,10,0.03)
17 |         )
18 |
19 |         first_frame_grayscale = cv2.cvtColor(frame,cv2.COLOR_BGR2GRAY)
20 |         mask_features = np.zeros_like(first_frame_grayscale)
21 |         mask_features[:,0:20] = 1
22 |         mask_features[:,900:1050] = 1
23 |
24 |         self.features = dict(
25 |             maxCorners = 100,
26 |             qualityLevel = 0.3,
27 |             minDistance =3,
28 |             blockSize = 7,
29 |             mask = mask_features
30 |         )
31 |
32 |     def add_adjust_positions_to_tracks(self,tracks, camera_movement_per_frame):
33 |         for object, object_tracks in tracks.items():
34 |             for frame_num, track in enumerate(object_tracks):
35 |                 for track_id, track_info in track.items():
36 |                     position = track_info['position']
37 |                     camera_movement = camera_movement_per_frame[frame_num]
38 |                     position_adjusted = (position[0]-camera_movement[0],position[1]-camera_movement[1])
39 |                     tracks[object][frame_num][track_id]['position_adjusted'] = position_adjusted
40 |
41 |
42 |
43 |     def get_camera_movement(self,frames,read_from_stub=False, stub_path=None):
44 |         # Read the stub
45 |         if read_from_stub and stub_path is not None and os.path.exists(stub_path):
46 |             with open(stub_path,'rb') as f:
47 |                 return pickle.load(f)
48 |
49 |         camera_movement = [[0,0] for _ in range(len(frames))]
50 |
51 |         old_gray = cv2.cvtColor(frames[0],cv2.COLOR_BGR2GRAY)
52 |         old_features = cv2.goodFeaturesToTrack(old_gray,**self.features)
53 |
54 |         for frame_num in range(1,len(frames)):
55 |             frame_gray = cv2.cvtColor(frames[frame_num],cv2.COLOR_BGR2GRAY)
56 |             new_features, _,_ = cv2.calcOpticalFlowPyrLK(old_gray,frame_gray,old_features,None,**self.lk_params)
57 |
58 |             max_distance = 0
59 |             camera_movement_x, camera_movement_y = 0,0
60 |
61 |             for i, (new,old) in enumerate(zip(new_features,old_features)):
62 |                 new_features_point = new.ravel()
63 |                 old_features_point = old.ravel()
64 |
65 |                 distance = measure_distance(new_features_point,old_features_point)
66 |                 if distance>max_distance:
67 |                     max_distance = distance
68 |                     camera_movement_x,camera_movement_y = measure_xy_distance(old_features_point, new_features_point )
69 |
70 |             if max_distance > self.minimum_distance:
71 |                 camera_movement[frame_num] = [camera_movement_x,camera_movement_y]
72 |                 old_features = cv2.goodFeaturesToTrack(frame_gray,**self.features)
73 |
74 |             old_gray = frame_gray.copy()
75 |
76 |         if stub_path is not None:
77 |             with open(stub_path,'wb') as f:
78 |                 pickle.dump(camera_movement,f)
79 |
80 |         return camera_movement
81 |
82 |     def draw_camera_movement(self,frames, camera_movement_per_frame):
83 |         output_frames=[]
84 |
85 |         for frame_num, frame in enumerate(frames):
86 |             frame= frame.copy()
87 |
88 |             overlay = frame.copy()
89 |             cv2.rectangle(overlay,(0,0),(500,100),(255,255,255),-1)
90 |             alpha =0.6
91 |             cv2.addWeighted(overlay,alpha,frame,1-alpha,0,frame)
92 |
93 |             x_movement, y_movement = camera_movement_per_frame[frame_num]
94 |             frame = cv2.putText(frame,f"Camera Movement X: {x_movement:.2f}",(10,30), cv2.FONT_HERSHEY_SIMPLEX,1,(0,0,0),3)
95 |             frame = cv2.putText(frame,f"Camera Movement Y: {y_movement:.2f}",(10,60), cv2.FONT_HERSHEY_SIMPLEX,1,(0,0,0),3)
96 |
97 |             output_frames.append(frame)
98 |
99 |         return output_frames
--------------------------------------------------------------------------------
/development_and_analysis/color_assignment.ipynb:
--------------------------------------------------------------------------------
1 | {
2 | "cells": [
3 | {
4 | "cell_type": "code",
5 | "execution_count": 1,
6 | "metadata": {},
7 | "outputs": [],
8 | "source": [
9 | "import cv2 \n",
10 | "import matplotlib.pyplot as plt\n",
11 | "import numpy as np\n",
12 | "from sklearn.cluster import KMeans"
13 | ]
14 | },
15 | {
16 | "cell_type": "code",
17 | "execution_count": 5,
18 | "metadata": {},
19 | "outputs": [],
20 | "source": [
21 | "image_path = '../output_images/player_2.jpg'\n",
22 | "image = cv2.imread(image_path)\n",
23 | "image = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)"
24 | ]
25 | },
26 | {
27 | "cell_type": "code",
28 | "execution_count": 6,
29 | "metadata": {},
30 | "outputs": [
31 | {
32 | "data": {
33 | "text/plain": [
34 | ""
35 | ]
36 | },
37 | "execution_count": 6,
38 | "metadata": {},
39 | "output_type": "execute_result"
40 | },
41 | {
42 | "data": {
43 | "image/png": "",
44 | "text/plain": [
45 | ""
46 | ]
47 | },
48 | "metadata": {},
49 | "output_type": "display_data"
50 | }
51 | ],
52 | "source": [
53 | "plt.imshow(image)"
54 | ]
55 | },
56 | {
57 | "cell_type": "markdown",
58 | "metadata": {},
59 | "source": [
60 | "#### We need only top half of the image"
61 | ]
62 | },
63 | {
64 | "cell_type": "code",
65 | "execution_count": 7,
66 | "metadata": {},
67 | "outputs": [
68 | {
69 | "data": {
70 | "text/plain": [
71 | ""
72 | ]
73 | },
74 | "execution_count": 7,
75 | "metadata": {},
76 | "output_type": "execute_result"
77 | },
78 | {
79 | "data": {
80 | "image/png": "",
81 | "text/plain": [
82 | ""
83 | ]
84 | },
85 | "metadata": {},
86 | "output_type": "display_data"
87 | }
88 | ],
89 | "source": [
90 | "top_half_image = image[0: int(image.shape[0]/2)]\n",
91 | "plt.imshow(top_half_image)"
92 | ]
93 | },
94 | {
95 | "cell_type": "markdown",
96 | "metadata": {},
97 | "source": [
98 | "#### Cluster the image into two clusters"
99 | ]
100 | },
101 | {
102 | "cell_type": "code",
103 | "execution_count": 8,
104 | "metadata": {},
105 | "outputs": [
106 | {
107 | "name": "stderr",
108 | "output_type": "stream",
109 | "text": [
110 | "c:\\ProgramData\\Anaconda3\\Lib\\site-packages\\sklearn\\cluster\\_kmeans.py:1412: FutureWarning: The default value of `n_init` will change from 10 to 'auto' in 1.4. Set the value of `n_init` explicitly to suppress the warning\n",
111 | " super()._check_params_vs_input(X, default_n_init=10)\n"
112 | ]
113 | },
114 | {
115 | "data": {
116 | "text/plain": [
117 | ""
118 | ]
119 | },
120 | "execution_count": 8,
121 | "metadata": {},
122 | "output_type": "execute_result"
123 | },
124 | {
125 | "data": {
126 | "image/png": "",
127 | "text/plain": [
128 | ""
129 | ]
130 | },
131 | "metadata": {},
132 | "output_type": "display_data"
133 | }
134 | ],
135 | "source": [
136 | "# Reshape the image into 2d array\n",
137 | "image_2d = top_half_image.reshape(-1, 3)\n",
138 | "\n",
139 | "# perform kmeans clustering\n",
140 | "Kmeans = KMeans(n_clusters=2, random_state=0, n_init=10).fit(image_2d)\n",
141 | "\n",
142 | "# get the cluster labels\n",
143 | "labels = Kmeans.labels_\n",
144 | "\n",
145 | "# reshape the labels to the original image shape\n",
146 | "clustered_image = labels.reshape(top_half_image.shape[0], top_half_image.shape[1])\n",
147 | "\n",
148 | "# Display the clustered image\n",
149 | "plt.imshow(clustered_image)"
150 | ]
151 | },
152 | {
153 | "cell_type": "code",
154 | "execution_count": 9,
155 | "metadata": {},
156 | "outputs": [
157 | {
158 | "name": "stdout",
159 | "output_type": "stream",
160 | "text": [
161 | "Non player cluster: 1\n"
162 | ]
163 | }
164 | ],
165 | "source": [
166 | "corner_clusters = [clustered_image[0, 0], clustered_image[0, -1], clustered_image[-1, 0], clustered_image[-1, -1]]\n",
167 | "non_player_cluster = max(set(corner_clusters), key=corner_clusters.count)\n",
168 | "print('Non player cluster:', non_player_cluster)"
169 | ]
170 | },
171 | {
172 | "cell_type": "code",
173 | "execution_count": 10,
174 | "metadata": {},
175 | "outputs": [
176 | {
177 | "name": "stdout",
178 | "output_type": "stream",
179 | "text": [
180 | "Player cluster: 0\n"
181 | ]
182 | }
183 | ],
184 | "source": [
185 | "player_cluster = 1-non_player_cluster\n",
186 | "print('Player cluster:', player_cluster)"
187 | ]
188 | },
189 | {
190 | "cell_type": "code",
191 | "execution_count": 11,
192 | "metadata": {},
193 | "outputs": [
194 | {
195 | "data": {
196 | "text/plain": [
197 | "array([171.1701847 , 235.37862797, 142.83641161])"
198 | ]
199 | },
200 | "execution_count": 11,
201 | "metadata": {},
202 | "output_type": "execute_result"
203 | }
204 | ],
205 | "source": [
206 | "Kmeans.cluster_centers_[player_cluster]"
207 | ]
208 | },
209 | {
210 | "cell_type": "code",
211 | "execution_count": null,
212 | "metadata": {},
213 | "outputs": [],
214 | "source": []
215 | }
216 | ],
217 | "metadata": {
218 | "kernelspec": {
219 | "display_name": "base",
220 | "language": "python",
221 | "name": "python3"
222 | },
223 | "language_info": {
224 | "codemirror_mode": {
225 | "name": "ipython",
226 | "version": 3
227 | },
228 | "file_extension": ".py",
229 | "mimetype": "text/x-python",
230 | "name": "python",
231 | "nbconvert_exporter": "python",
232 | "pygments_lexer": "ipython3",
233 | "version": "3.11.9"
234 | }
235 | },
236 | "nbformat": 4,
237 | "nbformat_minor": 2
238 | }
239 |
--------------------------------------------------------------------------------
/input_videos/NOTE.txt:
--------------------------------------------------------------------------------
1 | Put your input video here. main.py reads ./input_videos/match.mp4.
--------------------------------------------------------------------------------
/input_videos/match.mp4:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/rajveersinghcse/Football-Analysis-using-YOLO/5fd79c081a31d922ab30bc3d832d83d636008df6/input_videos/match.mp4
--------------------------------------------------------------------------------
/main.py:
--------------------------------------------------------------------------------
1 | from utils import read_video, save_video
2 | from trackers import Tracker
3 | import cv2
4 | import numpy as np
5 | from team_assigner import TeamAssigner
6 | from player_ball_assigner import PlayerBallAssigner
7 | from camera_movement_estimator import CameraMovementEstimator
8 | from view_transformer import ViewTransformer
9 | from speed_and_distance_estimator import SpeedAndDistance_Estimator
10 |
11 |
12 | def main():
13 |     # Read Video
14 |     video_frames = read_video("./input_videos/match.mp4")
15 |
16 |     # Initialize Tracker
17 |     tracker = Tracker("./models/best.pt")
18 |
19 |     tracks = tracker.get_object_tracks(
20 |         video_frames, read_from_stub=True, stub_path="stubs/track_stubs.pkl"
21 |     )
22 |     # Get object positions
23 |     tracker.add_position_to_tracks(tracks)
24 |
25 |     # Camera movement estimator
26 |     camera_movement_estimator = CameraMovementEstimator(video_frames[0])
27 |     camera_movement_per_frame = camera_movement_estimator.get_camera_movement(
28 |         video_frames, read_from_stub=True, stub_path="stubs/camera_movement_stub.pkl"
29 |     )
30 |     camera_movement_estimator.add_adjust_positions_to_tracks(
31 |         tracks, camera_movement_per_frame
32 |     )
33 |
34 |     # View Transformer
35 |     view_transformer = ViewTransformer()
36 |     view_transformer.add_transformed_position_to_tracks(tracks)
37 |
38 |     # Interpolate Ball Positions
39 |     tracks["ball"] = tracker.interpolate_ball_positions(tracks["ball"])
40 |
41 |     # Speed and distance estimator
42 |     speed_and_distance_estimator = SpeedAndDistance_Estimator()
43 |     speed_and_distance_estimator.add_speed_and_distance_to_tracks(tracks)
44 |
45 |     # Assign Player Teams
46 |     team_assigner = TeamAssigner()
47 |     team_assigner.assign_team_color(video_frames[0], tracks["players"][0])
48 |
49 |     for frame_num, player_track in enumerate(tracks["players"]):
50 |         for player_id, track in player_track.items():
51 |             team = team_assigner.get_player_team(
52 |                 video_frames[frame_num], track["bbox"], player_id
53 |             )
54 |             tracks["players"][frame_num][player_id]["team"] = team
55 |             tracks["players"][frame_num][player_id]["team_color"] = (
56 |                 team_assigner.team_colors[team]
57 |             )
58 |
59 |     # Assign Ball Acquisition
60 |     player_assigner = PlayerBallAssigner()
61 |     team_ball_control = []
62 |     for frame_num, player_track in enumerate(tracks["players"]):
63 |         ball_bbox = tracks["ball"][frame_num][1]["bbox"]
64 |         assigned_player = player_assigner.assign_ball_to_player(player_track, ball_bbox)
65 |
66 |         if assigned_player != -1:
67 |             tracks["players"][frame_num][assigned_player]["has_ball"] = True
68 |             team_ball_control.append(
69 |                 tracks["players"][frame_num][assigned_player]["team"]
70 |             )
71 |         else:
72 |             team_ball_control.append(team_ball_control[-1])
73 |     team_ball_control = np.array(team_ball_control)
74 |
75 |     # Draw output
76 |     ## Draw object Tracks
77 |     output_video_frames = tracker.draw_annotations(
78 |         video_frames, tracks, team_ball_control
79 |     )
80 |
81 |     ## Draw Camera movement
82 |     output_video_frames = camera_movement_estimator.draw_camera_movement(
83 |         output_video_frames, camera_movement_per_frame
84 |     )
85 |
86 |     ## Draw Speed and Distance
87 |     speed_and_distance_estimator.draw_speed_and_distance(output_video_frames, tracks)
88 |
89 |     # Save video
90 |     save_video(output_video_frames, "./output_videos/output_video.avi")
91 |
92 |
93 | if __name__ == "__main__":
94 |     main()
95 |
--------------------------------------------------------------------------------
/models/NOTE.txt:
--------------------------------------------------------------------------------
1 | Run training/football_training_yolo_v5.ipynb to train the models. The notebook saves them in the training folder; main.py expects the weights at ./models/best.pt.
--------------------------------------------------------------------------------
/output_images/player_2.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/rajveersinghcse/Football-Analysis-using-YOLO/5fd79c081a31d922ab30bc3d832d83d636008df6/output_images/player_2.jpg
--------------------------------------------------------------------------------
/output_videos/NOTE.txt:
--------------------------------------------------------------------------------
1 | The output video is saved here after you add an input video and run main.py.
--------------------------------------------------------------------------------
/player_ball_assigner/__init__.py:
--------------------------------------------------------------------------------
1 | from .player_ball_assigner import PlayerBallAssigner
--------------------------------------------------------------------------------
/player_ball_assigner/player_ball_assigner.py:
--------------------------------------------------------------------------------
1 | import sys
2 | sys.path.append('../')
3 | from utils import get_center_of_bbox, measure_distance
4 |
5 | class PlayerBallAssigner():
6 |     def __init__(self):
7 |         self.max_player_ball_distance = 70
8 |
9 |     def assign_ball_to_player(self,players,ball_bbox):
10 |         ball_position = get_center_of_bbox(ball_bbox)
11 |
12 |         minimum_distance = 99999
13 |         assigned_player=-1
14 |
15 |         for player_id, player in players.items():
16 |             player_bbox = player['bbox']
17 |
18 |             distance_left = measure_distance((player_bbox[0],player_bbox[-1]),ball_position)
19 |             distance_right = measure_distance((player_bbox[2],player_bbox[-1]),ball_position)
20 |             distance = min(distance_left,distance_right)
21 |
22 |             if distance < self.max_player_ball_distance:
23 |                 if distance < minimum_distance:
24 |                     minimum_distance = distance
25 |                     assigned_player = player_id
26 |
27 |         return assigned_player
--------------------------------------------------------------------------------
/requirements.txt:
--------------------------------------------------------------------------------
1 | ultralytics
2 | numpy
3 | opencv-python
4 | roboflow
5 | pandas
6 | supervision
7 | scikit-learn
8 |
--------------------------------------------------------------------------------
/speed_and_distance_estimator/__init__.py:
--------------------------------------------------------------------------------
1 | from .speed_and_distance_estimator import SpeedAndDistance_Estimator
--------------------------------------------------------------------------------
/speed_and_distance_estimator/speed_and_distance_estimator.py:
--------------------------------------------------------------------------------
1 | import cv2
2 | import sys
3 | sys.path.append('../')
4 | from utils import measure_distance ,get_foot_position
5 |
6 | class SpeedAndDistance_Estimator():
7 |     def __init__(self):
8 |         self.frame_window=5
9 |         self.frame_rate=24
10 |
11 |     def add_speed_and_distance_to_tracks(self,tracks):
12 |         total_distance= {}
13 |
14 |         for object, object_tracks in tracks.items():
15 |             if object == "ball" or object == "referees":
16 |                 continue
17 |             number_of_frames = len(object_tracks)
18 |             for frame_num in range(0,number_of_frames, self.frame_window):
19 |                 last_frame = min(frame_num+self.frame_window,number_of_frames-1 )
20 |                 if last_frame == frame_num:
21 |                     # Final window spans a single frame: no time elapsed, so skip it to avoid dividing by zero
22 |                     continue
23 |
24 |                 for track_id,_ in object_tracks[frame_num].items():
25 |                     if track_id not in object_tracks[last_frame]:
26 |                         continue
27 |
28 |                     start_position = object_tracks[frame_num][track_id]['position_transformed']
29 |                     end_position = object_tracks[last_frame][track_id]['position_transformed']
30 |
31 |                     if start_position is None or end_position is None:
32 |                         continue
33 |
34 |                     distance_covered = measure_distance(start_position,end_position)
35 |                     time_elapsed = (last_frame-frame_num)/self.frame_rate
36 |                     speed_meters_per_second = distance_covered/time_elapsed
37 |                     speed_km_per_hour = speed_meters_per_second*3.6
38 |
39 |                     if object not in total_distance:
40 |                         total_distance[object]= {}
41 |
42 |                     if track_id not in total_distance[object]:
43 |                         total_distance[object][track_id] = 0
44 |
45 |                     total_distance[object][track_id] += distance_covered
46 |
47 |                     for frame_num_batch in range(frame_num,last_frame):
48 |                         if track_id not in tracks[object][frame_num_batch]:
49 |                             continue
50 |                         tracks[object][frame_num_batch][track_id]['speed'] = speed_km_per_hour
51 |                         tracks[object][frame_num_batch][track_id]['distance'] = total_distance[object][track_id]
52 |
53 |     def draw_speed_and_distance(self,frames,tracks):
54 |         output_frames = []
55 |         for frame_num, frame in enumerate(frames):
56 |             for object, object_tracks in tracks.items():
57 |                 if object == "ball" or object == "referees":
58 |                     continue
59 |                 for _, track_info in object_tracks[frame_num].items():
60 |                     if "speed" in track_info:
61 |                         speed = track_info.get('speed',None)
62 |                         distance = track_info.get('distance',None)
63 |                         if speed is None or distance is None:
64 |                             continue
65 |
66 |                         bbox = track_info['bbox']
67 |                         position = get_foot_position(bbox)
68 |                         position = list(position)
69 |                         position[1]+=40
70 |
71 |                         position = tuple(map(int,position))
72 |                         cv2.putText(frame, f"{speed:.2f} km/h",position,cv2.FONT_HERSHEY_SIMPLEX,0.5,(0,0,0),2)
73 |                         cv2.putText(frame, f"{distance:.2f} m",(position[0],position[1]+20),cv2.FONT_HERSHEY_SIMPLEX,0.5,(0,0,0),2)
74 |             output_frames.append(frame)
75 |
76 |         return output_frames
--------------------------------------------------------------------------------
/stubs/camera_movement_stub.pkl:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/rajveersinghcse/Football-Analysis-using-YOLO/5fd79c081a31d922ab30bc3d832d83d636008df6/stubs/camera_movement_stub.pkl
--------------------------------------------------------------------------------
/stubs/track_stubs.pkl:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/rajveersinghcse/Football-Analysis-using-YOLO/5fd79c081a31d922ab30bc3d832d83d636008df6/stubs/track_stubs.pkl
--------------------------------------------------------------------------------
/team_assigner/__init__.py:
--------------------------------------------------------------------------------
1 | from .team_assigner import TeamAssigner
--------------------------------------------------------------------------------
/team_assigner/team_assigner.py:
--------------------------------------------------------------------------------
1 | from sklearn.cluster import KMeans
2 |
3 | class TeamAssigner:
4 |     def __init__(self):
5 |         self.team_colors = {}
6 |         self.player_team_dict = {}
7 |
8 |     def get_clustering_model(self,image):
9 |         # Reshape the image to 2D array
10 |         image_2d = image.reshape(-1,3)
11 |
12 |         # Perform K-means with 2 clusters
13 |         kmeans = KMeans(n_clusters=2, init="k-means++",n_init=1).fit(image_2d)
14 |
15 |         return kmeans
16 |
17 |     def get_player_color(self,frame,bbox):
18 |         image = frame[int(bbox[1]):int(bbox[3]),int(bbox[0]):int(bbox[2])]
19 |
20 |         top_half_image = image[0:int(image.shape[0]/2),:]
21 |
22 |         # Get Clustering model
23 |         kmeans = self.get_clustering_model(top_half_image)
24 |
25 |         # Get the cluster labels for each pixel
26 |         labels = kmeans.labels_
27 |
28 |         # Reshape the labels to the image shape
29 |         clustered_image = labels.reshape(top_half_image.shape[0],top_half_image.shape[1])
30 |
31 |         # Get the player cluster
32 |         corner_clusters = [clustered_image[0,0],clustered_image[0,-1],clustered_image[-1,0],clustered_image[-1,-1]]
33 |         non_player_cluster = max(set(corner_clusters),key=corner_clusters.count)
34 |         player_cluster = 1 - non_player_cluster
35 |
36 |         player_color = kmeans.cluster_centers_[player_cluster]
37 |
38 |         return player_color
39 |
40 |
41 |     def assign_team_color(self,frame, player_detections):
42 |
43 |         player_colors = []
44 |         for _, player_detection in player_detections.items():
45 |             bbox = player_detection["bbox"]
46 |             player_color = self.get_player_color(frame,bbox)
47 |             player_colors.append(player_color)
48 |
49 |         kmeans = KMeans(n_clusters=2, init="k-means++",n_init=10).fit(player_colors)
50 |
51 |         self.kmeans = kmeans
52 |
53 |         self.team_colors[1] = kmeans.cluster_centers_[0]
54 |         self.team_colors[2] = kmeans.cluster_centers_[1]
55 |
56 |
57 |     def get_player_team(self,frame,player_bbox,player_id):
58 |         if player_id in self.player_team_dict:
59 |             return self.player_team_dict[player_id]
60 |
61 |         player_color = self.get_player_color(frame,player_bbox)
62 |
63 |         team_id = self.kmeans.predict(player_color.reshape(1,-1))[0]
64 |         team_id+=1
65 |
66 |         # Hard-coded override: track id 91 is always assigned to team 1
67 |         if player_id == 91:
68 |             team_id=1
69 |
70 |         self.player_team_dict[player_id] = team_id
71 |
72 |         return team_id
73 |
--------------------------------------------------------------------------------
/trackers/__init__.py:
--------------------------------------------------------------------------------
1 | from .tracker import Tracker
--------------------------------------------------------------------------------
/trackers/tracker.py:
--------------------------------------------------------------------------------
1 | from ultralytics import YOLO
2 | import supervision as sv
3 | import pickle
4 | import os
5 | import numpy as np
6 | import pandas as pd
7 | import cv2
8 | import sys
9 | sys.path.append('../')
10 | from utils import get_center_of_bbox, get_bbox_width, get_foot_position
11 |
12 | class Tracker:
13 | def __init__(self, model_path):
14 | self.model = YOLO(model_path)
15 | self.tracker = sv.ByteTrack()
16 |
17 | def add_position_to_tracks(sekf,tracks):
18 | for object, object_tracks in tracks.items():
19 | for frame_num, track in enumerate(object_tracks):
20 | for track_id, track_info in track.items():
21 | bbox = track_info['bbox']
22 | if object == 'ball':
23 | position= get_center_of_bbox(bbox)
24 | else:
25 | position = get_foot_position(bbox)
26 | tracks[object][frame_num][track_id]['position'] = position
27 |
28 | def interpolate_ball_positions(self,ball_positions):
29 | ball_positions = [x.get(1,{}).get('bbox',[]) for x in ball_positions]
30 | df_ball_positions = pd.DataFrame(ball_positions,columns=['x1','y1','x2','y2'])
31 |
32 | # Interpolate missing values
33 | df_ball_positions = df_ball_positions.interpolate()
34 | df_ball_positions = df_ball_positions.bfill()
35 |
36 | ball_positions = [{1: {"bbox":x}} for x in df_ball_positions.to_numpy().tolist()]
37 |
38 | return ball_positions
39 |
40 | def detect_frames(self, frames):
41 | batch_size=20
42 | detections = []
43 | for i in range(0,len(frames),batch_size):
44 | detections_batch = self.model.predict(frames[i:i+batch_size],conf=0.1)
45 | detections += detections_batch
46 | return detections
47 |
48 | def get_object_tracks(self, frames, read_from_stub=False, stub_path=None):
49 |
50 | if read_from_stub and stub_path is not None and os.path.exists(stub_path):
51 | with open(stub_path,'rb') as f:
52 | tracks = pickle.load(f)
53 | return tracks
54 |
55 | detections = self.detect_frames(frames)
56 |
57 | tracks={
58 | "players":[],
59 | "referees":[],
60 | "ball":[]
61 | }
62 |
63 | for frame_num, detection in enumerate(detections):
64 | cls_names = detection.names
65 | cls_names_inv = {v:k for k,v in cls_names.items()}
66 |
67 | # Covert to supervision Detection format
68 | detection_supervision = sv.Detections.from_ultralytics(detection)
69 |
70 | # Convert GoalKeeper to player object
71 | for object_ind , class_id in enumerate(detection_supervision.class_id):
72 | if cls_names[class_id] == "goalkeeper":
73 | detection_supervision.class_id[object_ind] = cls_names_inv["player"]
74 |
75 | # Track Objects
76 | detection_with_tracks = self.tracker.update_with_detections(detection_supervision)
77 |
78 | tracks["players"].append({})
79 | tracks["referees"].append({})
80 | tracks["ball"].append({})
81 |
82 | for frame_detection in detection_with_tracks:
83 | bbox = frame_detection[0].tolist()
84 | cls_id = frame_detection[3]
85 | track_id = frame_detection[4]
86 |
87 | if cls_id == cls_names_inv['player']:
88 | tracks["players"][frame_num][track_id] = {"bbox":bbox}
89 |
90 | if cls_id == cls_names_inv['referee']:
91 | tracks["referees"][frame_num][track_id] = {"bbox":bbox}
92 |
93 | for frame_detection in detection_supervision:
94 | bbox = frame_detection[0].tolist()
95 | cls_id = frame_detection[3]
96 |
97 | if cls_id == cls_names_inv['ball']:
98 | tracks["ball"][frame_num][1] = {"bbox":bbox}
99 |
100 | if stub_path is not None:
101 | with open(stub_path,'wb') as f:
102 | pickle.dump(tracks,f)
103 |
104 | return tracks
105 |
106 | def draw_ellipse(self,frame,bbox,color,track_id=None):
107 | y2 = int(bbox[3])
108 | x_center, _ = get_center_of_bbox(bbox)
109 | width = get_bbox_width(bbox)
110 |
111 | cv2.ellipse(
112 | frame,
113 | center=(x_center,y2),
114 | axes=(int(width), int(0.35*width)),
115 | angle=0.0,
116 | startAngle=-45,
117 | endAngle=235,
118 | color = color,
119 | thickness=2,
120 | lineType=cv2.LINE_4
121 | )
122 |
123 | rectangle_width = 40
124 | rectangle_height=20
125 | x1_rect = x_center - rectangle_width//2
126 | x2_rect = x_center + rectangle_width//2
127 | y1_rect = (y2- rectangle_height//2) +15
128 | y2_rect = (y2+ rectangle_height//2) +15
129 |
130 | if track_id is not None:
131 | cv2.rectangle(frame,
132 | (int(x1_rect),int(y1_rect) ),
133 | (int(x2_rect),int(y2_rect)),
134 | color,
135 | cv2.FILLED)
136 |
137 | x1_text = x1_rect+12
138 | if track_id > 99:
139 | x1_text -=10
140 |
141 | cv2.putText(
142 | frame,
143 | f"{track_id}",
144 | (int(x1_text),int(y1_rect+15)),
145 | cv2.FONT_HERSHEY_SIMPLEX,
146 | 0.6,
147 | (0,0,0),
148 | 2
149 | )
150 |
151 | return frame
152 |
153 | def draw_traingle(self,frame,bbox,color):
154 | y= int(bbox[1])
155 | x,_ = get_center_of_bbox(bbox)
156 |
157 | triangle_points = np.array([
158 | [x,y],
159 | [x-10,y-20],
160 | [x+10,y-20],
161 | ])
162 | cv2.drawContours(frame, [triangle_points],0,color, cv2.FILLED)
163 | cv2.drawContours(frame, [triangle_points],0,(0,0,0), 2)
164 |
165 | return frame
166 |
167 | def draw_team_ball_control(self,frame,frame_num,team_ball_control):
168 | # Draw a semi-transparent rectangle
169 | overlay = frame.copy()
170 | cv2.rectangle(overlay, (1350, 850), (1900,970), (255,255,255), -1 )
171 | alpha = 0.4
172 | cv2.addWeighted(overlay, alpha, frame, 1 - alpha, 0, frame)
173 |
174 | team_ball_control_till_frame = team_ball_control[:frame_num+1]
175 | # Get the number of frames in which each team had ball control
176 | team_1_num_frames = team_ball_control_till_frame[team_ball_control_till_frame==1].shape[0]
177 | team_2_num_frames = team_ball_control_till_frame[team_ball_control_till_frame==2].shape[0]
178 | team_1 = team_1_num_frames/max(team_1_num_frames+team_2_num_frames, 1)  # max(..., 1) avoids division by zero before either team has had the ball
179 | team_2 = team_2_num_frames/max(team_1_num_frames+team_2_num_frames, 1)
180 |
181 | cv2.putText(frame, f"Team 1 Ball Control: {team_1*100:.2f}%",(1400,900), cv2.FONT_HERSHEY_SIMPLEX, 1, (0,0,0), 3)
182 | cv2.putText(frame, f"Team 2 Ball Control: {team_2*100:.2f}%",(1400,950), cv2.FONT_HERSHEY_SIMPLEX, 1, (0,0,0), 3)
183 |
184 | return frame
185 |
186 | def draw_annotations(self,video_frames, tracks,team_ball_control):
187 | output_video_frames= []
188 | for frame_num, frame in enumerate(video_frames):
189 | frame = frame.copy()
190 |
191 | player_dict = tracks["players"][frame_num]
192 | ball_dict = tracks["ball"][frame_num]
193 | referee_dict = tracks["referees"][frame_num]
194 |
195 | # Draw Players
196 | for track_id, player in player_dict.items():
197 | color = player.get("team_color",(0,0,255))
198 | frame = self.draw_ellipse(frame, player["bbox"],color, track_id)
199 |
200 | if player.get('has_ball',False):
201 | frame = self.draw_traingle(frame, player["bbox"],(0,0,255))
202 |
203 | # Draw Referee
204 | for _, referee in referee_dict.items():
205 | frame = self.draw_ellipse(frame, referee["bbox"],(0,255,255))
206 |
207 | # Draw ball
208 | for track_id, ball in ball_dict.items():
209 | frame = self.draw_traingle(frame, ball["bbox"],(0,255,0))
210 |
211 |
212 | # Draw Team Ball Control
213 | frame = self.draw_team_ball_control(frame, frame_num, team_ball_control)
214 |
215 | output_video_frames.append(frame)
216 |
217 | return output_video_frames
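
Note on draw_team_ball_control: the possession percentages it draws are each team's share of the frames seen so far in which one of the two teams held the ball. The sketch below isolates that calculation so it can be checked without OpenCV. The helper name `ball_control_share` is hypothetical and not part of this repository, and returning 0.0 for both teams when neither has had the ball yet is an assumption rather than the author's stated behavior.

```python
def ball_control_share(team_ball_control, frame_num):
    """Return (team_1_share, team_2_share) for frames 0..frame_num.

    Hypothetical helper, not part of the repository. `team_ball_control`
    is any sequence of per-frame team ids (1 or 2; other values are ignored).
    """
    control_so_far = list(team_ball_control)[: frame_num + 1]
    team_1_frames = sum(1 for team in control_so_far if team == 1)
    team_2_frames = sum(1 for team in control_so_far if team == 2)
    total = team_1_frames + team_2_frames
    if total == 0:
        # Assumption: report 0% for both teams instead of dividing by zero.
        return 0.0, 0.0
    return team_1_frames / total, team_2_frames / total


# Example: frame 0 has no owner, team 1 holds the ball for two frames, team 2 for one.
shares = ball_control_share([0, 1, 1, 2], frame_num=3)
print(f"Team 1 Ball Control: {shares[0] * 100:.2f}%")
print(f"Team 2 Ball Control: {shares[1] * 100:.2f}%")
```

The same function accepts the NumPy array that draw_annotations passes around, since it only iterates over the values.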
--------------------------------------------------------------------------------
/training/football_training_yolo_v5.ipynb:
--------------------------------------------------------------------------------
1 | {
2 | "cells": [
3 | {
4 | "cell_type": "code",
5 | "execution_count": 2,
6 | "metadata": {
7 | "id": "5_sdU4m-d7dK"
8 | },
9 | "outputs": [],
10 | "source": [
11 | "import ultralytics\n",
12 | "from roboflow import Roboflow"
13 | ]
14 | },
15 | {
16 | "cell_type": "code",
17 | "execution_count": 3,
18 | "metadata": {
19 | "colab": {
20 | "base_uri": "https://localhost:8080/"
21 | },
22 | "id": "MzQwV91LdslB",
23 | "outputId": "5a73340a-7a6f-44d9-af68-7fc48d7ff953"
24 | },
25 | "outputs": [
26 | {
27 | "name": "stdout",
28 | "output_type": "stream",
29 | "text": [
30 | "loading Roboflow workspace...\n",
31 | "loading Roboflow project...\n"
32 | ]
33 | },
34 | {
35 | "name": "stderr",
36 | "output_type": "stream",
37 | "text": [
38 | "Downloading Dataset Version Zip in football-players-detection-1 to yolov5pytorch:: 100%|██████████| 148663/148663 [00:02<00:00, 66787.03it/s]"
39 | ]
40 | },
41 | {
42 | "name": "stdout",
43 | "output_type": "stream",
44 | "text": [
45 | "\n"
46 | ]
47 | },
48 | {
49 | "name": "stderr",
50 | "output_type": "stream",
51 | "text": [
52 | "\n",
53 | "Extracting Dataset Version Zip to football-players-detection-1 in yolov5pytorch:: 100%|██████████| 1338/1338 [00:00<00:00, 2278.21it/s]\n"
54 | ]
55 | }
56 | ],
57 | "source": [
58 | "rf = Roboflow(api_key=\"\")\n",
59 | "project = rf.workspace(\"roboflow-jvuqo\").project(\"football-players-detection-3zvbc\")\n",
60 | "version = project.version(1)\n",
61 | "dataset = version.download(\"yolov5\")"
62 | ]
63 | },
64 | {
65 | "cell_type": "code",
66 | "execution_count": 4,
67 | "metadata": {
68 | "colab": {
69 | "base_uri": "https://localhost:8080/",
70 | "height": 35
71 | },
72 | "id": "IyXs_n72dslD",
73 | "outputId": "8ea1de3e-5280-4fa2-8d10-6b59a03f7bbf"
74 | },
75 | "outputs": [
76 | {
77 | "data": {
78 | "application/vnd.google.colaboratory.intrinsic+json": {
79 | "type": "string"
80 | },
81 | "text/plain": [
82 | "'/content/football-players-detection-1'"
83 | ]
84 | },
85 | "execution_count": 4,
86 | "metadata": {},
87 | "output_type": "execute_result"
88 | }
89 | ],
90 | "source": [
91 | "dataset.location"
92 | ]
93 | },
94 | {
95 | "cell_type": "code",
96 | "execution_count": 5,
97 | "metadata": {
98 | "colab": {
99 | "base_uri": "https://localhost:8080/",
100 | "height": 35
101 | },
102 | "id": "tqk2jYzHdslD",
103 | "outputId": "bc81d4c4-8200-4dae-d554-59c35587127a"
104 | },
105 | "outputs": [
106 | {
107 | "data": {
108 | "application/vnd.google.colaboratory.intrinsic+json": {
109 | "type": "string"
110 | },
111 | "text/plain": [
112 | "'./football-players-detection-1/football-players-detection-1/test'"
113 | ]
114 | },
115 | "execution_count": 5,
116 | "metadata": {},
117 | "output_type": "execute_result"
118 | }
119 | ],
120 | "source": [
121 | "import shutil\n",
122 | "\n",
123 | "shutil.move('./football-players-detection-1/train', './football-players-detection-1/football-players-detection-1/train')\n",
124 | "shutil.move('./football-players-detection-1/valid', './football-players-detection-1/football-players-detection-1/valid')\n",
125 | "shutil.move('./football-players-detection-1/test', './football-players-detection-1/football-players-detection-1/test')"
126 | ]
127 | },
128 | {
129 | "cell_type": "markdown",
130 | "metadata": {
131 | "id": "iuBKL0DJdslE"
132 | },
133 | "source": [
134 | "## Training"
135 | ]
136 | },
137 | {
138 | "cell_type": "code",
139 | "execution_count": 6,
140 | "metadata": {
141 | "colab": {
142 | "base_uri": "https://localhost:8080/"
143 | },
144 | "id": "X6d6DMGrdslG",
145 | "outputId": "a4d3912a-fc10-4a81-9f2b-1f743e331d30"
146 | },
147 | "outputs": [
148 | {
149 | "name": "stdout",
150 | "output_type": "stream",
151 | "text": [
152 | "PRO TIP 💡 Replace 'model=yolov5l.pt' with new 'model=yolov5lu.pt'.\n",
153 | "YOLOv5 'u' models are trained with https://github.com/ultralytics/ultralytics and feature improved performance vs standard YOLOv5 models trained with https://github.com/ultralytics/yolov5.\n",
154 | "\n",
155 | "Downloading https://github.com/ultralytics/assets/releases/download/v8.2.0/yolov5lu.pt to 'yolov5lu.pt'...\n",
156 | "100% 102M/102M [00:00<00:00, 292MB/s] \n",
157 | "Ultralytics YOLOv8.2.2 🚀 Python-3.10.12 torch-2.2.1+cu121 CUDA:0 (Tesla T4, 15102MiB)\n",
158 | "\u001b[34m\u001b[1mengine/trainer: \u001b[0mtask=detect, mode=train, model=yolov5l.pt, data=/content/football-players-detection-1/data.yaml, epochs=100, time=None, patience=100, batch=16, imgsz=640, save=True, save_period=-1, cache=False, device=None, workers=8, project=None, name=train, exist_ok=False, pretrained=True, optimizer=auto, verbose=True, seed=0, deterministic=True, single_cls=False, rect=False, cos_lr=False, close_mosaic=10, resume=False, amp=True, fraction=1.0, profile=False, freeze=None, multi_scale=False, overlap_mask=True, mask_ratio=4, dropout=0.0, val=True, split=val, save_json=False, save_hybrid=False, conf=None, iou=0.7, max_det=300, half=False, dnn=False, plots=True, source=None, vid_stride=1, stream_buffer=False, visualize=False, augment=False, agnostic_nms=False, classes=None, retina_masks=False, embed=None, show=False, save_frames=False, save_txt=False, save_conf=False, save_crop=False, show_labels=True, show_conf=True, show_boxes=True, line_width=None, format=torchscript, keras=False, optimize=False, int8=False, dynamic=False, simplify=False, opset=None, workspace=4, nms=False, lr0=0.01, lrf=0.01, momentum=0.937, weight_decay=0.0005, warmup_epochs=3.0, warmup_momentum=0.8, warmup_bias_lr=0.1, box=7.5, cls=0.5, dfl=1.5, pose=12.0, kobj=1.0, label_smoothing=0.0, nbs=64, hsv_h=0.015, hsv_s=0.7, hsv_v=0.4, degrees=0.0, translate=0.1, scale=0.5, shear=0.0, perspective=0.0, flipud=0.0, fliplr=0.5, bgr=0.0, mosaic=1.0, mixup=0.0, copy_paste=0.0, auto_augment=randaugment, erasing=0.4, crop_fraction=1.0, cfg=None, tracker=botsort.yaml, save_dir=runs/detect/train\n",
159 | "Downloading https://ultralytics.com/assets/Arial.ttf to '/root/.config/Ultralytics/Arial.ttf'...\n",
160 | "100% 755k/755k [00:00<00:00, 26.5MB/s]\n",
161 | "2024-04-25 14:25:13.387233: E external/local_xla/xla/stream_executor/cuda/cuda_dnn.cc:9261] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered\n",
162 | "2024-04-25 14:25:13.387290: E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:607] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered\n",
163 | "2024-04-25 14:25:13.389213: E external/local_xla/xla/stream_executor/cuda/cuda_blas.cc:1515] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered\n",
164 | "Overriding model.yaml nc=80 with nc=4\n",
165 | "\n",
166 | " from n params module arguments \n",
167 | " 0 -1 1 7040 ultralytics.nn.modules.conv.Conv [3, 64, 6, 2, 2] \n",
168 | " 1 -1 1 73984 ultralytics.nn.modules.conv.Conv [64, 128, 3, 2] \n",
169 | " 2 -1 3 156928 ultralytics.nn.modules.block.C3 [128, 128, 3] \n",
170 | " 3 -1 1 295424 ultralytics.nn.modules.conv.Conv [128, 256, 3, 2] \n",
171 | " 4 -1 6 1118208 ultralytics.nn.modules.block.C3 [256, 256, 6] \n",
172 | " 5 -1 1 1180672 ultralytics.nn.modules.conv.Conv [256, 512, 3, 2] \n",
173 | " 6 -1 9 6433792 ultralytics.nn.modules.block.C3 [512, 512, 9] \n",
174 | " 7 -1 1 4720640 ultralytics.nn.modules.conv.Conv [512, 1024, 3, 2] \n",
175 | " 8 -1 3 9971712 ultralytics.nn.modules.block.C3 [1024, 1024, 3] \n",
176 | " 9 -1 1 2624512 ultralytics.nn.modules.block.SPPF [1024, 1024, 5] \n",
177 | " 10 -1 1 525312 ultralytics.nn.modules.conv.Conv [1024, 512, 1, 1] \n",
178 | " 11 -1 1 0 torch.nn.modules.upsampling.Upsample [None, 2, 'nearest'] \n",
179 | " 12 [-1, 6] 1 0 ultralytics.nn.modules.conv.Concat [1] \n",
180 | " 13 -1 3 2757632 ultralytics.nn.modules.block.C3 [1024, 512, 3, False] \n",
181 | " 14 -1 1 131584 ultralytics.nn.modules.conv.Conv [512, 256, 1, 1] \n",
182 | " 15 -1 1 0 torch.nn.modules.upsampling.Upsample [None, 2, 'nearest'] \n",
183 | " 16 [-1, 4] 1 0 ultralytics.nn.modules.conv.Concat [1] \n",
184 | " 17 -1 3 690688 ultralytics.nn.modules.block.C3 [512, 256, 3, False] \n",
185 | " 18 -1 1 590336 ultralytics.nn.modules.conv.Conv [256, 256, 3, 2] \n",
186 | " 19 [-1, 14] 1 0 ultralytics.nn.modules.conv.Concat [1] \n",
187 | " 20 -1 3 2495488 ultralytics.nn.modules.block.C3 [512, 512, 3, False] \n",
188 | " 21 -1 1 2360320 ultralytics.nn.modules.conv.Conv [512, 512, 3, 2] \n",
189 | " 22 [-1, 10] 1 0 ultralytics.nn.modules.conv.Concat [1] \n",
190 | " 23 -1 3 9971712 ultralytics.nn.modules.block.C3 [1024, 1024, 3, False] \n",
191 | " 24 [17, 20, 23] 1 7060444 ultralytics.nn.modules.head.Detect [4, [256, 512, 1024]] \n",
192 | "YOLOv5l summary: 416 layers, 53166428 parameters, 53166412 gradients, 135.3 GFLOPs\n",
193 | "\n",
194 | "Transferred 685/691 items from pretrained weights\n",
195 | "\u001b[34m\u001b[1mTensorBoard: \u001b[0mStart with 'tensorboard --logdir runs/detect/train', view at http://localhost:6006/\n",
196 | "Freezing layer 'model.24.dfl.conv.weight'\n",
197 | "\u001b[34m\u001b[1mAMP: \u001b[0mrunning Automatic Mixed Precision (AMP) checks with YOLOv8n...\n",
198 | "Downloading https://github.com/ultralytics/assets/releases/download/v8.2.0/yolov8n.pt to 'yolov8n.pt'...\n",
199 | "100% 6.23M/6.23M [00:00<00:00, 105MB/s]\n",
200 | "\u001b[34m\u001b[1mAMP: \u001b[0mchecks passed ✅\n",
201 | "\u001b[34m\u001b[1mtrain: \u001b[0mScanning /content/football-players-detection-1/football-players-detection-1/train/labels... 612 images, 0 backgrounds, 0 corrupt: 100% 612/612 [00:00<00:00, 1805.13it/s]\n",
202 | "\u001b[34m\u001b[1mtrain: \u001b[0mNew cache created: /content/football-players-detection-1/football-players-detection-1/train/labels.cache\n",
203 | "\u001b[34m\u001b[1malbumentations: \u001b[0mBlur(p=0.01, blur_limit=(3, 7)), MedianBlur(p=0.01, blur_limit=(3, 7)), ToGray(p=0.01), CLAHE(p=0.01, clip_limit=(1, 4.0), tile_grid_size=(8, 8))\n",
204 | "/usr/lib/python3.10/multiprocessing/popen_fork.py:66: RuntimeWarning: os.fork() was called. os.fork() is incompatible with multithreaded code, and JAX is multithreaded, so this will likely lead to a deadlock.\n",
205 | " self.pid = os.fork()\n",
206 | "\u001b[34m\u001b[1mval: \u001b[0mScanning /content/football-players-detection-1/football-players-detection-1/valid/labels... 38 images, 0 backgrounds, 0 corrupt: 100% 38/38 [00:00<00:00, 1101.57it/s]\n",
207 | "\u001b[34m\u001b[1mval: \u001b[0mNew cache created: /content/football-players-detection-1/football-players-detection-1/valid/labels.cache\n",
208 | "Plotting labels to runs/detect/train/labels.jpg... \n",
209 | "\u001b[34m\u001b[1moptimizer:\u001b[0m 'optimizer=auto' found, ignoring 'lr0=0.01' and 'momentum=0.937' and determining best 'optimizer', 'lr0' and 'momentum' automatically... \n",
210 | "\u001b[34m\u001b[1moptimizer:\u001b[0m AdamW(lr=0.00125, momentum=0.9) with parameter groups 113 weight(decay=0.0), 120 weight(decay=0.0005), 119 bias(decay=0.0)\n",
211 | "\u001b[34m\u001b[1mTensorBoard: \u001b[0mmodel graph visualization added ✅\n",
212 | "Image sizes 640 train, 640 val\n",
213 | "Using 2 dataloader workers\n",
214 | "Logging results to \u001b[1mruns/detect/train\u001b[0m\n",
215 | "Starting training for 100 epochs...\n",
216 | "\n",
217 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
218 | " 1/100 10.8G 1.266 1.486 0.8403 194 640: 100% 39/39 [00:41<00:00, 1.07s/it]\n",
219 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:03<00:00, 1.54s/it]\n",
220 | " all 38 905 0.274 0.285 0.219 0.124\n",
221 | "\n",
222 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
223 | " 2/100 10.4G 1.16 0.7307 0.8124 235 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
224 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.60it/s]\n",
225 | " all 38 905 0.622 0.516 0.547 0.324\n",
226 | "\n",
227 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
228 | " 3/100 10.4G 1.234 0.6746 0.8196 219 640: 100% 39/39 [00:30<00:00, 1.27it/s]\n",
229 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.40it/s]\n",
230 | " all 38 905 0.692 0.536 0.545 0.289\n",
231 | "\n",
232 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
233 | " 4/100 10.4G 1.22 0.6634 0.8191 77 640: 100% 39/39 [00:30<00:00, 1.27it/s]\n",
234 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.41it/s]\n",
235 | " all 38 905 0.757 0.682 0.689 0.39\n",
236 | "\n",
237 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
238 | " 5/100 10.4G 1.155 0.6372 0.812 177 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
239 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:01<00:00, 1.82it/s]\n",
240 | " all 38 905 0.631 0.727 0.633 0.399\n",
241 | "\n",
242 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
243 | " 6/100 10.4G 1.083 0.6184 0.8074 178 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
244 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.59it/s]\n",
245 | " all 38 905 0.743 0.683 0.695 0.426\n",
246 | "\n",
247 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
248 | " 7/100 10.5G 1.102 0.6218 0.8087 104 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
249 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.43it/s]\n",
250 | " all 38 905 0.848 0.645 0.748 0.458\n",
251 | "\n",
252 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
253 | " 8/100 10.4G 1.035 0.5491 0.8034 234 640: 100% 39/39 [00:30<00:00, 1.30it/s]\n",
254 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.32it/s]\n",
255 | " all 38 905 0.889 0.644 0.734 0.467\n",
256 | "\n",
257 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
258 | " 9/100 10.7G 1.08 0.5593 0.808 170 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
259 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.53it/s]\n",
260 | " all 38 905 0.836 0.641 0.708 0.455\n",
261 | "\n",
262 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
263 | " 10/100 10.8G 1.041 0.5286 0.8014 203 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
264 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.46it/s]\n",
265 | " all 38 905 0.856 0.699 0.754 0.493\n",
266 | "\n",
267 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
268 | " 11/100 10.7G 1.083 0.5396 0.8054 132 640: 100% 39/39 [00:30<00:00, 1.27it/s]\n",
269 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.06it/s]\n",
270 | " all 38 905 0.878 0.686 0.76 0.471\n",
271 | "\n",
272 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
273 | " 12/100 10.7G 0.9954 0.5095 0.7989 121 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
274 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.67it/s]\n",
275 | " all 38 905 0.844 0.697 0.751 0.488\n",
276 | "\n",
277 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
278 | " 13/100 10.7G 1 0.487 0.7966 165 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
279 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.50it/s]\n",
280 | " all 38 905 0.858 0.692 0.745 0.493\n",
281 | "\n",
282 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
283 | " 14/100 10.7G 1.008 0.4831 0.7995 120 640: 100% 39/39 [00:30<00:00, 1.30it/s]\n",
284 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:01<00:00, 1.73it/s]\n",
285 | " all 38 905 0.891 0.698 0.762 0.521\n",
286 | "\n",
287 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
288 | " 15/100 10.7G 0.959 0.4669 0.7942 122 640: 100% 39/39 [00:29<00:00, 1.30it/s]\n",
289 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.62it/s]\n",
290 | " all 38 905 0.88 0.708 0.746 0.493\n",
291 | "\n",
292 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
293 | " 16/100 10.7G 0.9878 0.4772 0.7952 247 640: 100% 39/39 [00:29<00:00, 1.30it/s]\n",
294 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.62it/s]\n",
295 | " all 38 905 0.78 0.694 0.74 0.455\n",
296 | "\n",
297 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
298 | " 17/100 10.8G 0.9391 0.4605 0.7953 129 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
299 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.74it/s]\n",
300 | " all 38 905 0.876 0.66 0.758 0.496\n",
301 | "\n",
302 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
303 | " 18/100 10.7G 0.9693 0.4725 0.7935 222 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
304 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.71it/s]\n",
305 | " all 38 905 0.82 0.728 0.748 0.503\n",
306 | "\n",
307 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
308 | " 19/100 10.7G 0.8985 0.4436 0.7923 131 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
309 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.56it/s]\n",
310 | " all 38 905 0.805 0.687 0.752 0.486\n",
311 | "\n",
312 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
313 | " 20/100 10.7G 0.9006 0.4337 0.7921 128 640: 100% 39/39 [00:30<00:00, 1.27it/s]\n",
314 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.48it/s]\n",
315 | " all 38 905 0.915 0.731 0.784 0.533\n",
316 | "\n",
317 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
318 | " 21/100 10.5G 0.9134 0.4385 0.7904 200 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
319 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:01<00:00, 1.89it/s]\n",
320 | " all 38 905 0.901 0.753 0.787 0.53\n",
321 | "\n",
322 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
323 | " 22/100 10.4G 0.9397 0.4435 0.7935 238 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
324 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.57it/s]\n",
325 | " all 38 905 0.905 0.716 0.775 0.511\n",
326 | "\n",
327 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
328 | " 23/100 10.7G 0.9286 0.4472 0.7929 129 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
329 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.61it/s]\n",
330 | " all 38 905 0.909 0.716 0.786 0.539\n",
331 | "\n",
332 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
333 | " 24/100 10.8G 0.89 0.4373 0.7943 133 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
334 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.67it/s]\n",
335 | " all 38 905 0.875 0.711 0.788 0.542\n",
336 | "\n",
337 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
338 | " 25/100 10.4G 0.9148 0.4435 0.7901 257 640: 100% 39/39 [00:29<00:00, 1.30it/s]\n",
339 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:01<00:00, 1.57it/s]\n",
340 | " all 38 905 0.875 0.724 0.776 0.529\n",
341 | "\n",
342 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
343 | " 26/100 10.4G 0.889 0.423 0.7898 167 640: 100% 39/39 [00:29<00:00, 1.30it/s]\n",
344 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.75it/s]\n",
345 | " all 38 905 0.923 0.737 0.801 0.53\n",
346 | "\n",
347 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
348 | " 27/100 10.7G 0.8876 0.4289 0.7897 109 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
349 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.38it/s]\n",
350 | " all 38 905 0.838 0.742 0.792 0.537\n",
351 | "\n",
352 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
353 | " 28/100 10.7G 0.9303 0.4417 0.7912 216 640: 100% 39/39 [00:29<00:00, 1.30it/s]\n",
354 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:01<00:00, 1.85it/s]\n",
355 | " all 38 905 0.929 0.731 0.812 0.532\n",
356 | "\n",
357 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
358 | " 29/100 10.7G 0.9325 0.4455 0.7943 201 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
359 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.26it/s]\n",
360 | " all 38 905 0.893 0.743 0.794 0.522\n",
361 | "\n",
362 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
363 | " 30/100 10.7G 0.909 0.4442 0.7889 150 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
364 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.73it/s]\n",
365 | " all 38 905 0.918 0.708 0.785 0.505\n",
366 | "\n",
367 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
368 | " 31/100 10.7G 0.8851 0.4287 0.7916 114 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
369 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:01<00:00, 1.88it/s]\n",
370 | " all 38 905 0.919 0.737 0.8 0.542\n",
371 | "\n",
372 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
373 | " 32/100 10.7G 0.8521 0.4116 0.7887 137 640: 100% 39/39 [00:29<00:00, 1.30it/s]\n",
374 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.30it/s]\n",
375 | " all 38 905 0.922 0.731 0.814 0.563\n",
376 | "\n",
377 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
378 | " 33/100 10.4G 0.8674 0.4204 0.7899 241 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
379 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.83it/s]\n",
380 | " all 38 905 0.838 0.698 0.771 0.532\n",
381 | "\n",
382 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
383 | " 34/100 10.6G 0.8881 0.415 0.7879 121 640: 100% 39/39 [00:30<00:00, 1.27it/s]\n",
384 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.60it/s]\n",
385 | " all 38 905 0.872 0.659 0.764 0.514\n",
386 | "\n",
387 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
388 | " 35/100 10.7G 0.8724 0.4107 0.7891 206 640: 100% 39/39 [00:30<00:00, 1.30it/s]\n",
389 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.50it/s]\n",
390 | " all 38 905 0.867 0.768 0.78 0.539\n",
391 | "\n",
392 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
393 | " 36/100 10.6G 0.8448 0.4047 0.7861 118 640: 100% 39/39 [00:29<00:00, 1.30it/s]\n",
394 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.48it/s]\n",
395 | " all 38 905 0.842 0.748 0.8 0.538\n",
396 | "\n",
397 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
398 | " 37/100 10.7G 0.8604 0.413 0.7907 127 640: 100% 39/39 [00:30<00:00, 1.30it/s]\n",
399 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.37it/s]\n",
400 | " all 38 905 0.89 0.737 0.799 0.544\n",
401 | "\n",
402 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
403 | " 38/100 10.4G 0.8537 0.408 0.789 117 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
404 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.48it/s]\n",
405 | " all 38 905 0.87 0.728 0.786 0.548\n",
406 | "\n",
407 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
408 | " 39/100 11.1G 0.8424 0.4031 0.787 101 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
409 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.22it/s]\n",
410 | " all 38 905 0.921 0.758 0.827 0.577\n",
411 | "\n",
412 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
413 | " 40/100 10.7G 0.8266 0.3936 0.7847 82 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
414 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.49it/s]\n",
415 | " all 38 905 0.92 0.766 0.814 0.57\n",
416 | "\n",
417 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
418 | " 41/100 10.7G 0.8407 0.3989 0.7869 188 640: 100% 39/39 [00:30<00:00, 1.27it/s]\n",
419 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.29it/s]\n",
420 | " all 38 905 0.915 0.771 0.823 0.568\n",
421 | "\n",
422 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
423 | " 42/100 10.4G 0.8431 0.4069 0.785 207 640: 100% 39/39 [00:30<00:00, 1.27it/s]\n",
424 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.55it/s]\n",
425 | " all 38 905 0.917 0.712 0.8 0.568\n",
426 | "\n",
427 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
428 | " 43/100 10.8G 0.8663 0.4132 0.7863 94 640: 100% 39/39 [00:30<00:00, 1.27it/s]\n",
429 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.52it/s]\n",
430 | " all 38 905 0.902 0.743 0.806 0.551\n",
431 | "\n",
432 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
433 | " 44/100 10.6G 0.8219 0.3898 0.7829 113 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
434 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.67it/s]\n",
435 | " all 38 905 0.9 0.734 0.804 0.561\n",
436 | "\n",
437 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
438 | " 45/100 10.4G 0.8253 0.3947 0.7851 292 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
439 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:01<00:00, 1.53it/s]\n",
440 | " all 38 905 0.897 0.737 0.782 0.549\n",
441 | "\n",
442 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
443 | " 46/100 10.4G 0.8441 0.4042 0.7865 202 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
444 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.74it/s]\n",
445 | " all 38 905 0.895 0.743 0.8 0.562\n",
446 | "\n",
447 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
448 | " 47/100 10.7G 0.883 0.4083 0.7861 186 640: 100% 39/39 [00:30<00:00, 1.26it/s]\n",
449 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 3.27it/s]\n",
450 | " all 38 905 0.885 0.694 0.787 0.526\n",
451 | "\n",
452 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
453 | " 48/100 10.7G 0.8379 0.3935 0.7877 122 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
454 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:01<00:00, 1.69it/s]\n",
455 | " all 38 905 0.917 0.753 0.819 0.558\n",
456 | "\n",
457 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
458 | " 49/100 10.7G 0.8275 0.3889 0.7853 157 640: 100% 39/39 [00:30<00:00, 1.30it/s]\n",
459 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.44it/s]\n",
460 | " all 38 905 0.906 0.738 0.806 0.555\n",
461 | "\n",
462 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
463 | " 50/100 10.8G 0.8338 0.3924 0.7846 133 640: 100% 39/39 [00:30<00:00, 1.27it/s]\n",
464 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.62it/s]\n",
465 | " all 38 905 0.875 0.733 0.788 0.564\n",
466 | "\n",
467 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
468 | " 51/100 10.7G 0.8114 0.3827 0.787 184 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
469 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:01<00:00, 1.63it/s]\n",
470 | " all 38 905 0.939 0.746 0.814 0.577\n",
471 | "\n",
472 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
473 | " 52/100 10.9G 0.8092 0.3846 0.7861 96 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
474 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.37it/s]\n",
475 | " all 38 905 0.936 0.719 0.807 0.536\n",
476 | "\n",
477 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
478 | " 53/100 10.4G 0.8238 0.3861 0.787 167 640: 100% 39/39 [00:29<00:00, 1.30it/s]\n",
479 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.51it/s]\n",
480 | " all 38 905 0.903 0.749 0.82 0.554\n",
481 | "\n",
482 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
483 | " 54/100 10.4G 0.8358 0.3891 0.7845 245 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
484 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:01<00:00, 1.71it/s]\n",
485 | " all 38 905 0.939 0.764 0.815 0.575\n",
486 | "\n",
487 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
488 | " 55/100 10.7G 0.7808 0.3683 0.7839 79 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
489 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 3.00it/s]\n",
490 | " all 38 905 0.903 0.754 0.815 0.565\n",
491 | "\n",
492 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
493 | " 56/100 10.6G 0.8 0.3703 0.7823 210 640: 100% 39/39 [00:29<00:00, 1.30it/s]\n",
494 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.65it/s]\n",
495 | " all 38 905 0.902 0.755 0.814 0.572\n",
496 | "\n",
497 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
498 | " 57/100 10.4G 0.771 0.3614 0.7841 210 640: 100% 39/39 [00:29<00:00, 1.31it/s]\n",
499 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:01<00:00, 1.69it/s]\n",
500 | " all 38 905 0.939 0.753 0.814 0.582\n",
501 | "\n",
502 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
503 | " 58/100 10.4G 0.7918 0.3675 0.7824 194 640: 100% 39/39 [00:30<00:00, 1.30it/s]\n",
504 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.50it/s]\n",
505 | " all 38 905 0.913 0.744 0.808 0.552\n",
506 | "\n",
507 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
508 | " 59/100 10.4G 0.7813 0.3616 0.7828 100 640: 100% 39/39 [00:29<00:00, 1.30it/s]\n",
509 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.49it/s]\n",
510 | " all 38 905 0.876 0.757 0.809 0.577\n",
511 | "\n",
512 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
513 | " 60/100 10.7G 0.7627 0.3557 0.7819 155 640: 100% 39/39 [00:30<00:00, 1.30it/s]\n",
514 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:01<00:00, 1.96it/s]\n",
515 | " all 38 905 0.888 0.752 0.796 0.555\n",
516 | "\n",
517 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
518 | " 61/100 10.4G 0.798 0.3683 0.7815 123 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
519 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.54it/s]\n",
520 | " all 38 905 0.931 0.734 0.795 0.551\n",
521 | "\n",
522 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
523 | " 62/100 10.4G 0.7997 0.3691 0.7853 118 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
524 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.25it/s]\n",
525 | " all 38 905 0.888 0.724 0.794 0.548\n",
526 | "\n",
527 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
528 | " 63/100 10.4G 0.7678 0.359 0.7826 202 640: 100% 39/39 [00:30<00:00, 1.30it/s]\n",
529 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:01<00:00, 2.00it/s]\n",
530 | " all 38 905 0.884 0.757 0.812 0.563\n",
531 | "\n",
532 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
533 | " 64/100 10.7G 0.7701 0.3609 0.7826 269 640: 100% 39/39 [00:29<00:00, 1.31it/s]\n",
534 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.75it/s]\n",
535 | " all 38 905 0.922 0.746 0.808 0.578\n",
536 | "\n",
537 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
538 | " 65/100 10.4G 0.7587 0.356 0.7822 158 640: 100% 39/39 [00:30<00:00, 1.30it/s]\n",
539 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.77it/s]\n",
540 | " all 38 905 0.943 0.738 0.822 0.575\n",
541 | "\n",
542 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
543 | " 66/100 10.5G 0.7547 0.3567 0.7811 151 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
544 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:01<00:00, 1.76it/s]\n",
545 | " all 38 905 0.894 0.747 0.814 0.561\n",
546 | "\n",
547 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
548 | " 67/100 10.7G 0.7432 0.3529 0.7782 234 640: 100% 39/39 [00:30<00:00, 1.30it/s]\n",
549 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.43it/s]\n",
550 | " all 38 905 0.905 0.733 0.793 0.556\n",
551 | "\n",
552 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
553 | " 68/100 10.4G 0.7363 0.3495 0.7827 185 640: 100% 39/39 [00:29<00:00, 1.31it/s]\n",
554 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.78it/s]\n",
555 | " all 38 905 0.924 0.747 0.808 0.564\n",
556 | "\n",
557 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
558 | " 69/100 10.4G 0.7471 0.3522 0.7821 129 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
559 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:01<00:00, 1.86it/s]\n",
560 | " all 38 905 0.957 0.756 0.814 0.574\n",
561 | "\n",
562 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
563 | " 70/100 10.4G 0.7696 0.3587 0.7795 202 640: 100% 39/39 [00:29<00:00, 1.30it/s]\n",
564 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.61it/s]\n",
565 | " all 38 905 0.917 0.722 0.798 0.569\n",
566 | "\n",
567 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
568 | " 71/100 10.4G 0.7302 0.3436 0.782 145 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
569 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.67it/s]\n",
570 | " all 38 905 0.957 0.744 0.821 0.58\n",
571 | "\n",
572 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
573 | " 72/100 10.3G 0.7441 0.3477 0.7802 104 640: 100% 39/39 [00:29<00:00, 1.31it/s]\n",
574 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:01<00:00, 1.72it/s]\n",
575 | " all 38 905 0.959 0.753 0.82 0.573\n",
576 | "\n",
577 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
578 | " 73/100 10.6G 0.7635 0.3554 0.7815 197 640: 100% 39/39 [00:30<00:00, 1.30it/s]\n",
579 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.86it/s]\n",
580 | " all 38 905 0.905 0.714 0.795 0.55\n",
581 | "\n",
582 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
583 | " 74/100 10.4G 0.7466 0.3499 0.7787 151 640: 100% 39/39 [00:31<00:00, 1.26it/s]\n",
584 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.49it/s]\n",
585 | " all 38 905 0.913 0.727 0.794 0.576\n",
586 | "\n",
587 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
588 | " 75/100 10.7G 0.7165 0.3359 0.7785 123 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
589 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:01<00:00, 1.62it/s]\n",
590 | " all 38 905 0.932 0.757 0.811 0.579\n",
591 | "\n",
592 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
593 | " 76/100 10.6G 0.7554 0.3494 0.7804 140 640: 100% 39/39 [00:29<00:00, 1.31it/s]\n",
594 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.58it/s]\n",
595 | " all 38 905 0.921 0.739 0.826 0.602\n",
596 | "\n",
597 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
598 | " 77/100 10.7G 0.7221 0.3368 0.7773 161 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
599 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.64it/s]\n",
600 | " all 38 905 0.927 0.76 0.817 0.578\n",
601 | "\n",
602 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
603 | " 78/100 10.4G 0.7283 0.3409 0.7779 206 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
604 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.58it/s]\n",
605 | " all 38 905 0.91 0.766 0.822 0.585\n",
606 | "\n",
607 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
608 | " 79/100 10.7G 0.7277 0.3365 0.7782 165 640: 100% 39/39 [00:30<00:00, 1.30it/s]\n",
609 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.75it/s]\n",
610 | " all 38 905 0.937 0.74 0.829 0.595\n",
611 | "\n",
612 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
613 | " 80/100 10.4G 0.716 0.3354 0.7798 89 640: 100% 39/39 [00:29<00:00, 1.30it/s]\n",
614 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.54it/s]\n",
615 | " all 38 905 0.935 0.763 0.826 0.593\n",
616 | "\n",
617 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
618 | " 81/100 10.4G 0.7175 0.3317 0.7773 115 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
619 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.56it/s]\n",
620 | " all 38 905 0.937 0.76 0.825 0.596\n",
621 | "\n",
622 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
623 | " 82/100 10.7G 0.6918 0.3245 0.7777 162 640: 100% 39/39 [00:29<00:00, 1.32it/s]\n",
624 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.49it/s]\n",
625 | " all 38 905 0.944 0.754 0.827 0.598\n",
626 | "\n",
627 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
628 | " 83/100 10.9G 0.6849 0.3205 0.777 141 640: 100% 39/39 [00:30<00:00, 1.29it/s]\n",
629 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.86it/s]\n",
630 | " all 38 905 0.957 0.745 0.824 0.593\n",
631 | "\n",
632 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
633 | " 84/100 10.8G 0.697 0.323 0.7785 179 640: 100% 39/39 [00:30<00:00, 1.27it/s]\n",
634 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.46it/s]\n",
635 | " all 38 905 0.964 0.759 0.828 0.588\n",
636 | "\n",
637 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
638 | " 85/100 10.6G 0.7172 0.329 0.7774 216 640: 100% 39/39 [00:29<00:00, 1.31it/s]\n",
639 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.38it/s]\n",
640 | " all 38 905 0.958 0.741 0.812 0.574\n",
641 | "\n",
642 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
643 | " 86/100 10.5G 0.7137 0.328 0.7771 147 640: 100% 39/39 [00:29<00:00, 1.30it/s]\n",
644 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.61it/s]\n",
645 | " all 38 905 0.961 0.751 0.835 0.594\n",
646 | "\n",
647 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
648 | " 87/100 10.4G 0.6739 0.3182 0.7764 122 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
649 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.42it/s]\n",
650 | " all 38 905 0.952 0.756 0.83 0.588\n",
651 | "\n",
652 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
653 | " 88/100 10.8G 0.693 0.322 0.7768 288 640: 100% 39/39 [00:29<00:00, 1.30it/s]\n",
654 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:01<00:00, 1.81it/s]\n",
655 | " all 38 905 0.95 0.726 0.828 0.594\n",
656 | "\n",
657 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
658 | " 89/100 10.4G 0.7022 0.3246 0.7769 181 640: 100% 39/39 [00:30<00:00, 1.28it/s]\n",
659 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.61it/s]\n",
660 | " all 38 905 0.949 0.746 0.833 0.595\n",
661 | "\n",
662 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
663 | " 90/100 10.4G 0.6736 0.3127 0.7774 79 640: 100% 39/39 [00:29<00:00, 1.31it/s]\n",
664 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.55it/s]\n",
665 | " all 38 905 0.912 0.756 0.825 0.591\n",
666 | "Closing dataloader mosaic\n",
667 | "\u001b[34m\u001b[1malbumentations: \u001b[0mBlur(p=0.01, blur_limit=(3, 7)), MedianBlur(p=0.01, blur_limit=(3, 7)), ToGray(p=0.01), CLAHE(p=0.01, clip_limit=(1, 4.0), tile_grid_size=(8, 8))\n",
668 | "/usr/lib/python3.10/multiprocessing/popen_fork.py:66: RuntimeWarning: os.fork() was called. os.fork() is incompatible with multithreaded code, and JAX is multithreaded, so this will likely lead to a deadlock.\n",
669 | " self.pid = os.fork()\n",
670 | "\n",
671 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
672 | " 91/100 10.9G 0.6377 0.3075 0.7749 92 640: 100% 39/39 [00:36<00:00, 1.07it/s]\n",
673 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.33it/s]\n",
674 | " all 38 905 0.934 0.751 0.823 0.587\n",
675 | "\n",
676 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
677 | " 92/100 11G 0.627 0.3039 0.7768 92 640: 100% 39/39 [00:28<00:00, 1.35it/s]\n",
678 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.45it/s]\n",
679 | " all 38 905 0.941 0.747 0.822 0.575\n",
680 | "\n",
681 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
682 | " 93/100 11.1G 0.6178 0.3001 0.7761 94 640: 100% 39/39 [00:28<00:00, 1.37it/s]\n",
683 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.50it/s]\n",
684 | " all 38 905 0.957 0.735 0.825 0.583\n",
685 | "\n",
686 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
687 | " 94/100 11.1G 0.6152 0.2984 0.7757 89 640: 100% 39/39 [00:28<00:00, 1.36it/s]\n",
688 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.52it/s]\n",
689 | " all 38 905 0.956 0.752 0.827 0.578\n",
690 | "\n",
691 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
692 | " 95/100 11.1G 0.618 0.2995 0.7742 88 640: 100% 39/39 [00:28<00:00, 1.35it/s]\n",
693 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:01<00:00, 2.00it/s]\n",
694 | " all 38 905 0.961 0.748 0.821 0.585\n",
695 | "\n",
696 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
697 | " 96/100 11.2G 0.6089 0.2968 0.775 95 640: 100% 39/39 [00:28<00:00, 1.36it/s]\n",
698 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.69it/s]\n",
699 | " all 38 905 0.944 0.733 0.818 0.575\n",
700 | "\n",
701 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
702 | " 97/100 11.1G 0.5971 0.2918 0.7733 92 640: 100% 39/39 [00:28<00:00, 1.35it/s]\n",
703 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.59it/s]\n",
704 | " all 38 905 0.961 0.753 0.827 0.593\n",
705 | "\n",
706 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
707 | " 98/100 11.2G 0.6029 0.2922 0.7725 94 640: 100% 39/39 [00:28<00:00, 1.36it/s]\n",
708 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.42it/s]\n",
709 | " all 38 905 0.943 0.745 0.827 0.583\n",
710 | "\n",
711 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
712 | " 99/100 11.1G 0.6011 0.291 0.7738 93 640: 100% 39/39 [00:28<00:00, 1.35it/s]\n",
713 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.67it/s]\n",
714 | " all 38 905 0.955 0.752 0.823 0.582\n",
715 | "\n",
716 | " Epoch GPU_mem box_loss cls_loss dfl_loss Instances Size\n",
717 | " 100/100 11G 0.5881 0.2825 0.7747 92 640: 100% 39/39 [00:28<00:00, 1.36it/s]\n",
718 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 3.27it/s]\n",
719 | " all 38 905 0.955 0.755 0.827 0.589\n",
720 | "\n",
721 | "100 epochs completed in 0.965 hours.\n",
722 | "Optimizer stripped from runs/detect/train/weights/last.pt, 106.8MB\n",
723 | "Optimizer stripped from runs/detect/train/weights/best.pt, 106.8MB\n",
724 | "\n",
725 | "Validating runs/detect/train/weights/best.pt...\n",
726 | "Ultralytics YOLOv8.2.2 🚀 Python-3.10.12 torch-2.2.1+cu121 CUDA:0 (Tesla T4, 15102MiB)\n",
727 | "YOLOv5l summary (fused): 303 layers, 53134492 parameters, 0 gradients, 134.7 GFLOPs\n",
728 | " Class Images Instances Box(P R mAP50 mAP50-95): 100% 2/2 [00:00<00:00, 2.43it/s]\n",
729 | " all 38 905 0.921 0.739 0.827 0.6\n",
730 | " ball 38 35 1 0.228 0.448 0.209\n",
731 | " goalkeeper 38 27 0.868 0.852 0.936 0.749\n",
732 | " player 38 754 0.959 0.956 0.984 0.791\n",
733 | " referee 38 89 0.858 0.921 0.939 0.653\n",
734 | "Speed: 0.1ms preprocess, 10.1ms inference, 0.0ms loss, 1.4ms postprocess per image\n",
735 | "Results saved to \u001b[1mruns/detect/train\u001b[0m\n",
736 | "💡 Learn more at https://docs.ultralytics.com/modes/train\n"
737 | ]
738 | }
739 | ],
740 | "source": [
741 | "!yolo task=detect mode=train model=yolov5l.pt data=\"{dataset.location}/data.yaml\" epochs=100 imgsz=640"
742 | ]
743 | },
744 | {
745 | "cell_type": "code",
746 | "execution_count": 6,
747 | "metadata": {
748 | "id": "-1Mi_LTWdslG"
749 | },
750 | "outputs": [],
751 | "source": []
752 | }
753 | ],
754 | "metadata": {
755 | "accelerator": "GPU",
756 | "colab": {
757 | "gpuType": "T4",
758 | "provenance": []
759 | },
760 | "kernelspec": {
761 | "display_name": "Python 3",
762 | "name": "python3"
763 | },
764 | "language_info": {
765 | "codemirror_mode": {
766 | "name": "ipython",
767 | "version": 3
768 | },
769 | "file_extension": ".py",
770 | "mimetype": "text/x-python",
771 | "name": "python",
772 | "nbconvert_exporter": "python",
773 | "pygments_lexer": "ipython3",
774 | "version": "3.11.9"
775 | }
776 | },
777 | "nbformat": 4,
778 | "nbformat_minor": 0
779 | }
780 |
--------------------------------------------------------------------------------
/utils/__init__.py:
--------------------------------------------------------------------------------
1 | from .video_utils import read_video, save_video
2 | from .bbox_utils import get_center_of_bbox, get_bbox_width, measure_distance, measure_xy_distance, get_foot_position
--------------------------------------------------------------------------------
/utils/bbox_utils.py:
--------------------------------------------------------------------------------
1 | def get_center_of_bbox(bbox):
2 |     x1,y1,x2,y2 = bbox
3 |     return int((x1+x2)/2),int((y1+y2)/2)
4 |
5 | def get_bbox_width(bbox):
6 |     return bbox[2]-bbox[0]
7 |
8 | def measure_distance(p1,p2):
9 |     return ((p1[0]-p2[0])**2 + (p1[1]-p2[1])**2)**0.5
10 |
11 | def measure_xy_distance(p1,p2):
12 |     return p1[0]-p2[0],p1[1]-p2[1]
13 |
14 | def get_foot_position(bbox):
15 |     x1,y1,x2,y2 = bbox
16 |     return int((x1+x2)/2),int(y2)
--------------------------------------------------------------------------------
/utils/video_utils.py:
--------------------------------------------------------------------------------
1 | import cv2
2 |
3 | def read_video(video_path):
4 |     cap = cv2.VideoCapture(video_path)
5 |     frames = []
6 |     while True:
7 |         ret, frame = cap.read()
8 |         if not ret:
9 |             break
10 |         frames.append(frame)
11 |     cap.release()  # free the capture handle once all frames are read
12 |     return frames
13 |
14 | def save_video(output_video_frames, output_video_path):
15 |     # Write the frames as an XVID-encoded video at a fixed 24 fps.
16 |     fourcc = cv2.VideoWriter_fourcc(*'XVID')
17 |     height, width = output_video_frames[0].shape[:2]
18 |     out = cv2.VideoWriter(output_video_path, fourcc, 24, (width, height))
19 |     for frame in output_video_frames:
20 |         out.write(frame)
21 |     out.release()
22 |
--------------------------------------------------------------------------------
/view_transformer/__init__.py:
--------------------------------------------------------------------------------
1 | from .view_transformer import ViewTransformer
--------------------------------------------------------------------------------
/view_transformer/view_transformer.py:
--------------------------------------------------------------------------------
1 | import numpy as np
2 | import cv2
3 |
4 | class ViewTransformer():
5 |     def __init__(self):
6 |         # Size of the target rectangle (presumably meters on the pitch).
7 |         court_width = 68
8 |         court_length = 23.32
9 |
10 |         # Corners of the reference region in image (pixel) coordinates.
11 |         self.pixel_vertices = np.array([[110, 1035],
12 |                                         [265, 275],
13 |                                         [910, 260],
14 |                                         [1640, 915]])
15 |
16 |         # The same corners, in the same order, in target coordinates.
17 |         self.target_vertices = np.array([
18 |             [0,court_width],
19 |             [0, 0],
20 |             [court_length, 0],
21 |             [court_length, court_width]
22 |         ])
23 |
24 |         self.pixel_vertices = self.pixel_vertices.astype(np.float32)
25 |         self.target_vertices = self.target_vertices.astype(np.float32)
26 |
27 |         self.perspective_transformer = cv2.getPerspectiveTransform(self.pixel_vertices, self.target_vertices)
28 |
29 |     def transform_point(self,point):
30 |         # Points outside the reference region are not transformed.
31 |         p = (int(point[0]),int(point[1]))
32 |         is_inside = cv2.pointPolygonTest(self.pixel_vertices,p,False) >= 0
33 |         if not is_inside:
34 |             return None
35 |
36 |         reshaped_point = point.reshape(-1,1,2).astype(np.float32)
37 |         transformed_point = cv2.perspectiveTransform(reshaped_point,self.perspective_transformer)
38 |         return transformed_point.reshape(-1,2)
39 |
40 |     def add_transformed_position_to_tracks(self,tracks):
41 |         for object_name, object_tracks in tracks.items():
42 |             for frame_num, track in enumerate(object_tracks):
43 |                 for track_id, track_info in track.items():
44 |                     position = track_info['position_adjusted']
45 |                     position = np.array(position)
46 |                     position_transformed = self.transform_point(position)
47 |                     if position_transformed is not None:
48 |                         position_transformed = position_transformed.squeeze().tolist()
49 |                     tracks[object_name][frame_num][track_id]['position_transformed'] = position_transformed
--------------------------------------------------------------------------------
/yolo_inference.py:
--------------------------------------------------------------------------------
1 | from ultralytics import YOLO
2 |
3 | model = YOLO('./models/best.pt')
4 |
5 | results = model.predict('./input_videos/match.mp4',save=True)
6 | print(results[0])
7 |
8 | for box in results[0].boxes:
9 |     print(box)
--------------------------------------------------------------------------------