├── README.md ├── metric ├── README.md ├── ScoreSimilarity.py └── ScoreSimilarity_orig.py └── tokenization_tools ├── README.md ├── detokenizer ├── README.md ├── sample │ ├── generated_score.musicxml │ ├── input_tokens.txt │ └── sample_usage.ipynb └── tokens_to_score.py ├── requirements.txt └── tokenizer ├── README.md ├── sample ├── generated_tokens.txt ├── input_score.musicxml └── sample_usage.ipynb └── score_to_tokens.py /README.md: -------------------------------------------------------------------------------- 1 | # Score Transformer 2 | 3 | This is the official repository for "Score Transformer" (ACM Multimedia Asia 2021 / ISMIR2021 LBD). 4 | 5 | [Paper](https://arxiv.org/abs/2112.00355) | [Short paper](https://archives.ismir.net/ismir2021/latebreaking/000032.pdf) | [Project page](https://score-transformer.github.io/) 6 | 7 | 13 | 14 | ## Overview 15 | 16 | This repository provides: 17 | - [**Tokenization tools**](tokenization_tools) between MusicXML scores and score tokens 18 | - **note: updated version is available [here](https://github.com/suzuqn/ScoreRearrangement)!** 19 | - A [**metric**](metric) used in the papers 20 | 21 | ## Citation 22 | If you find this repository helpful, please consider citing our paper: 23 | ``` 24 | @inproceedings{suzuki2021, 25 | author = {Suzuki, Masahiro}, 26 | title = {Score Transformer: Generating Musical Score from Note-level Representation}, 27 | booktitle = {Proceedings of the 3rd ACM International Conference on Multimedia in Asia}, 28 | year = {2021}, 29 | pages = {31:1--31:7}, 30 | doi = {10.1145/3469877.3490612} 31 | } 32 | ``` 33 | -------------------------------------------------------------------------------- /metric/README.md: -------------------------------------------------------------------------------- 1 | ### MetricForScoreSimilarity 2 | The original implementation is from https://github.com/AndreaCogliati/MetricForScoreSimilarity, which is the official implementation for the paper "A metric for Music Notation 
Transcription Accuracy." 3 | 4 | We partially modified this implementation as described in our paper (see Section 5.4): 5 | 6 | 1. added three musical aspects (*voice*, *beam*, and *tie*) to evaluate our model thoroughly 7 | 2. excluded two aspects (*barline* and *note grouping*) that were also measured using other aspects (*time signature* vs. *barline*, and *voice* vs. *note grouping*) 8 | 3. separated *insertion* and *deletion* errors 9 | 10 | and post-process the result using Pandas in a following way: 11 | 12 | 4. integrated *note* and *rest* metrics 13 | 5. calculated error rates note-wisely -------------------------------------------------------------------------------- /metric/ScoreSimilarity.py: -------------------------------------------------------------------------------- 1 | import music21 2 | import numpy as np 3 | from enum import IntEnum 4 | import copy 5 | import itertools 6 | 7 | 8 | class ScoreErrors(IntEnum): 9 | Clef = 0 10 | KeySignature = 1 11 | TimeSignature = 2 12 | NoteDeletion = 3 13 | NoteInsertion = 4 14 | NoteSpelling = 5 15 | NoteDuration = 6 16 | StemDirection = 7 17 | Beams = 8 # added 18 | Tie = 9 # added 19 | RestInsertion = 10 20 | RestDeletion = 11 21 | RestDuration = 12 22 | StaffAssignment = 13 23 | Voice = 14 # added 24 | 25 | def scoreAlignment(aScore, bScore): 26 | """Compare two musical scores. 
27 | 28 | Parameters: 29 | 30 | aScore/bScore: music21.stream.Score objects 31 | 32 | Return value: 33 | 34 | (path, d): 35 | path is a list of tuples containing pairs of matching offsets 36 | d is the alignment matrix 37 | """ 38 | 39 | def convertScoreToListOfPitches(aScore): 40 | """Convert a piano score into a list of tuples containing pitches 41 | 42 | Parameter: 43 | aScore a music21.Stream containing two music21.stream.PartStaff 44 | 45 | Return value: 46 | list of tuples (offset, pitches) 47 | offset is a real number indicating the offset of an object in music21 terms 48 | pitches is a list of pitches in MIDI numbers 49 | """ 50 | 51 | def getPitches(el): 52 | if isinstance(el, music21.note.Note): 53 | return [el.pitch.midi] 54 | elif isinstance(el, music21.chord.Chord): 55 | currentList = [] 56 | for pitch in el.pitches: 57 | currentList.append(pitch.midi) 58 | return currentList 59 | 60 | def convertStreamToList(aStream): 61 | aList = [] 62 | currentOffset = 0.0 63 | currentList = [] 64 | for el in aStream: 65 | if el.offset == currentOffset: 66 | currentList += getPitches(el) 67 | else: 68 | aList.append((currentOffset, currentList)) 69 | currentOffset = el.offset 70 | currentList = getPitches(el) 71 | return aList 72 | 73 | def flattenStream(aStream): 74 | newStream = music21.stream.Stream() 75 | for el in aStream.recurse(): 76 | if isinstance(el, music21.note.Note) or isinstance(el, music21.chord.Chord): 77 | newStream.insert(el.getOffsetInHierarchy(aStream), el) 78 | return newStream 79 | 80 | # aList = convertStreamToList(aScore.flat.notes) 81 | 82 | # added 83 | parts = aScore.getElementsByClass([music21.stream.PartStaff, music21.stream.Part]) 84 | flat_notes = sorted(itertools.chain.from_iterable([flattenStream(part).elements for part in parts]), key=lambda x:x.offset) 85 | aList = convertStreamToList(flat_notes) 86 | 87 | return aList 88 | 89 | def compareSets(aSet, bSet): 90 | """Compare two sets of pitches. 
91 | 92 | Parameters: 93 | 94 | aSet/bSet: list of pitches 95 | 96 | Return value: 97 | 98 | the number of mismatching objects in the two sets 99 | """ 100 | 101 | a = aSet.copy() 102 | b = bSet.copy() 103 | 104 | # Remove matching pitches from both sets 105 | aTemp = [] 106 | for obj in a: 107 | if obj in b: 108 | b.remove(obj) 109 | else: 110 | aTemp.append(obj) 111 | a = aTemp 112 | 113 | return len(a) + len(b) 114 | 115 | def costMatrix(s, t): 116 | m = len(s) 117 | n = len(t) 118 | d = np.zeros((m + 1, n + 1)) 119 | 120 | for i in range(1, m + 1): 121 | d[i, 0] = np.inf 122 | 123 | for j in range(1, n + 1): 124 | d[0, j] = np.inf 125 | 126 | for j in range(1, n + 1): 127 | for i in range(1, m + 1): 128 | cost = compareSets(s[i - 1][1], t[j - 1][1]) 129 | idx = np.argmin([d[i - 1, j], d[i, j - 1], d[i - 1, j - 1]]) 130 | if idx == 0: 131 | d[i, j] = d[i - 1, j] + cost 132 | elif idx == 1: 133 | d[i, j] = d[i, j - 1] + cost 134 | else: 135 | d[i, j] = d[i - 1, j - 1] + cost 136 | 137 | return d 138 | 139 | # scoreAlignment 140 | aList = convertScoreToListOfPitches(aScore) 141 | bList = convertScoreToListOfPitches(bScore) 142 | d = costMatrix(aList, bList) 143 | 144 | (i,j) = (d.shape[0] - 1, d.shape[1] - 1) 145 | path = [] 146 | while not (i == 0 and j == 0): 147 | aOff = aList[i-1][0] 148 | bOff = bList[j-1][0] 149 | path = [(aOff,bOff)] + path 150 | 151 | idx = np.argmin([d[i - 1, j], d[i, j - 1], d[i - 1, j - 1]]) 152 | if idx == 0: 153 | i = i - 1 154 | elif idx == 1: 155 | j = j - 1 156 | else: 157 | i, j = i - 1, j - 1 158 | 159 | return path, d 160 | 161 | 162 | 163 | def scoreSimilarity(estScore, gtScore): 164 | """Compare two musical scores. 165 | 166 | Parameters: 167 | 168 | estScore/gtScore: music21.stream.Score objects of piano scores. 
The scores must contain two 169 | music21.stream.PartStaff substreams (top and bottom staves) 170 | 171 | estScore is the estimated transcription 172 | gtScore is the ground truth 173 | 174 | Return value: 175 | 176 | a NumPy array containing the differences between the two scores: 177 | 178 | barlines, clefs, key signatures, time signatures, note, note spelling, 179 | note duration, staff assignment, rest, rest duration 180 | 181 | The differences for notes, rests and barlines are normalized with the number of symbols 182 | in the ground truth 183 | """ 184 | 185 | def isInstanceOfClasses(obj, classes): 186 | """Helper function to determine if an item is an instance of several classes""" 187 | for cls in classes: 188 | if isinstance(obj, cls): 189 | return True 190 | return False 191 | 192 | def countSymbols(aScore): 193 | """Count the number of symbols in a score 194 | 195 | Parameter: 196 | aScore a music21.Stream 197 | 198 | Return value: 199 | the number of music symbols (notes, rests, chords, barlines) in the score 200 | """ 201 | 202 | # Classes to consider 203 | CLASSES = [music21.note.Note, music21.chord.Chord, music21.note.Rest] 204 | 205 | nSymbols = {'n_' + cls.__name__: sum([len(el.notes) if cls == music21.chord.Chord else 1 206 | for el in aScore.recurse() if isinstance(el, cls)]) 207 | for cls in CLASSES} 208 | 209 | return nSymbols 210 | 211 | def convertScoreToList(aScore): 212 | """Convert a piano score into a list of tuples 213 | 214 | Parameter: 215 | aScore a music21.Stream containing two music21.stream.PartStaff 216 | 217 | Return value: 218 | list of tuples (offset, staff, object) 219 | offset is a real number indicating the offset of an object in music21 terms 220 | staff is an integer indicating the staff (0 = top, 1 = bottom) 221 | object is a music21 object 222 | """ 223 | 224 | # Classes to consider 225 | CLASSES = [music21.bar.Barline, music21.note.Note, music21.note.Rest, music21.chord.Chord] 226 | 227 | def 
convertStreamToList(aStream, staff): 228 | aList = [] 229 | currentOffset = 0.0 230 | currentList = [] 231 | for el in aStream.recurse(): 232 | if isInstanceOfClasses(el, CLASSES): 233 | if el.getOffsetInHierarchy(aStream) == currentOffset: 234 | currentList.append((staff, el)) 235 | else: 236 | aList.append((currentOffset, currentList)) 237 | currentOffset = el.getOffsetInHierarchy(aStream) 238 | currentList = [(staff, el)] 239 | return aList 240 | 241 | def flattenStream(aStream): 242 | newStream = music21.stream.Stream() 243 | for el in aStream.recurse(): 244 | if isInstanceOfClasses(el, CLASSES): 245 | newStream.insert(el.getOffsetInHierarchy(aStream), el) 246 | elif isinstance(el, music21.stream.Measure): 247 | newStream.insert(el.getOffsetInHierarchy(aStream), music21.bar.Barline()) 248 | return newStream 249 | 250 | def getNext(iterator): 251 | try: 252 | return next(iterator) 253 | except StopIteration: 254 | return None 255 | 256 | parts = aScore.getElementsByClass([music21.stream.PartStaff, music21.stream.Part]) # get staves 257 | topStaffList = convertStreamToList(flattenStream(parts[0]), 0) 258 | bottomStaffList = convertStreamToList(flattenStream(parts[1]), 1) if len(parts) == 2 else [] 259 | 260 | aList = [] 261 | tIterator = iter(topStaffList) 262 | bIterator = iter(bottomStaffList) 263 | tEl = getNext(tIterator) 264 | bEl = getNext(bIterator) 265 | 266 | while tEl or bEl: 267 | if not tEl: 268 | aList.append((bEl[0], bEl[1])) 269 | bEl = getNext(bIterator) 270 | elif not bEl: 271 | aList.append((tEl[0], tEl[1])) 272 | tEl = getNext(tIterator) 273 | else: 274 | if tEl[0] < bEl[0]: 275 | aList.append((tEl[0], tEl[1])) 276 | tEl = getNext(tIterator) 277 | elif tEl[0] > bEl[0]: 278 | aList.append((bEl[0], bEl[1])) 279 | bEl = getNext(bIterator) 280 | else: 281 | aList.append((tEl[0], tEl[1] + bEl[1])) 282 | tEl = getNext(tIterator) 283 | bEl = getNext(bIterator) 284 | 285 | return aList 286 | 287 | def countObjects(aSet): 288 | """Count objects in a set 
289 | 290 | Parameters: 291 | 292 | aSet: list of tuples (staff, object) 293 | staff is an integer indicating the staff (1 = top, 2 = bottom) 294 | object is a music21 object 295 | 296 | Return value: 297 | 298 | a tuple with the numbers of objects in the set (see definition of errors below) 299 | """ 300 | 301 | errors = np.zeros((len(ScoreErrors.__members__)), int) 302 | 303 | for obj in aSet: 304 | if isinstance(obj[1], (music21.stream.Measure, music21.bar.Barline, music21.clef.Clef, \ 305 | music21.key.Key, music21.key.KeySignature, music21.meter.TimeSignature)): 306 | pass 307 | elif isinstance(obj[1], music21.note.Note): 308 | errors[ScoreErrors.NoteDeletion] += 1 309 | elif isinstance(obj[1], music21.chord.Chord): 310 | errors[ScoreErrors.NoteDeletion] += len(obj[1].pitches) 311 | elif isinstance(obj[1], music21.note.Rest): 312 | errors[ScoreErrors.RestDeletion] += 1 313 | else: 314 | print('Class not found:', type(obj[1])) 315 | 316 | return errors 317 | 318 | def compareSets(aSet, bSet): 319 | """Compare two sets of concurrent musical objects. 
320 | 321 | Parameters: 322 | 323 | aSet/bSet: list of tuples (staff, object) 324 | staff is an integer indicating the staff (1 = top, 2 = bottom) 325 | object is a music21 object 326 | 327 | Return value: 328 | 329 | a tuple with the differences between the two sets (see definition of errors below) 330 | """ 331 | 332 | def findEnharmonicEquivalent(note, aSet): 333 | """Find the first enharmonic equivalent in a set 334 | 335 | Parameters: 336 | 337 | note: a music21.note.Note object 338 | aSet: list of tuples (staff, object) 339 | staff is an integer indicating the staff (0 = top, 1 = bottom) 340 | object is a music21 object 341 | 342 | Return value: 343 | 344 | index of the first enharmonic equivalent of note in aSet 345 | -1 otherwise 346 | """ 347 | for i, obj in enumerate(aSet): 348 | if isinstance(obj[1], music21.note.Note) and obj[1].pitch.ps == note.pitch.ps: 349 | return i 350 | return -1 351 | 352 | def splitChords(aSet): 353 | """Split chords into seperate notes 354 | 355 | Parameters: 356 | 357 | aSet: list of tuples (staff, object) 358 | staff is an integer indicating the staff (0 = top, 1 = bottom) 359 | object is a music21 object 360 | 361 | Return value: 362 | a tuple (newSet, chords) 363 | newSet: aSet with split chords 364 | chords: the number of chords in aSet 365 | 366 | """ 367 | newSet = [] 368 | chordSet = [] # added 369 | numChords = 0 370 | for obj in aSet: 371 | if isinstance(obj[1], music21.chord.Chord): 372 | numChords += 1 373 | for note in obj[1]: # added 374 | if not note.containerHierarchy: 375 | note.containerHierarchy = obj[1].containerHierarchy 376 | if not note.contextSites: 377 | note.contextSites = obj[1].contextSites 378 | if note.stemDirection == 'unspecified': 379 | note.stemDirection = obj[1].stemDirection 380 | 381 | # newNote = copy.deepcopy(note) 382 | newSet.append((obj[0], note)) 383 | chordSet.append(obj) # added 384 | else: 385 | newSet.append(obj) 386 | 387 | return newSet, chordSet, numChords # modified 388 | 389 | 
def compareObj(aObj, bObj): 390 | # Compare Music 21 objects 391 | if isinstance(aObj, music21.note.Note) or isinstance(aObj, music21.chord.Chord): 392 | return False 393 | if aObj == bObj: 394 | return True 395 | if type(aObj) != type(bObj): 396 | if not isinstance(aObj, music21.key.Key) and not isinstance(aObj, music21.key.KeySignature): # added 397 | return False 398 | if isinstance(aObj, music21.stream.Measure): 399 | return True 400 | if isinstance(aObj, music21.bar.Barline): 401 | return True 402 | if isinstance(aObj, music21.clef.Clef): 403 | if type(aObj) == type(bObj): 404 | return True 405 | if isinstance(aObj, music21.key.Key) or isinstance(aObj, music21.key.KeySignature): # mod 406 | if aObj.sharps == bObj.sharps: 407 | return True 408 | if isinstance(aObj, music21.meter.TimeSignature): 409 | if aObj.numerator / aObj.beatCount == bObj.numerator / bObj.beatCount: # mod 410 | return True 411 | if isinstance(aObj, music21.note.Note): 412 | if aObj.pitch == bObj.pitch and aObj.duration == bObj.duration and aObj.stemDirection == bObj.stemDirection: 413 | return True 414 | if isinstance(aObj, music21.note.Rest): 415 | if aObj.duration == bObj.duration: 416 | return True 417 | if isinstance(aObj, music21.chord.Chord): 418 | if aObj.duration == bObj.duration and aObj.pitches == bObj.pitches: 419 | return True 420 | return False 421 | 422 | def findObj(aPair, aSet): 423 | # Find 424 | for bPair in aSet: 425 | if aPair[0] == bPair[0]: 426 | if compareObj(aPair[1], bPair[1]): 427 | return bPair 428 | return None 429 | 430 | def comparePitch(aObj, bObj): # added 431 | if isinstance(aObj, music21.note.Note): 432 | return aObj.pitch == bObj.pitch 433 | elif isinstance(aObj, music21.chord.Chord): 434 | return set(aObj.pitches) == set(bObj.pitches) 435 | 436 | def getBeams(noteObj): # added 437 | return '_'.join(['-'.join([b.type, b.direction]) if b.direction else b.type for b in noteObj.beams]) 438 | 439 | def getTie(noteObj): # added 440 | return noteObj.tie.type if 
noteObj.tie is not None else '' 441 | 442 | def referClef(noteObj): # added 443 | return noteObj.getContextByClass('Clef').name if noteObj.getContextByClass('Clef') is not None else '' 444 | 445 | def referTimeSig(noteObj): # added 446 | return noteObj.getContextByClass('TimeSignature').numerator / noteObj.getContextByClass('TimeSignature').denominator \ 447 | if noteObj.getContextByClass('TimeSignature') is not None else '' 448 | 449 | def referKeySig(noteObj): # added 450 | keyObj = (noteObj.getContextByClass('Key') or noteObj.getContextByClass('KeySignature')) 451 | return keyObj.sharps if keyObj else 0 452 | 453 | def referVoice(noteObj): # added 454 | return noteObj.getContextByClass('Voice').id if noteObj.getContextByClass('Voice') is not None else '1' 455 | 456 | errors = np.zeros((len(ScoreErrors.__members__)), int) 457 | 458 | a = aSet.copy() 459 | b = bSet.copy() 460 | 461 | # Remove matching pairs from both sets 462 | aTemp = [] 463 | for pair in a: 464 | bPair = findObj(pair, b) 465 | if bPair: 466 | b.remove(bPair) 467 | else: 468 | aTemp.append(pair) 469 | a = aTemp 470 | 471 | # Find mismatched staff placement 472 | aTemp = [] 473 | for obj in a: 474 | bTemp = [o[1] for o in b if o[0] != obj[0]] 475 | if obj[1] in bTemp: 476 | idx = b.index((1 - obj[0], obj[1])) 477 | del b[idx] 478 | errors[ScoreErrors.StaffAssignment] += 1 479 | else: 480 | aTemp.append(obj) 481 | a = aTemp 482 | 483 | a, aChords, aNumChords = splitChords(a) 484 | b, bChords, bNumChords = splitChords(b) 485 | 486 | # Find mismatches in notes 487 | aTemp = [] 488 | for obj in a: 489 | if isinstance(obj[1], music21.note.Note): 490 | found = False 491 | for bObj in b: 492 | if isinstance(bObj[1], music21.note.Note) and bObj[1].pitch == obj[1].pitch: 493 | if bObj[0] != obj[0]: 494 | errors[ScoreErrors.StaffAssignment] += 1 495 | else: # added 496 | if bObj[1].duration != obj[1].duration: 497 | errors[ScoreErrors.NoteDuration] += 1 498 | if bObj[1].stemDirection != 
obj[1].stemDirection: 499 | errors[ScoreErrors.StemDirection] += 1 500 | 501 | if getBeams(bObj[1]) != getBeams(obj[1]): # added 502 | errors[ScoreErrors.Beams] += 1 503 | if getTie(bObj[1]) != getTie(obj[1]): # added 504 | errors[ScoreErrors.Tie] += 1 505 | if referClef(bObj[1]) != referClef(obj[1]): # added 506 | errors[ScoreErrors.Clef] += 1 507 | if referTimeSig(bObj[1]) != referTimeSig(obj[1]): # added 508 | errors[ScoreErrors.TimeSignature] += 1 509 | if referKeySig(bObj[1]) != referKeySig(obj[1]): # added 510 | errors[ScoreErrors.KeySignature] += 1 511 | if referVoice(bObj[1]) != referVoice(obj[1]): # added 512 | errors[ScoreErrors.Voice] += 1 513 | 514 | b.remove(bObj) 515 | found = True 516 | break 517 | if not found: 518 | aTemp.append(obj) 519 | else: 520 | aTemp.append(obj) 521 | a = aTemp 522 | 523 | # Find mismatched duration of rests 524 | aTemp = [] 525 | for obj in a: 526 | if isinstance(obj[1], music21.note.Rest): 527 | for bObj in b: 528 | if isinstance(bObj[1], music21.note.Rest) and bObj[1].duration != obj[1].duration: 529 | b.remove(bObj) 530 | errors[ScoreErrors.RestDuration] += 1 531 | break 532 | aTemp.append(obj) 533 | else: 534 | aTemp.append(obj) 535 | a = aTemp 536 | 537 | # Find enharmonic equivalents and report spelling mistakes and duration mistakes 538 | aTemp = [] 539 | for obj in a: 540 | if isinstance(obj[1], music21.note.Note): 541 | idx = findEnharmonicEquivalent(obj[1], b) 542 | if idx != -1: 543 | if b[idx][0] != obj[0]: 544 | errors[ScoreErrors.StaffAssignment] += 1 545 | if b[idx][1].duration != obj[1].duration: 546 | errors[ScoreErrors.NoteDuration] += 1 547 | if b[idx][1].stemDirection != obj[1].stemDirection: 548 | errors[ScoreErrors.StemDirection] += 1 549 | 550 | if getBeams(b[idx][1]) != getBeams(obj[1]): # added 551 | errors[ScoreErrors.Beams] += 1 552 | if getTie(b[idx][1]) != getTie(obj[1]): # added 553 | errors[ScoreErrors.Tie] += 1 554 | if referClef(b[idx][1]) != referClef(obj[1]): # added 555 | 
errors[ScoreErrors.Clef] += 1 556 | if referTimeSig(b[idx][1]) != referTimeSig(obj[1]): # added 557 | errors[ScoreErrors.TimeSignature] += 1 558 | if referKeySig(b[idx][1]) != referKeySig(obj[1]): # added 559 | errors[ScoreErrors.KeySignature] += 1 560 | if referVoice(b[idx][1]) != referVoice(obj[1]): # added 561 | errors[ScoreErrors.Voice] += 1 562 | 563 | del b[idx] 564 | errors[ScoreErrors.NoteSpelling] += 1 565 | else: 566 | aTemp.append(obj) 567 | else: 568 | aTemp.append(obj) 569 | a = aTemp 570 | 571 | aErrors = countObjects(a) 572 | bErrors = countObjects(b) 573 | 574 | errors += bErrors 575 | errors[ScoreErrors.NoteInsertion] = aErrors[ScoreErrors.NoteDeletion] 576 | errors[ScoreErrors.RestInsertion] = aErrors[ScoreErrors.RestDeletion] 577 | 578 | # print() 579 | # print('aSet =', aSet) 580 | # print('bSet =', bSet) 581 | # print('errors =', errors) 582 | # print() 583 | 584 | return errors 585 | 586 | def getSet(aList, start, end): 587 | set = [] 588 | for aTuple in aList: 589 | if aTuple[0] >= end: 590 | return set 591 | if aTuple[0] >= start: 592 | set += aTuple[1] 593 | return set 594 | 595 | # scoreSimilarity 596 | path, _ = scoreAlignment(estScore, gtScore) 597 | 598 | aList = convertScoreToList(estScore) 599 | bList = convertScoreToList(gtScore) 600 | 601 | nSymbols = countSymbols(gtScore) 602 | 603 | errors = np.zeros((len(ScoreErrors.__members__)), float) 604 | 605 | aStart, aEnd = 0.0, 0.0 606 | bStart, bEnd = 0.0, 0.0 607 | for pair in path: 608 | if pair[0] != aEnd and pair[1] != bEnd: 609 | aEnd, bEnd = pair[0], pair[1] 610 | errors += compareSets(getSet(aList, aStart, aEnd), getSet(bList, bStart, bEnd)) 611 | 612 | aStart, aEnd = aEnd, aEnd 613 | bStart, bEnd = bEnd, bEnd 614 | elif pair[0] == aEnd: 615 | bEnd = pair[1] 616 | else: 617 | aEnd = pair[0] 618 | 619 | errors += compareSets(getSet(aList, aStart, float('inf')), getSet(bList, bStart, float('inf'))) 620 | 621 | results = {k: int(v) for k, v in zip(ScoreErrors.__members__.keys(), 
errors)} 622 | results.update(nSymbols) 623 | 624 | return results 625 | -------------------------------------------------------------------------------- /metric/ScoreSimilarity_orig.py: -------------------------------------------------------------------------------- 1 | import music21 2 | import numpy as np 3 | from enum import IntEnum 4 | 5 | 6 | class ScoreErrors(IntEnum): 7 | Barline = 0 8 | Clef = 1 9 | KeySignature = 2 10 | TimeSignature = 3 11 | Note = 4 12 | NoteSpelling = 5 13 | NoteDuration = 6 14 | StemDirection = 7 15 | Grouping = 8 16 | Rest = 9 17 | RestDuration = 10 18 | StaffAssignment = 11 19 | 20 | 21 | def scoreAlignment(aScore, bScore): 22 | """Compare two musical scores. 23 | 24 | Parameters: 25 | 26 | aScore/bScore: music21.stream.Score objects 27 | 28 | Return value: 29 | 30 | (path, d): 31 | path is a list of tuples containing pairs of matching offsets 32 | d is the alignment matrix 33 | """ 34 | 35 | def convertScoreToListOfPitches(aScore): 36 | """Convert a piano score into a list of tuples containing pitches 37 | 38 | Parameter: 39 | aScore a music21.Stream containing two music21.stream.PartStaff 40 | 41 | Return value: 42 | list of tuples (offset, pitches) 43 | offset is a real number indicating the offset of an object in music21 terms 44 | pitches is a list of pitches in MIDI numbers 45 | """ 46 | 47 | def getPitches(el): 48 | if isinstance(el, music21.note.Note): 49 | return [el.pitch.midi] 50 | elif isinstance(el, music21.chord.Chord): 51 | currentList = [] 52 | for pitch in el.pitches: 53 | currentList.append(pitch.midi) 54 | return currentList 55 | 56 | def convertStreamToList(aStream): 57 | aList = [] 58 | currentOffset = 0.0 59 | currentList = [] 60 | for el in aStream: 61 | if el.offset == currentOffset: 62 | currentList += getPitches(el) 63 | else: 64 | aList.append((currentOffset, currentList)) 65 | currentOffset = el.offset 66 | currentList = getPitches(el) 67 | return aList 68 | 69 | aList = 
convertStreamToList(aScore.flat.notes) 70 | return aList 71 | 72 | def compareSets(aSet, bSet): 73 | """Compare two sets of pitches. 74 | 75 | Parameters: 76 | 77 | aSet/bSet: list of pitches 78 | 79 | Return value: 80 | 81 | the number of mismatching objects in the two sets 82 | """ 83 | 84 | a = aSet.copy() 85 | b = bSet.copy() 86 | 87 | # Remove matching pitches from both sets 88 | aTemp = [] 89 | for obj in a: 90 | if obj in b: 91 | b.remove(obj) 92 | else: 93 | aTemp.append(obj) 94 | a = aTemp 95 | 96 | return len(a) + len(b) 97 | 98 | def costMatrix(s, t): 99 | m = len(s) 100 | n = len(t) 101 | d = np.zeros((m + 1, n + 1)) 102 | 103 | for i in range(1, m + 1): 104 | d[i, 0] = np.inf 105 | 106 | for j in range(1, n + 1): 107 | d[0, j] = np.inf 108 | 109 | for j in range(1, n + 1): 110 | for i in range(1, m + 1): 111 | cost = compareSets(s[i - 1][1], t[j - 1][1]) 112 | idx = np.argmin([d[i - 1, j], d[i, j - 1], d[i - 1, j - 1]]) 113 | if idx == 0: 114 | d[i, j] = d[i - 1, j] + cost 115 | elif idx == 1: 116 | d[i, j] = d[i, j - 1] + cost 117 | else: 118 | d[i, j] = d[i - 1, j - 1] + cost 119 | 120 | return d 121 | 122 | # scoreAlignment 123 | aList = convertScoreToListOfPitches(aScore) 124 | bList = convertScoreToListOfPitches(bScore) 125 | d = costMatrix(aList, bList) 126 | 127 | (i,j) = (d.shape[0] - 1, d.shape[1] - 1) 128 | path = [] 129 | while not (i == 0 and j == 0): 130 | aOff = aList[i-1][0] 131 | bOff = bList[j-1][0] 132 | path = [(aOff,bOff)] + path 133 | 134 | idx = np.argmin([d[i - 1, j], d[i, j - 1], d[i - 1, j - 1]]) 135 | if idx == 0: 136 | i = i - 1 137 | elif idx == 1: 138 | j = j - 1 139 | else: 140 | i, j = i - 1, j - 1 141 | 142 | return path, d 143 | 144 | 145 | 146 | def scoreSimilarity(estScore, gtScore): 147 | """Compare two musical scores. 148 | 149 | Parameters: 150 | 151 | estScore/gtScore: music21.stream.Score objects of piano scores. 
The scores must contain two 152 | music21.stream.PartStaff substreams (top and bottom staves) 153 | 154 | estScore is the estimated transcription 155 | gtScore is the ground truth 156 | 157 | Return value: 158 | 159 | a NumPy array containing the differences between the two scores: 160 | 161 | barlines, clefs, key signatures, time signatures, note, note spelling, 162 | note duration, staff assignment, rest, rest duration 163 | 164 | The differences for notes, rests and barlines are normalized with the number of symbols 165 | in the ground truth 166 | """ 167 | 168 | def isInstanceOfClasses(obj, classes): 169 | """Helper function to determine if an item is an instance of several classes""" 170 | for cls in classes: 171 | if isinstance(obj, cls): 172 | return True 173 | return False 174 | 175 | def countSymbols(aScore): 176 | """Count the number of symbols in a score 177 | 178 | Parameter: 179 | aScore a music21.Stream 180 | 181 | Return value: 182 | the number of music symbols (notes, rests, chords, barlines) in the score 183 | """ 184 | 185 | # Classes to consider 186 | CLASSES = [music21.bar.Barline, music21.note.Note, music21.note.Rest, 187 | music21.chord.Chord] 188 | 189 | nSymbols = 0 190 | for el in aScore.recurse(): 191 | if isInstanceOfClasses(el, CLASSES): 192 | nSymbols += 1 193 | 194 | return nSymbols 195 | 196 | def convertScoreToList(aScore): 197 | """Convert a piano score into a list of tuples 198 | 199 | Parameter: 200 | aScore a music21.Stream containing two music21.stream.PartStaff 201 | 202 | Return value: 203 | list of tuples (offset, staff, object) 204 | offset is a real number indicating the offset of an object in music21 terms 205 | staff is an integer indicating the staff (0 = top, 1 = bottom) 206 | object is a music21 object 207 | """ 208 | 209 | # Classes to consider 210 | CLASSES = [music21.bar.Barline, music21.clef.Clef, 211 | music21.key.Key, music21.meter.TimeSignature, music21.note.Note, music21.note.Rest, 212 | music21.chord.Chord] 
213 | 214 | def convertStreamToList(aStream, staff): 215 | aList = [] 216 | currentOffset = 0.0 217 | currentList = [] 218 | for el in aStream.recurse(): 219 | if isInstanceOfClasses(el, CLASSES): 220 | if el.getOffsetInHierarchy(aStream) == currentOffset: 221 | currentList.append((staff, el)) 222 | else: 223 | aList.append((currentOffset, currentList)) 224 | currentOffset = el.getOffsetInHierarchy(aStream) 225 | currentList = [(staff, el)] 226 | return aList 227 | 228 | def flattenStream(aStream): 229 | newStream = music21.stream.Stream() 230 | for el in aStream.recurse(): 231 | if isInstanceOfClasses(el, CLASSES): 232 | newStream.insert(el.getOffsetInHierarchy(aStream), el) 233 | elif isinstance(el, music21.stream.Measure): 234 | newStream.insert(el.getOffsetInHierarchy(aStream), music21.bar.Barline()) 235 | return newStream 236 | 237 | def getNext(iterator): 238 | try: 239 | return next(iterator) 240 | except StopIteration: 241 | return None 242 | 243 | parts = aScore.getElementsByClass([music21.stream.PartStaff, music21.stream.Part]) # get staves 244 | topStaffList = convertStreamToList(flattenStream(parts[0]), 0) 245 | bottomStaffList = convertStreamToList(flattenStream(parts[1]), 1) 246 | 247 | aList = [] 248 | tIterator = iter(topStaffList) 249 | bIterator = iter(bottomStaffList) 250 | tEl = getNext(tIterator) 251 | bEl = getNext(bIterator) 252 | 253 | while tEl or bEl: 254 | if not tEl: 255 | aList.append((bEl[0], bEl[1])) 256 | bEl = getNext(bIterator) 257 | elif not bEl: 258 | aList.append((tEl[0], tEl[1])) 259 | tEl = getNext(tIterator) 260 | else: 261 | if tEl[0] < bEl[0]: 262 | aList.append((tEl[0], tEl[1])) 263 | tEl = getNext(tIterator) 264 | elif tEl[0] > bEl[0]: 265 | aList.append((bEl[0], bEl[1])) 266 | bEl = getNext(bIterator) 267 | else: 268 | aList.append((tEl[0], tEl[1] + bEl[1])) 269 | tEl = getNext(tIterator) 270 | bEl = getNext(bIterator) 271 | 272 | return aList 273 | 274 | def countObjects(aSet): 275 | """Count objects in a set 276 | 277 
        Parameters:

        aSet: list of tuples (staff, object)
            staff is an integer indicating the staff (1 = top, 2 = bottom)
            object is a music21 object

        Return value:

        an array (indexed by ScoreErrors) with the numbers of objects in the
        set; callers add it to their totals so that every object surviving the
        matching passes is charged as an insertion/deletion error
        """

        errors = np.zeros((len(ScoreErrors.__members__)), int)

        # NOTE(review): this fragment references ScoreErrors.Barline, .Note,
        # .Rest and .Grouping, i.e. the ORIGINAL metric's enum members -- it
        # appears to belong to the unmodified (orig) variant of the metric;
        # confirm the enum in scope here actually defines these members.
        for obj in aSet:
            if isinstance(obj[1], music21.stream.Measure) or isinstance(obj[1], music21.bar.Barline):
                errors[ScoreErrors.Barline] += 1
            elif isinstance(obj[1], music21.clef.Clef):
                errors[ScoreErrors.Clef] += 1
            elif isinstance(obj[1], music21.key.Key):
                errors[ScoreErrors.KeySignature] += 1
            elif isinstance(obj[1], music21.meter.TimeSignature):
                errors[ScoreErrors.TimeSignature] += 1
            elif isinstance(obj[1], music21.note.Note):
                errors[ScoreErrors.Note] += 1
            elif isinstance(obj[1], music21.chord.Chord):
                # a chord is charged once per constituent pitch
                errors[ScoreErrors.Note] += len(obj[1].pitches)
            elif isinstance(obj[1], music21.note.Rest):
                errors[ScoreErrors.Rest] += 1
            else:
                print('Class not found:', type(obj[1]))

        return errors

    def compareSets(aSet, bSet):
        """Compare two sets of concurrent musical objects.

        Parameters:

        aSet/bSet: list of tuples (staff, object)
            staff is an integer indicating the staff (1 = top, 2 = bottom)
            object is a music21 object

        Return value:

        an array (indexed by ScoreErrors) with the differences between the
        two sets
        """

        def findEnharmonicEquivalent(note, aSet):
            """Find the first enharmonic equivalent in a set

            Parameters:

            note: a music21.note.Note object
            aSet: list of tuples (staff, object)
                staff is an integer indicating the staff (0 = top, 1 = bottom)
                object is a music21 object

            Return value:

            index of the first enharmonic equivalent of note in aSet
            -1 otherwise
            """
            # Equality on .pitch.ps (pitch space, i.e. MIDI number) ignores
            # spelling, so e.g. F#4 and Gb4 compare equal here.
            for i, obj in enumerate(aSet):
                if isinstance(obj[1], music21.note.Note) and obj[1].pitch.ps == note.pitch.ps:
                    return i
            return -1

        def splitChords(aSet):
            """Split chords into separate notes

            Parameters:

            aSet: list of tuples (staff, object)
                staff is an integer indicating the staff (0 = top, 1 = bottom)
                object is a music21 object

            Return value:
            a tuple (newSet, chords)
                newSet: aSet with split chords
                chords: the number of chords in aSet
            """
            newSet = []
            chords = 0
            for obj in aSet:
                if isinstance(obj[1], music21.chord.Chord):
                    chords += 1
                    # each chord pitch becomes a stand-alone Note carrying the
                    # chord's offset, duration and per-pitch stem direction
                    for pitch in obj[1].pitches:
                        newNote = music21.note.Note()
                        newNote.offset = obj[1].offset
                        newNote.pitch = pitch
                        newNote.duration = obj[1].duration
                        newNote.stemDirection = obj[1].getStemDirection(pitch)
                        newSet.append((obj[0], newNote))
                else:
                    newSet.append(obj)

            return newSet, chords

        def compareObj(aObj, bObj):
            # Compare music21 objects for "same notation symbol" equality;
            # each symbol type only checks the attributes the metric cares
            # about (Key -> sharps, Note -> pitch/duration/stem, ...).
            if aObj == bObj:
                return True
            if type(aObj) != type(bObj):
                return False
            if isinstance(aObj, music21.stream.Measure):
                return True
            if isinstance(aObj, music21.bar.Barline):
                return True
            if isinstance(aObj, music21.clef.Clef):
                # same concrete Clef subclass counts as equal
                if type(aObj) == type(bObj):
                    return True
            if isinstance(aObj, music21.key.Key):
                if aObj.sharps == bObj.sharps:
                    return True
            if isinstance(aObj, music21.meter.TimeSignature):
                # NOTE(review): the denominator is not compared; beatCount is
                # used instead -- confirm this is intended.
                if aObj.numerator == bObj.numerator and aObj.beatCount == bObj.beatCount:
                    return True
            if isinstance(aObj, music21.note.Note):
                if aObj.pitch == bObj.pitch and aObj.duration == bObj.duration and aObj.stemDirection == bObj.stemDirection:
                    return True
            if isinstance(aObj, music21.note.Rest):
                if aObj.duration == bObj.duration:
                    return True
            if isinstance(aObj, music21.chord.Chord):
                if aObj.duration == bObj.duration and aObj.pitches == bObj.pitches and aObj.stemDirection == bObj.stemDirection:
                    return True
            return False

        def findObj(aPair, aSet):
            # Find a pair in aSet on the same staff whose object matches
            # aPair's object (per compareObj); returns None when absent.
            for bPair in aSet:
                if aPair[0] == bPair[0]:
                    if compareObj(aPair[1], bPair[1]):
                        return bPair
            return None

        errors = np.zeros((len(ScoreErrors.__members__)), int)

        a = aSet.copy()
        b = bSet.copy()

        # Remove matching pairs from both sets
        # aTemp = []
        # for obj in a:
        #     if obj in b:
        #         b.remove(obj)
        #     else:
        #         aTemp.append(obj)
        # a = aTemp
        aTemp = []
        for pair in a:
            bPair = findObj(pair, b)
            if bPair:
                b.remove(bPair)
            else:
                aTemp.append(pair)
        a = aTemp

        # Find mismatched staff placement: an identical object found on the
        # other staff is consumed and counted as a staff-assignment error.
        aTemp = []
        for obj in a:
            bTemp = [o[1] for o in b if o[0] != obj[0]]
            if obj[1] in bTemp:
                # NOTE(review): 1 - obj[0] implies staves are numbered 0/1
                # here, while the docstrings above say 1/2 -- confirm.
                idx = b.index((1 - obj[0], obj[1]))
                del b[idx]
                errors[ScoreErrors.StaffAssignment] += 1
            else:
                aTemp.append(obj)
        a = aTemp

        # Split chords and report grouping errors
        a, aChords = splitChords(a)
        b, bChords = splitChords(b)
        errors[ScoreErrors.Grouping] += abs(aChords - bChords)

        # Find mismatches in notes: same spelled pitch but differing staff,
        # duration or stem direction; the counterpart is consumed from b.
        aTemp = []
        for obj in a:
            if isinstance(obj[1], music21.note.Note):
                found = False
                for bObj in b:
                    if isinstance(bObj[1], music21.note.Note) and bObj[1].pitch == obj[1].pitch:
                        if bObj[0] != obj[0]:
                            errors[ScoreErrors.StaffAssignment] += 1
                        if bObj[1].duration != obj[1].duration:
                            errors[ScoreErrors.NoteDuration] += 1
                        if bObj[1].stemDirection != obj[1].stemDirection:
                            errors[ScoreErrors.StemDirection] += 1
                        b.remove(bObj)
                        found = True
                        break
                if not found:
                    aTemp.append(obj)
            else:
                aTemp.append(obj)
        a = aTemp

        # Find mismatched duration of rests
        aTemp = []
        for obj in a:
            if isinstance(obj[1], music21.note.Rest):
                for bObj in b:
                    if isinstance(bObj[1], music21.note.Rest) and bObj[1].duration != obj[1].duration:
                        b.remove(bObj)
                        errors[ScoreErrors.RestDuration] += 1
                        break
                # NOTE(review): the rest stays in `a` even when a
                # duration-mismatched counterpart was consumed from `b`, so
                # countObjects below charges it again -- confirm intended.
                aTemp.append(obj)
            else:
                aTemp.append(obj)
        a = aTemp

        # Find enharmonic equivalents and report spelling mistakes and duration mistakes
        aTemp = []
        for obj in a:
            if isinstance(obj[1], music21.note.Note):
                idx = findEnharmonicEquivalent(obj[1], b)
                if idx != -1:
                    if b[idx][0] != obj[0]:
                        errors[ScoreErrors.StaffAssignment] += 1
                    if b[idx][1].duration != obj[1].duration:
                        errors[ScoreErrors.NoteDuration] += 1
                    if b[idx][1].stemDirection != obj[1].stemDirection:
                        errors[ScoreErrors.StemDirection] += 1
                    del b[idx]
                    errors[ScoreErrors.NoteSpelling] += 1
                else:
                    aTemp.append(obj)
            else:
                aTemp.append(obj)
        a = aTemp

        # whatever is still unmatched counts as plain insertions/deletions
        errors += countObjects(a)
        errors += countObjects(b)

        # print()
        # print('aSet =', aSet)
        # print('bSet =', bSet)
        # print('errors =', errors)
        # print()

        return errors

    def errorsToCost(errors):
        # Collapse an error vector into the scalar alignment cost; partial
        # mistakes (spelling, duration, stem, staff) are weighted below 1.
        cost = errors[ScoreErrors.Barline]
        cost += errors[ScoreErrors.Clef]
        cost += errors[ScoreErrors.KeySignature]
        cost += errors[ScoreErrors.TimeSignature]
        cost += errors[ScoreErrors.Note]
        cost += errors[ScoreErrors.NoteSpelling] * 1 / 4
        cost += errors[ScoreErrors.NoteDuration] * 1 / 4
        cost += errors[ScoreErrors.StemDirection] * 1 / 4
        cost += errors[ScoreErrors.StaffAssignment] * 1 / 2
        cost += errors[ScoreErrors.Grouping]
        cost += errors[ScoreErrors.Rest]
        cost += errors[ScoreErrors.RestDuration] * 1 / 2
        return cost

    def getSet(aList, start, end):
        # Collect the objects of every entry whose offset lies in
        # [start, end); entries are presumably (offset, [objects]) sorted by
        # offset -- TODO confirm against convertScoreToList.
        # NOTE(review): `set` shadows the builtin name.
        set = []
        for aTuple in aList:
            if aTuple[0] >= end:
                return set
            if aTuple[0] >= start:
                set += aTuple[1]
        return set

    # scoreSimilarity
    # Align the two scores, then compare the sets of concurrent objects
    # between consecutive alignment points and normalize note-level errors.
    path, _ = scoreAlignment(estScore, gtScore)

    aList = convertScoreToList(estScore)
    bList = convertScoreToList(gtScore)

    nSymbols = countSymbols(gtScore)

    errors = np.zeros((len(ScoreErrors.__members__)), float)

    aStart, aEnd = 0.0, 0.0
    bStart, bEnd = 0.0, 0.0
    for pair in path:
        if pair[0] != aEnd and pair[1] != bEnd:
            # both scores advanced: close the previous window and compare it
            aEnd, bEnd = pair[0], pair[1]
            errors += compareSets(getSet(aList, aStart, aEnd), getSet(bList, bStart, bEnd))
            aStart, aEnd = aEnd, aEnd
            bStart, bEnd = bEnd, bEnd
        elif pair[0] == aEnd:
            bEnd = pair[1]
        else:
            aEnd = pair[0]
    # trailing window up to the end of both scores
    errors += compareSets(getSet(aList, aStart, float('inf')), getSet(bList, bStart, float('inf')))
    # error rates: normalize note-wise aspects by the ground-truth symbol count
    for aspect in [ScoreErrors.Note, ScoreErrors.NoteSpelling, ScoreErrors.NoteDuration, ScoreErrors.StemDirection,
                   ScoreErrors.StaffAssignment, ScoreErrors.Grouping, ScoreErrors.Rest, ScoreErrors.RestDuration]:
        errors[aspect] /= nSymbols

    return errors

#
# Evaluate dataset
#

from music21 import converter
import os
import numpy as np
import scipy.io as sio

# Evaluation driver: compare each method's transcription ('F','G','C','M') of
# pieces K-1..K-19 against the ground truth and store the error tensor.
METHODS = ['F', 'G', 'C', 'M']
METHODS_ORD = [2, 3, 0, 1]  # column order used when storing results
BASEDIR = 'dataset'
N = 19
pieces = list(range(1,N+1))

# load ground-truth scores; a slot stays None when parsing fails
gt = [None] * N
for piece in pieces:
    filename = os.path.join(BASEDIR, 'K-' + str(piece) + '.mxl')
    try:
        gt[piece - 1] = converter.parse(filename)
    except:  # NOTE(review): bare except silently hides all parse errors
        print("Can't load", filename)
        pass

# results[method, piece, error-category]; -1 marks "not evaluated"
results = -np.ones((len(METHODS), N, len(ScoreErrors.__members__)))
for piece in pieces:
    if gt[piece - 1] == None:
        continue
    for method in METHODS:
        filename = os.path.join(BASEDIR, method + '-' + str(piece) + '.mxl')
        try:
            comparisonPiece = converter.parse(filename)
            print(filename, end = ' ')
            score = scoreSimilarity(gt[piece - 1], comparisonPiece)
            print(score)
            results[METHODS_ORD[METHODS.index(method)], piece - 1, :] = score
        except music21.converter.ConverterException:
            # missing/unparsable transcription: leave -1 in results
            pass
        except Exception as err:
            print(type(err), err)

print('Saving results to MAT file')
mat_results = {'results' : results}
sio.savemat('resultsWithAlignment', mat_results)
print('Done')
--------------------------------------------------------------------------------
/tokenization_tools/README.md:
--------------------------------------------------------------------------------
# Tokenization tools

This directory contains the tokenizer and de-tokenizer between **MusicXML** and proposed **score token** representation.
- [**tokenizer**](tokenizer)
  - MusicXML -> Score tokens

- [**de-tokenizer**](detokenizer)
  - Score tokens -> MusicXML

#### requirements

Python 3.6+

- **tokenizer**
  - beautifulsoup4 (4.6.3)
  - lxml (4.9.1)
  - pretty_midi (0.2.9)

- **de-tokenizer**
  - music21 (7.3.3)

Note: The library versions here are not specified ones, but **tested** ones.
--------------------------------------------------------------------------------
/tokenization_tools/detokenizer/README.md:
--------------------------------------------------------------------------------
## Overview

Detokenizer builds musical scores from token sequences, utilizing [music21](https://web.mit.edu/music21/).

## Usage

#### 1. import

```python
from tokens_to_score import tokens_to_score
```

#### 2. pass the token sequence (as a string) to the function

```Python
s = tokens_to_score(token_sequence)
```

- s : music21 Score object


#### 3. write the score into a MusicXML file with the ".write" method (of the music21 object)

```python
s.write('musicxml', 'generated_score')
```

- You'll get the "generated_score.xml" file.

## Specifications

### Supported tokens

- Score tokens (that "[score_to_tokens.py](../tokenizer/)" generates)

### Requirements

Python 3.6+

- music21 (7.3.3)

Note: The library version here is not a specified one, but a **tested** one.
43 | -------------------------------------------------------------------------------- /tokenization_tools/detokenizer/sample/generated_score.musicxml: -------------------------------------------------------------------------------- 1 | 2 | 3 | 4 | Music21 Fragment 5 | 6 | Music21 7 | 8 | 2021-11-05 9 | music21 v.6.7.1 10 | 11 | 12 | 13 | 14 | 7 15 | 40 16 | 17 | 18 | 19 | 20 | brace 21 | yes 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 10080 34 | 38 | 2 39 | 40 | G 41 | 2 42 | 43 | 44 | F 45 | 4 46 | 47 | 48 | 49 | 50 | D 51 | 5 52 | 53 | 5040 54 | 0 55 | eighth 56 | up 57 | 1 58 | begin 59 | 60 | 61 | 62 | E 63 | 5 64 | 65 | 5040 66 | 0 67 | eighth 68 | up 69 | 1 70 | continue 71 | 72 | 73 | 74 | D 75 | 5 76 | 77 | 5040 78 | 0 79 | eighth 80 | up 81 | 1 82 | continue 83 | 84 | 85 | 86 | E 87 | 5 88 | 89 | 5040 90 | 0 91 | eighth 92 | up 93 | 1 94 | end 95 | 96 | 97 | 98 | D 99 | 5 100 | 101 | 10080 102 | 0 103 | quarter 104 | up 105 | 1 106 | 107 | 108 | 109 | E 110 | 5 111 | 112 | 10080 113 | 0 114 | quarter 115 | up 116 | 1 117 | 118 | 119 | 40320 120 | 121 | 122 | 123 | B 124 | 4 125 | 126 | 10080 127 | 1 128 | quarter 129 | down 130 | 1 131 | 132 | 133 | 134 | 135 | F 136 | 4 137 | 138 | 10080 139 | 1 140 | quarter 141 | 1 142 | 143 | 144 | 145 | B 146 | 4 147 | 148 | 10080 149 | 1 150 | quarter 151 | down 152 | 1 153 | 154 | 155 | 156 | 157 | F 158 | 4 159 | 160 | 10080 161 | 1 162 | quarter 163 | 1 164 | 165 | 166 | 167 | F 168 | 4 169 | 170 | 10080 171 | 1 172 | quarter 173 | down 174 | 1 175 | 176 | 177 | 178 | 179 | B 180 | 4 181 | 182 | 10080 183 | 1 184 | quarter 185 | 1 186 | 187 | 188 | 189 | F 190 | 4 191 | 192 | 10080 193 | 1 194 | quarter 195 | down 196 | 1 197 | 198 | 199 | 200 | 201 | B 202 | 4 203 | 204 | 10080 205 | 1 206 | quarter 207 | 1 208 | 209 | 210 | 80640 211 | 212 | 213 | 214 | G 215 | 2 216 | 217 | 10080 218 | 2 219 | quarter 220 | up 221 | 2 222 | 223 | 224 | 225 | B 226 | 3 227 | 228 | 10080 229 | 2 230 | quarter 231 
| down 232 | 2 233 | 234 | 235 | 236 | 237 | G 238 | 3 239 | 240 | 10080 241 | 2 242 | quarter 243 | 2 244 | 245 | 246 | 247 | G 248 | 2 249 | 250 | 10080 251 | 2 252 | quarter 253 | up 254 | 2 255 | 256 | 257 | 258 | B 259 | 3 260 | 261 | 10080 262 | 2 263 | quarter 264 | down 265 | 2 266 | 267 | 268 | 269 | 270 | G 271 | 3 272 | 273 | 10080 274 | 2 275 | quarter 276 | 2 277 | 278 | 279 | 280 | 281 | 282 | 283 | C 284 | 5 285 | 286 | 10080 287 | 1 288 | quarter 289 | up 290 | 1 291 | 292 | 293 | 294 | 295 | E 296 | 4 297 | 298 | 10080 299 | 1 300 | quarter 301 | 1 302 | 303 | 304 | 305 | F 306 | 5 307 | 308 | 10080 309 | 1 310 | quarter 311 | down 312 | 1 313 | 314 | 315 | 316 | 317 | C 318 | 5 319 | 320 | 10080 321 | 1 322 | quarter 323 | 1 324 | 325 | 326 | 327 | 328 | F 329 | 4 330 | 331 | 10080 332 | 1 333 | quarter 334 | 1 335 | 336 | 337 | 338 | E 339 | 5 340 | 341 | 10080 342 | 1 343 | quarter 344 | up 345 | 1 346 | 347 | 348 | 349 | 350 | C 351 | 5 352 | 353 | 10080 354 | 1 355 | quarter 356 | 1 357 | 358 | 359 | 360 | 361 | E 362 | 4 363 | 364 | 10080 365 | 1 366 | quarter 367 | 1 368 | 369 | 370 | 371 | E 372 | 5 373 | 374 | 5040 375 | 1 376 | eighth 377 | down 378 | 1 379 | begin 380 | 381 | 382 | 383 | E 384 | -1 385 | 5 386 | 387 | 5040 388 | 1 389 | eighth 390 | flat 391 | down 392 | 1 393 | end 394 | 395 | 396 | 90720 397 | 398 | 399 | 400 | G 401 | 3 402 | 403 | 10080 404 | 2 405 | quarter 406 | down 407 | 2 408 | 409 | 410 | 411 | 412 | C 413 | 3 414 | 415 | 10080 416 | 2 417 | quarter 418 | 2 419 | 420 | 421 | 422 | A 423 | 3 424 | 425 | 10080 426 | 2 427 | quarter 428 | down 429 | 2 430 | 431 | 432 | 433 | 434 | C 435 | 3 436 | 437 | 10080 438 | 2 439 | quarter 440 | 2 441 | 442 | 443 | 444 | G 445 | 3 446 | 447 | 10080 448 | 2 449 | quarter 450 | down 451 | 2 452 | 453 | 454 | 455 | 456 | C 457 | 3 458 | 459 | 10080 460 | 2 461 | quarter 462 | 2 463 | 464 | 465 | 466 | 10080 467 | 2 468 | quarter 469 | 2 470 | 471 | 472 | 473 | 474 | 475 | 476 
| D 477 | 5 478 | 479 | 5040 480 | 0 481 | eighth 482 | up 483 | 1 484 | begin 485 | 486 | 487 | 488 | E 489 | 0 490 | 5 491 | 492 | 5040 493 | 0 494 | eighth 495 | natural 496 | up 497 | 1 498 | continue 499 | 500 | 501 | 502 | D 503 | 5 504 | 505 | 5040 506 | 0 507 | eighth 508 | up 509 | 1 510 | continue 511 | 512 | 513 | 514 | E 515 | 5 516 | 517 | 5040 518 | 0 519 | eighth 520 | up 521 | 1 522 | end 523 | 524 | 525 | 526 | D 527 | 5 528 | 529 | 10080 530 | 0 531 | quarter 532 | up 533 | 1 534 | 535 | 536 | 537 | A 538 | 5 539 | 540 | 10080 541 | 0 542 | quarter 543 | up 544 | 1 545 | 546 | 547 | 40320 548 | 549 | 550 | 551 | F 552 | 1 553 | 4 554 | 555 | 10080 556 | 1 557 | quarter 558 | sharp 559 | down 560 | 1 561 | 562 | 563 | 564 | 565 | C 566 | 5 567 | 568 | 10080 569 | 1 570 | quarter 571 | 1 572 | 573 | 574 | 575 | F 576 | 1 577 | 4 578 | 579 | 10080 580 | 1 581 | quarter 582 | sharp 583 | down 584 | 1 585 | 586 | 587 | 588 | 589 | C 590 | 5 591 | 592 | 10080 593 | 1 594 | quarter 595 | 1 596 | 597 | 598 | 599 | F 600 | 1 601 | 4 602 | 603 | 10080 604 | 1 605 | quarter 606 | sharp 607 | down 608 | 1 609 | 610 | 611 | 612 | 613 | C 614 | 5 615 | 616 | 10080 617 | 1 618 | quarter 619 | 1 620 | 621 | 622 | 623 | D 624 | 5 625 | 626 | 10080 627 | 1 628 | quarter 629 | down 630 | 1 631 | 632 | 633 | 634 | 635 | C 636 | 5 637 | 638 | 10080 639 | 1 640 | quarter 641 | 1 642 | 643 | 644 | 80640 645 | 646 | 647 | 648 | A 649 | 2 650 | 651 | 10080 652 | 2 653 | quarter 654 | up 655 | 2 656 | 657 | 658 | 659 | C 660 | 4 661 | 662 | 10080 663 | 2 664 | quarter 665 | down 666 | 2 667 | 668 | 669 | 670 | 671 | F 672 | 1 673 | 3 674 | 675 | 10080 676 | 2 677 | quarter 678 | sharp 679 | 2 680 | 681 | 682 | 683 | 684 | D 685 | 3 686 | 687 | 10080 688 | 2 689 | quarter 690 | 2 691 | 692 | 693 | 694 | D 695 | 2 696 | 697 | 10080 698 | 2 699 | quarter 700 | up 701 | 2 702 | 703 | 704 | 705 | C 706 | 4 707 | 708 | 10080 709 | 2 710 | quarter 711 | down 712 | 2 713 | 714 | 
715 | 716 | 717 | F 718 | 1 719 | 3 720 | 721 | 10080 722 | 2 723 | quarter 724 | sharp 725 | 2 726 | 727 | 728 | 729 | 730 | D 731 | 3 732 | 733 | 10080 734 | 2 735 | quarter 736 | 2 737 | 738 | 739 | 740 | 741 | 742 | 743 | B 744 | 4 745 | 746 | 5040 747 | 1 748 | eighth 749 | down 750 | 1 751 | begin 752 | 753 | 754 | 755 | A 756 | 5 757 | 758 | 5040 759 | 1 760 | eighth 761 | down 762 | 1 763 | continue 764 | 765 | 766 | 767 | G 768 | 5 769 | 770 | 5040 771 | 1 772 | eighth 773 | down 774 | 1 775 | continue 776 | 777 | 778 | 779 | F 780 | 0 781 | 5 782 | 783 | 5040 784 | 1 785 | eighth 786 | natural 787 | down 788 | 1 789 | end 790 | 791 | 792 | 793 | D 794 | 5 795 | 796 | 5040 797 | 1 798 | eighth 799 | up 800 | 1 801 | begin 802 | 803 | 804 | 805 | B 806 | 4 807 | 808 | 5040 809 | 1 810 | eighth 811 | up 812 | 1 813 | continue 814 | 815 | 816 | 817 | A 818 | 4 819 | 820 | 5040 821 | 1 822 | eighth 823 | up 824 | 1 825 | continue 826 | 827 | 828 | 829 | G 830 | 4 831 | 832 | 5040 833 | 1 834 | eighth 835 | up 836 | 1 837 | end 838 | 839 | 840 | 40320 841 | 842 | 843 | 844 | G 845 | 2 846 | 847 | 10080 848 | 2 849 | quarter 850 | up 851 | 2 852 | 853 | 854 | 855 | F 856 | 0 857 | 4 858 | 859 | 10080 860 | 2 861 | quarter 862 | natural 863 | down 864 | 2 865 | 866 | 867 | 868 | 869 | B 870 | 3 871 | 872 | 10080 873 | 2 874 | quarter 875 | 2 876 | 877 | 878 | 879 | 880 | G 881 | 3 882 | 883 | 10080 884 | 2 885 | quarter 886 | 2 887 | 888 | 889 | 890 | F 891 | 4 892 | 893 | 10080 894 | 2 895 | quarter 896 | down 897 | 2 898 | 899 | 900 | 901 | 902 | B 903 | 3 904 | 905 | 10080 906 | 2 907 | quarter 908 | 2 909 | 910 | 911 | 912 | 913 | G 914 | 3 915 | 916 | 10080 917 | 2 918 | quarter 919 | 2 920 | 921 | 922 | 923 | 10080 924 | 2 925 | quarter 926 | 2 927 | 928 | 929 | regular 930 | 931 | 932 | 933 | -------------------------------------------------------------------------------- /tokenization_tools/detokenizer/sample/input_tokens.txt: 
-------------------------------------------------------------------------------- 1 | R bar clef_treble time_4/4 note_D5 len_1/2 stem_up beam_start note_E5 len_1/2 stem_up beam_continue note_D5 len_1/2 stem_up beam_continue note_E5 len_1/2 stem_up beam_stop note_D5 len_1 stem_up note_E5 len_1 stem_up note_B4 note_F4 len_1 stem_down note_B4 note_F4 len_1 stem_down note_F4 note_B4 len_1 stem_down note_F4 note_B4 len_1 stem_down bar note_C5 note_E4 len_1 stem_up note_F5 note_C5 note_F4 len_1 stem_down note_E5 note_C5 note_E4 len_1 stem_up note_E5 len_1/2 stem_down beam_start note_Eb5 len_1/2 stem_down beam_stop bar note_D5 len_1/2 stem_up beam_start note_E5 len_1/2 stem_up beam_continue note_D5 len_1/2 stem_up beam_continue note_E5 len_1/2 stem_up beam_stop note_D5 len_1 stem_up note_A5 len_1 stem_up note_F#4 note_C5 len_1 stem_down note_F#4 note_C5 len_1 stem_down note_F#4 note_C5 len_1 stem_down note_D5 note_C5 len_1 stem_down bar note_B4 len_1/2 stem_down beam_start note_A5 len_1/2 stem_down beam_continue note_G5 len_1/2 stem_down beam_continue note_F5 len_1/2 stem_down beam_stop note_D5 len_1/2 stem_up beam_start note_B4 len_1/2 stem_up beam_continue note_A4 len_1/2 stem_up beam_continue note_G4 len_1/2 stem_up beam_stop L bar clef_bass time_4/4 note_G2 len_1 stem_up note_B3 note_G3 len_1 stem_down note_G2 len_1 stem_up note_B3 note_G3 len_1 stem_down bar note_G3 note_C3 len_1 stem_down note_A3 note_C3 len_1 stem_down note_G3 note_C3 len_1 stem_down rest len_1 bar note_A2 len_1 stem_up note_C4 note_F#3 note_D3 len_1 stem_down note_D2 len_1 stem_up note_C4 note_F#3 note_D3 len_1 stem_down bar note_G2 len_1 stem_up note_F4 note_B3 note_G3 len_1 stem_down note_F4 note_B3 note_G3 len_1 stem_down rest len_1 -------------------------------------------------------------------------------- /tokenization_tools/detokenizer/sample/sample_usage.ipynb: -------------------------------------------------------------------------------- 1 | { 2 | "cells": [ 3 | { 4 | "cell_type": 
"code", 5 | "execution_count": 1, 6 | "metadata": {}, 7 | "outputs": [], 8 | "source": [ 9 | "# import \"tokens_to_score.py\" (assuming the file is in the same directory)\n", 10 | "from tokens_to_score import *" 11 | ] 12 | }, 13 | { 14 | "cell_type": "code", 15 | "execution_count": 2, 16 | "metadata": {}, 17 | "outputs": [ 18 | { 19 | "data": { 20 | "text/plain": [ 21 | "'R bar clef_treble time_4/4 note_D5 len_1/2 stem_up beam_start note_E5 len_1/2 stem_up beam_continue note_D5 len_1/2 stem_up beam_continue note_E5 len_1/2 stem_up beam_stop note_D5 len_1 stem_up note_E5 len_1 stem_up note_B4 note_F4 len_1 stem_down note_B4 note_F4 len_1 stem_down note_F4 note_B4 len_1 stem_down note_F4 note_B4 len_1 stem_down bar note_C5 note_E4 len_1 stem_up note_F5 note_C5 note_F4 len_1 stem_down note_E5 note_C5 note_E4 len_1 stem_up note_E5 len_1/2 stem_down beam_start note_Eb5 len_1/2 stem_down beam_stop bar note_D5 len_1/2 stem_up beam_start note_E5 len_1/2 stem_up beam_continue note_D5 len_1/2 stem_up beam_continue note_E5 len_1/2 stem_up beam_stop note_D5 len_1 stem_up note_A5 len_1 stem_up note_F#4 note_C5 len_1 stem_down note_F#4 note_C5 len_1 stem_down note_F#4 note_C5 len_1 stem_down note_D5 note_C5 len_1 stem_down bar note_B4 len_1/2 stem_down beam_start note_A5 len_1/2 stem_down beam_continue note_G5 len_1/2 stem_down beam_continue note_F5 len_1/2 stem_down beam_stop note_D5 len_1/2 stem_up beam_start note_B4 len_1/2 stem_up beam_continue note_A4 len_1/2 stem_up beam_continue note_G4 len_1/2 stem_up beam_stop L bar clef_bass time_4/4 note_G2 len_1 stem_up note_B3 note_G3 len_1 stem_down note_G2 len_1 stem_up note_B3 note_G3 len_1 stem_down bar note_G3 note_C3 len_1 stem_down note_A3 note_C3 len_1 stem_down note_G3 note_C3 len_1 stem_down rest len_1 bar note_A2 len_1 stem_up note_C4 note_F#3 note_D3 len_1 stem_down note_D2 len_1 stem_up note_C4 note_F#3 note_D3 len_1 stem_down bar note_G2 len_1 stem_up note_F4 note_B3 note_G3 len_1 stem_down note_F4 note_B3 note_G3 
len_1 stem_down rest len_1'" 22 | ] 23 | }, 24 | "execution_count": 2, 25 | "metadata": {}, 26 | "output_type": "execute_result" 27 | } 28 | ], 29 | "source": [ 30 | "# load tokens\n", 31 | "token_sequence = open('input_tokens.txt').read()\n", 32 | "token_sequence" 33 | ] 34 | }, 35 | { 36 | "cell_type": "code", 37 | "execution_count": 3, 38 | "metadata": {}, 39 | "outputs": [], 40 | "source": [ 41 | "# convert them to music21 Score object\n", 42 | "s = tokens_to_score(token_sequence)" 43 | ] 44 | }, 45 | { 46 | "cell_type": "code", 47 | "execution_count": 4, 48 | "metadata": {}, 49 | "outputs": [ 50 | { 51 | "data": { 52 | "text/plain": [ 53 | "'generated_score.xml'" 54 | ] 55 | }, 56 | "execution_count": 4, 57 | "metadata": {}, 58 | "output_type": "execute_result" 59 | } 60 | ], 61 | "source": [ 62 | "# write into a MusicXML file\n", 63 | "s.write('musicxml', 'generated_score')" 64 | ] 65 | } 66 | ], 67 | "metadata": { 68 | "kernelspec": { 69 | "display_name": "Python 3", 70 | "language": "python", 71 | "name": "python3" 72 | }, 73 | "language_info": { 74 | "codemirror_mode": { 75 | "name": "ipython", 76 | "version": 3 77 | }, 78 | "file_extension": ".py", 79 | "mimetype": "text/x-python", 80 | "name": "python", 81 | "nbconvert_exporter": "python", 82 | "pygments_lexer": "ipython3", 83 | "version": "3.8.8" 84 | } 85 | }, 86 | "nbformat": 4, 87 | "nbformat_minor": 4 88 | } 89 | -------------------------------------------------------------------------------- /tokenization_tools/detokenizer/tokens_to_score.py: -------------------------------------------------------------------------------- 1 | from music21 import * 2 | 3 | # dictionary to change note names 4 | sharp_to_flat = {'C#': 'D-', 'D#': 'E-', 'F#': 'G-', 'G#': 'A-', 'A#': 'B-'} 5 | flat_to_sharp = {v:k for k, v in sharp_to_flat.items()} 6 | 7 | # translate note numbers into note names considering key signature 8 | def pitch_to_name(pitch_, key=key.KeySignature(0)): 9 | if pitch_.isdecimal(): 10 | name = 
str(pitch.Pitch(int(pitch_)))
        # spell according to the key signature: flat keys prefer flat names,
        # sharp keys prefer sharp names
        if key.sharps < 0:
            for k, v in sharp_to_flat.items():
                name = name.replace(k, v)
        elif key.sharps > 0:
            for k, v in flat_to_sharp.items():
                name = name.replace(k, v)
        return name
    else:
        # already a note name: convert 'b' flats to music21's '-' notation
        return pitch_.replace('b', '-')

# aggregate note(rest)-related tokens: group each note/rest with its trailing
# len/stem/beam/tie attribute tokens into one space-joined string
def aggr_note_token(tokens):
    # NOTE(review): `others` is never used in this function.
    notes, others, out = [], [], []
    note_flag, len_flag = False, False

    for t in tokens:
        parts = t.split('_')
        if parts[0] in ('note', 'rest'):
            # a new note after a completed note+len group flushes the buffer
            # (consecutive note tokens before any len token form one chord)
            if note_flag and len_flag and len(notes):
                out.append(' '.join(notes))
                notes = []
            note_flag = True
            len_flag = False
            notes.append(t)
        elif parts[0] == 'len':
            len_flag = True
            notes.append(t)
        elif parts[0] in ('stem', 'beam', 'tie'):
            notes.append(t)
        else: # other than note-related
            if len(notes):
                out.append(' '.join(notes))
                notes = []
            out.append(t)

    # buffer flush
    if len(notes):
        out.append(' '.join(notes))

    return out

# translate clef or signature token into music21 object
# (returns None for unrecognized tokens)
def single_token_to_obj(token):
    parts = token.split('_')
    if parts[0] == 'clef':
        if parts[1] == 'treble':
            return clef.TrebleClef()
        elif parts[1] == 'bass':
            return clef.BassClef()
    elif parts[0] == 'key':
        if parts[1] == 'sharp':
            return key.KeySignature(int(parts[2]))
        elif parts[1] == 'flat':
            return key.KeySignature(-1 * int(parts[2]))
        elif parts[1] == 'natural':
            return key.KeySignature(0)
    elif parts[0] == 'time':
        if '/' in parts[1]:
            return meter.TimeSignature(parts[1])
        else:
            # bare beat count: interpret <6 as x/4, otherwise as x/8
            return meter.TimeSignature(parts[1]+'/4' if int(parts[1]) < 6 else parts[1]+'/8')

# translate note(rest)-related tokens into music21 object
# tokens: one aggregated group (e.g. ['note_C5', 'note_E4', 'len_1', 'stem_up'])
# key: current key signature, used to spell numeric pitches
# returns a Rest, a Note/Chord, or a list of tied Notes/Chords when the group
# carries several len tokens
def note_token_to_obj(tokens, key):
    if tokens[0] == 'rest': # for rests
        length = str_to_float(tokens[1])
        return note.Rest(quarterLength=length)

    # for notes
    note_names = [pitch_to_name(t.split('_')[1], key) for t in tokens if t.split('_')[0] == 'note']
    lengths = [str_to_float(t) for t in tokens if t.split('_')[0] == 'len']
    # stem direction / beams may come from dedicated tokens or be packed into
    # a concatenated len_<dur>_<stem>_<beam...> token
    direction = [t.split('_')[1] for t in tokens if t.split('_')[0] in ('stem', 'dir')] + [t.split('_')[2] for t in tokens if t.split('_')[0] == 'len' and len(t.split('_')) >= 3]
    beams = [t.split('_')[1:] for t in tokens if t.split('_')[0] == 'beam'] + [t.split('_')[3:] for t in tokens if t.split('_')[0] == 'len' and len(t.split('_')) >= 4]
    tie_ = [t.split('_')[1] for t in tokens if t.split('_')[0] == 'tie']

    if len(note_names) > 1: # chord
        if len(lengths) > 1:
            # several len tokens: emit one chord per length, tied together
            chords = []
            for i, l in enumerate(lengths):
                chord_ = chord.Chord(note_names, quarterLength=l)
                if len(direction):
                    chord_.stemDirection = direction[0]

                if len(beams):
                    append_beams(chord_, beams)

                # NOTE(review): an explicit tie token forces 'continue' on
                # every segment (tie_[0] is ignored) -- confirm intended;
                # otherwise first/last segments start/stop the tie
                if len(tie_):
                    chord_.tie = tie.Tie('continue')
                elif i == 0:
                    chord_.tie = tie.Tie('start')
                elif i == len(lengths) - 1:
                    chord_.tie = tie.Tie('stop')
                else:
                    chord_.tie = tie.Tie('continue')

                chords.append(chord_)

            return chords
        else:
            chord_ = chord.Chord(note_names, quarterLength=lengths[0])
            if len(direction):
                chord_.stemDirection = direction[0]
            if len(beams):
                append_beams(chord_, beams)
            if len(tie_):
                chord_.tie = tie.Tie(tie_[0])
            return chord_
    else: # note
        if len(lengths) > 1:
            # several len tokens: emit one note per length, tied together
            notes = []
            for i, l in enumerate(lengths):
                note_ = note.Note(note_names[0], quarterLength=l)
                if len(direction):
                    note_.stemDirection = direction[0]

                if len(beams):
                    append_beams(note_, beams)

                # same tie policy as the chord branch above
                if len(tie_):
                    note_.tie = tie.Tie('continue')
                elif i == 0:
                    note_.tie = tie.Tie('start')
                elif i == len(lengths) - 1:
                    note_.tie = tie.Tie('stop')
                else:
                    note_.tie = tie.Tie('continue')

                notes.append(note_)

            return notes
        else:
            note_ = note.Note(note_names[0], quarterLength=lengths[0])
            if len(direction):
                note_.stemDirection = direction[0]
            if len(beams):
                append_beams(note_, beams)
            if len(tie_):
                note_.tie = tie.Tie(tie_[0])
            return note_

# [aux func] translate note length into float number
# accepts either a bare length ('1/2') or a len token ('len_1/2')
def str_to_float(t):
    length = t.split('_')[1] if 'len' in t else t
    if '/' in length:
        numerator, denominator = length.split('/')
        return int(numerator) / int(denominator)
    else:
        return float(length)

# [aux func] append beams property to music21 Note or Chord object
# only the first beam group (beams[0]) is applied; a 'x-y' element encodes a
# type-direction pair (e.g. 'partial-right')
def append_beams(obj, beams):
    for b in beams[0]:
        if '-' in b:
            former, latter = b.split('-')
            obj.beams.append(former, latter)
        else:
            obj.beams.append(b)

# build one staff (music21 PartStaff) from its token sequence
# key_: initial key signature (count of sharps); start_voice: id given to the
# first Voice created within each measure
def tokens_to_PartStaff(tokens, key_=0, start_voice=1):
    tokens = concatenated_to_regular(tokens)

    p = stream.PartStaff()
    k = key.KeySignature(key_)

    voice_id = start_voice
    voice_flag = False   # currently inside a voice
    after_voice = False  # at least one voice already closed in this measure
    voice_start = None   # offset (quarterLengths) where the voices begin

    ottava_flag = False      # NOTE(review): never set True in this fragment
    ottava_elements = []

    tokens = aggr_note_token(tokens)

    # `m` is first created by the leading 'bar' token; the sequence is
    # assumed to start with 'bar' (otherwise `m` would be unbound).
    for i, t in enumerate(tokens):
        if t == 'bar':
            if i != 0:
                p.append(m)
            m = stream.Measure()
            voice_id = start_voice
            voice_start = None
            voice_flag = False
            after_voice = False
        elif t == '':
            # NOTE(review): empty-string token -- looks like a stripped
            # voice-start markup token (e.g. '<voice>'); confirm against the
            # tokenizer's vocabulary.
            v = stream.Voice(id=voice_id)
            voice_flag = True
            if voice_start is None:
                voice_start = m.duration.quarterLength # record the start point of voice
        elif t == '':
            # NOTE(review): duplicate empty-token branch (unreachable as
            # written); presumably the stripped voice-end token ('</voice>').
            if voice_flag:
                v.makeAccidentals(useKeySignature=k)
                # shift the voice's elements to the recorded start offset
                for element in v:
                    element.offset += voice_start
                m.append(v)
                voice_id += 1
                voice_flag = False
                after_voice = True
        elif t.split('_')[0] in ('clef', 'key', 'time'):
            if t[:11] == 'key_natural' and i+1 < len(tokens) and tokens[i+1].split('_')[0] == 'key':
                continue # workaround for MuseScore (which ignores consecutive key signatures): if key signatures appear in succession, skip the one with natural
            o = single_token_to_obj(t)
            if voice_flag:
                v.append(o)
            else:
                m.append(o)
            if t.split('_')[0] == 'key': # generate another key signature object to use makeAccidentals and to translate note number to name
                k = o
        elif t[:4] in ('note', 'rest'):
            n = note_token_to_obj(t.split(), k)
            if ottava_flag:
                ottava_elements.append(n)

            if voice_flag:
                v.append(n)
            else:
                m.append(n)

            if after_voice:
                # elements appended after closed voices get auto-offsets past
                # those voices; pull them back by the voices' total length.
                # NOTE(review): assumes every closed voice has v's length --
                # confirm.
                n.offset -= v.quarterLength * (voice_id - 1)
    # last measure
    p.append(m)
    p.makeAccidentals()

    return p

# expand concatenated attribute tokens (len_<dur>_<stem>_<beam...>) back into
# regular len/stem/beam tokens; other tokens pass through unchanged
def concatenated_to_regular(tokens):
    regular_tokens = []
    for t in tokens:
        if t.startswith('len') or t.startswith('attr'):
            attrs = t.split('_')
            if len(attrs) == 2:
                regular_tokens.append(f'len_{attrs[1]}')
            elif len(attrs) == 3:
                regular_tokens += [f'len_{attrs[1]}', f'stem_{attrs[2]}']
            else:
                regular_tokens += [f'len_{attrs[1]}', f'stem_{attrs[2]}', f'beam_{"_".join(attrs[3:])}']
        else:
            regular_tokens.append(t)
    return regular_tokens

# build music21 Score object from a token sequence (string)
def tokens_to_score(string, voice_numbering=False):
    # split into right-hand ('R ...') and left-hand ('L ...') staff streams
    R_str, L_str = split_R_L(string)
    R_tokens = R_str.split()
    L_tokens = L_str.split()
    if voice_numbering:
        # number left-hand voices after the largest right-hand voice count
        r = tokens_to_PartStaff(R_tokens)
        r_voices = max([len(m.voices) if m.hasVoices() else 1 for m in r])
        l = tokens_to_PartStaff(L_tokens, start_voice=r_voices+1)
    else:
        r = tokens_to_PartStaff(R_tokens, start_voice=0)
        l = tokens_to_PartStaff(L_tokens, start_voice=0)

    # add last barline
    r.elements[-1].rightBarline = bar.Barline('regular')
    l.elements[-1].rightBarline = bar.Barline('regular')

    # brace the two staves together as one grand staff
    s = stream.Score()
    g = layout.StaffGroup([r, l], symbol='brace', barTogether=True)
    s.append([g, r, l])
    return s

# split a raw token string into the right-hand and left-hand substrings
# (the parts after the 'R' and 'L' markers); L is '' when absent
def split_R_L(string):
    tokens = string.split()
    tokens = concatenated_to_regular(tokens)

    if 'L' in tokens:
        R = ' '.join(tokens[tokens.index('R')+1:tokens.index('L')])
        L = ' '.join(tokens[tokens.index('L')+1:])
    else:
        R = ' '.join(tokens[tokens.index('R')+1:])
        L = ''
    return R, L
--------------------------------------------------------------------------------
/tokenization_tools/requirements.txt:
--------------------------------------------------------------------------------
beautifulsoup4
lxml
music21
pretty_midi
--------------------------------------------------------------------------------
/tokenization_tools/tokenizer/README.md:
--------------------------------------------------------------------------------
## Overview

Tokenizer creates token sequences from musical scores, utilizing [Beautiful Soup](https://www.crummy.com/software/BeautifulSoup/).

## Usage

#### 1. import

```python
from score_to_tokens import MusicXML_to_tokens
```

#### 2. pass a score path to "MusicXML_to_tokens" function

```Python
tokens = MusicXML_to_tokens('input_score.musicxml')
```

- The list of tokens will be returned.
20 | 21 | ## Specifications 22 | 23 | ### Supported scores / formats 24 | 25 | - Piano scores (for both hands) 26 | - MusicXML format 27 | 28 | ### Supported score elements 29 | 30 | - Barline 31 | - Clef (treble / bass) 32 | - Key Signature 33 | - Time Signature 34 | - Note 35 | - note name (+ accidental) / length / stem direction / beam / tie 36 | - Rest 37 | - length 38 | 39 | ### Requirements 40 | 41 | Python 3.6+ 42 | 43 | - beautifulsoup4 (4.6.3) 44 | - lxml (4.9.1) 45 | - pretty_midi (0.2.9) 46 | 47 | Note: The library versions here are not specified ones, but **tested** ones. 48 | -------------------------------------------------------------------------------- /tokenization_tools/tokenizer/sample/generated_tokens.txt: -------------------------------------------------------------------------------- 1 | R bar clef_treble key_sharp_3 time_3/4 note_E5 len_2 stem_down tie_stop note_C#5 len_1/2 stem_down beam_start note_E5 len_1/2 stem_down beam_stop bar note_E5 len_2 stem_up note_C#5 len_1/2 stem_up beam_start note_B4 len_1/4 stem_up beam_continue_start note_A4 len_1/4 stem_up beam_stop_stop note_A4 len_2 stem_down note_E4 len_1 stem_down bar note_C#5 len_2 stem_up note_B4 len_1 stem_up note_G#4 len_3 stem_down bar note_A4 len_2 stem_up note_F#4 len_1 stem_up note_D4 len_3 stem_down L bar clef_bass key_sharp_3 time_3/4 note_G#3 note_E3 note_A2 len_2 stem_down note_G#3 note_E3 note_A2 len_1 stem_down bar note_F#2 len_1/2 stem_up beam_start note_C#3 len_1/2 stem_up beam_stop note_F#3 len_1/2 stem_down beam_start note_G#3 len_1/2 stem_down beam_stop note_A3 len_1/2 stem_down beam_start note_B3 len_1/2 stem_down beam_stop bar note_E2 len_1/2 stem_up beam_start note_C#3 len_1/2 stem_up beam_stop note_E3 len_1/2 stem_down beam_start note_F#3 len_1/2 stem_down beam_stop note_C#4 len_1/2 stem_down beam_start note_G#3 len_1/2 stem_down beam_stop bar note_D2 len_1/2 stem_up beam_start note_A2 len_1/2 stem_up beam_stop note_D3 len_1/2 stem_down beam_start note_F#3 len_1/2 
stem_down beam_stop note_G#3 len_1/2 stem_down beam_start note_A3 len_1/2 stem_down beam_stop -------------------------------------------------------------------------------- /tokenization_tools/tokenizer/sample/input_score.musicxml: -------------------------------------------------------------------------------- 1 | 2 | 3 | 4 | 5 | Už z hor zní zvon 6 | 7 | 8 | 9 | MuseScore 3.1.0 10 | 2021-05-13 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 7.05556 21 | 40 22 | 23 | 24 | 1683.78 25 | 1190.55 26 | 27 | 56.6929 28 | 56.6929 29 | 56.6929 30 | 113.386 31 | 32 | 33 | 56.6929 34 | 56.6929 35 | 56.6929 36 | 113.386 37 | 38 | 39 | 40 | 41 | 42 | 43 | Traditional 44 | 45 | Arranged by Markéta Kapustová 46 | 47 | 48 | 49 | Amazing grace 50 | 51 | 52 | 53 | Piano 54 | Pno. 55 | 56 | Piano 57 | 58 | 59 | 60 | 1 61 | 1 62 | 78.7402 63 | 0 64 | 65 | 66 | 67 | 68 | 4 69 | F 70 | 4 71 | 72 | G 73 | 2 74 | 75 | 3 76 | 80 | 81 | 82 | E 83 | 5 84 | 85 | 8 86 | 87 | 1 88 | half 89 | down 90 | 1 91 | 92 | 93 | 94 | 95 | 96 | 97 | C 98 | 1 99 | 5 100 | 101 | 2 102 | 1 103 | eighth 104 | down 105 | 1 106 | begin 107 | 108 | 109 | 110 | E 111 | 5 112 | 113 | 2 114 | 1 115 | eighth 116 | down 117 | 1 118 | end 119 | 120 | 121 | 12 122 | 123 | 124 | 125 | 126 | 127 | 2 128 | 129 | 130 | 131 | A 132 | 2 133 | 134 | 8 135 | 5 136 | half 137 | down 138 | 2 139 | 140 | 141 | 142 | 143 | E 144 | 3 145 | 146 | 8 147 | 5 148 | half 149 | down 150 | 2 151 | 152 | 153 | 154 | 155 | G 156 | 1 157 | 3 158 | 159 | 8 160 | 5 161 | half 162 | down 163 | 2 164 | 165 | 166 | 167 | A 168 | 2 169 | 170 | 4 171 | 5 172 | quarter 173 | down 174 | 2 175 | 176 | 177 | 178 | 179 | E 180 | 3 181 | 182 | 4 183 | 5 184 | quarter 185 | down 186 | 2 187 | 188 | 189 | 190 | 191 | G 192 | 1 193 | 3 194 | 195 | 4 196 | 5 197 | quarter 198 | down 199 | 2 200 | 201 | 202 | 203 | 204 | 205 | 2 206 | 207 | 208 | 209 | 210 | 211 | E 212 | 5 213 | 214 | 8 215 | 1 216 | half 217 | up 218 | 1 219 | 220 | 221 | 222 | C 223 
| 1 224 | 5 225 | 226 | 2 227 | 1 228 | eighth 229 | up 230 | 1 231 | begin 232 | 233 | 234 | 235 | B 236 | 4 237 | 238 | 1 239 | 1 240 | 16th 241 | up 242 | 1 243 | continue 244 | begin 245 | 246 | 247 | 248 | A 249 | 4 250 | 251 | 1 252 | 1 253 | 16th 254 | up 255 | 1 256 | end 257 | end 258 | 259 | 260 | 12 261 | 262 | 263 | 264 | A 265 | 4 266 | 267 | 8 268 | 2 269 | half 270 | down 271 | 1 272 | 273 | 274 | 275 | E 276 | 4 277 | 278 | 4 279 | 2 280 | quarter 281 | down 282 | 1 283 | 284 | 285 | 12 286 | 287 | 288 | 289 | 290 | 291 | 2 292 | 293 | 294 | 295 | F 296 | 1 297 | 2 298 | 299 | 2 300 | 5 301 | eighth 302 | up 303 | 2 304 | begin 305 | 306 | 307 | 308 | C 309 | 1 310 | 3 311 | 312 | 2 313 | 5 314 | eighth 315 | up 316 | 2 317 | end 318 | 319 | 320 | 321 | F 322 | 1 323 | 3 324 | 325 | 2 326 | 5 327 | eighth 328 | down 329 | 2 330 | begin 331 | 332 | 333 | 334 | G 335 | 1 336 | 3 337 | 338 | 2 339 | 5 340 | eighth 341 | down 342 | 2 343 | end 344 | 345 | 346 | 347 | A 348 | 3 349 | 350 | 2 351 | 5 352 | eighth 353 | down 354 | 2 355 | begin 356 | 357 | 358 | 359 | B 360 | 3 361 | 362 | 2 363 | 5 364 | eighth 365 | down 366 | 2 367 | end 368 | 369 | 370 | 371 | 372 | 373 | 2 374 | 375 | 376 | 377 | 378 | 379 | 380 | 21.00 381 | 0.00 382 | 383 | 129.65 384 | 385 | 386 | 65.00 387 | 388 | 389 | 390 | 391 | C 392 | 1 393 | 5 394 | 395 | 8 396 | 1 397 | half 398 | up 399 | 1 400 | 401 | 402 | 403 | B 404 | 4 405 | 406 | 4 407 | 1 408 | quarter 409 | up 410 | 1 411 | 412 | 413 | 12 414 | 415 | 416 | 417 | G 418 | 1 419 | 4 420 | 421 | 12 422 | 2 423 | half 424 | 425 | down 426 | 1 427 | 428 | 429 | 12 430 | 431 | 432 | 433 | 434 | 435 | 2 436 | 437 | 438 | 439 | E 440 | 2 441 | 442 | 2 443 | 5 444 | eighth 445 | up 446 | 2 447 | begin 448 | 449 | 450 | 451 | C 452 | 1 453 | 3 454 | 455 | 2 456 | 5 457 | eighth 458 | up 459 | 2 460 | end 461 | 462 | 463 | 464 | E 465 | 3 466 | 467 | 2 468 | 5 469 | eighth 470 | down 471 | 2 472 | begin 473 | 474 | 475 | 476 | 
F 477 | 1 478 | 3 479 | 480 | 2 481 | 5 482 | eighth 483 | down 484 | 2 485 | end 486 | 487 | 488 | 489 | C 490 | 1 491 | 4 492 | 493 | 2 494 | 5 495 | eighth 496 | down 497 | 2 498 | begin 499 | 500 | 501 | 502 | G 503 | 1 504 | 3 505 | 506 | 2 507 | 5 508 | eighth 509 | down 510 | 2 511 | end 512 | 513 | 514 | 515 | 516 | 517 | 2 518 | 519 | 520 | 521 | 522 | 523 | A 524 | 4 525 | 526 | 8 527 | 1 528 | half 529 | up 530 | 1 531 | 532 | 533 | 534 | F 535 | 1 536 | 4 537 | 538 | 4 539 | 1 540 | quarter 541 | up 542 | 1 543 | 544 | 545 | 12 546 | 547 | 548 | 549 | D 550 | 4 551 | 552 | 12 553 | 2 554 | half 555 | 556 | down 557 | 1 558 | 559 | 560 | 12 561 | 562 | 563 | 564 | 565 | 566 | 2 567 | 568 | 569 | 570 | D 571 | 2 572 | 573 | 2 574 | 5 575 | eighth 576 | up 577 | 2 578 | begin 579 | 580 | 581 | 582 | A 583 | 2 584 | 585 | 2 586 | 5 587 | eighth 588 | up 589 | 2 590 | end 591 | 592 | 593 | 594 | D 595 | 3 596 | 597 | 2 598 | 5 599 | eighth 600 | down 601 | 2 602 | begin 603 | 604 | 605 | 606 | F 607 | 1 608 | 3 609 | 610 | 2 611 | 5 612 | eighth 613 | down 614 | 2 615 | end 616 | 617 | 618 | 619 | G 620 | 1 621 | 3 622 | 623 | 2 624 | 5 625 | eighth 626 | down 627 | 2 628 | begin 629 | 630 | 631 | 632 | A 633 | 3 634 | 635 | 2 636 | 5 637 | eighth 638 | down 639 | 2 640 | end 641 | 642 | 643 | 644 | 645 | 646 | 2 647 | 648 | 649 | 650 | -------------------------------------------------------------------------------- /tokenization_tools/tokenizer/sample/sample_usage.ipynb: -------------------------------------------------------------------------------- 1 | { 2 | "cells": [ 3 | { 4 | "cell_type": "code", 5 | "execution_count": 1, 6 | "metadata": {}, 7 | "outputs": [], 8 | "source": [ 9 | "# import \"score_to_tokens.py\" (assuming the file is in the same directory)\n", 10 | "from score_to_tokens import *" 11 | ] 12 | }, 13 | { 14 | "cell_type": "code", 15 | "execution_count": 2, 16 | "metadata": {}, 17 | "outputs": [ 18 | { 19 | "data": { 20 | "text/plain": [ 
21 | "'R bar clef_treble key_sharp_3 time_3/4 note_E5 len_2 stem_down tie_stop note_C#5 len_1/2 stem_down beam_start note_E5 len_1/2 stem_down beam_stop bar note_E5 len_2 stem_up note_C#5 len_1/2 stem_up beam_start note_B4 len_1/4 stem_up beam_continue_start note_A4 len_1/4 stem_up beam_stop_stop note_A4 len_2 stem_down note_E4 len_1 stem_down bar note_C#5 len_2 stem_up note_B4 len_1 stem_up note_G#4 len_3 stem_down bar note_A4 len_2 stem_up note_F#4 len_1 stem_up note_D4 len_3 stem_down L bar clef_bass key_sharp_3 time_3/4 note_G#3 note_E3 note_A2 len_2 stem_down note_G#3 note_E3 note_A2 len_1 stem_down bar note_F#2 len_1/2 stem_up beam_start note_C#3 len_1/2 stem_up beam_stop note_F#3 len_1/2 stem_down beam_start note_G#3 len_1/2 stem_down beam_stop note_A3 len_1/2 stem_down beam_start note_B3 len_1/2 stem_down beam_stop bar note_E2 len_1/2 stem_up beam_start note_C#3 len_1/2 stem_up beam_stop note_E3 len_1/2 stem_down beam_start note_F#3 len_1/2 stem_down beam_stop note_C#4 len_1/2 stem_down beam_start note_G#3 len_1/2 stem_down beam_stop bar note_D2 len_1/2 stem_up beam_start note_A2 len_1/2 stem_up beam_stop note_D3 len_1/2 stem_down beam_start note_F#3 len_1/2 stem_down beam_stop note_G#3 len_1/2 stem_down beam_start note_A3 len_1/2 stem_down beam_stop'" 22 | ] 23 | }, 24 | "execution_count": 2, 25 | "metadata": {}, 26 | "output_type": "execute_result" 27 | } 28 | ], 29 | "source": [ 30 | "# load MusicXML file and convert its content to tokens\n", 31 | "tokens = MusicXML_to_tokens('input_score.musicxml')\n", 32 | "' '.join(tokens)" 33 | ] 34 | }, 35 | { 36 | "cell_type": "code", 37 | "execution_count": 3, 38 | "metadata": {}, 39 | "outputs": [], 40 | "source": [ 41 | "# write out\n", 42 | "with open('generated_tokens.txt', 'w') as f:\n", 43 | " f.write(' '.join(tokens))" 44 | ] 45 | } 46 | ], 47 | "metadata": { 48 | "kernelspec": { 49 | "display_name": "Python 3", 50 | "language": "python", 51 | "name": "python3" 52 | }, 53 | "language_info": { 54 | 
"codemirror_mode": { 55 | "name": "ipython", 56 | "version": 3 57 | }, 58 | "file_extension": ".py", 59 | "mimetype": "text/x-python", 60 | "name": "python", 61 | "nbconvert_exporter": "python", 62 | "pygments_lexer": "ipython3", 63 | "version": "3.8.8" 64 | } 65 | }, 66 | "nbformat": 4, 67 | "nbformat_minor": 4 68 | } 69 | -------------------------------------------------------------------------------- /tokenization_tools/tokenizer/score_to_tokens.py: -------------------------------------------------------------------------------- 1 | from bs4 import BeautifulSoup 2 | from bs4.element import Tag 3 | from fractions import Fraction 4 | import pretty_midi 5 | 6 | def attributes_to_tokens(attributes, staff=None): # tokenize 'attributes' section in MusicXML 7 | tokens = [] 8 | divisions = None 9 | 10 | for child in attributes.contents: 11 | type_ = child.name 12 | if type_ == 'divisions': 13 | divisions = int(child.text) 14 | elif type_ in ('clef', 'key', 'time'): 15 | if staff is not None: 16 | if 'number' in child.attrs and int(child['number']) != staff: 17 | continue 18 | tokens.append(attribute_to_token(child)) 19 | 20 | return tokens, divisions 21 | 22 | def attribute_to_token(child): # clef, key signature, and time signature 23 | type_ = child.name 24 | if type_ == 'clef': 25 | if child.sign.text == 'G': 26 | return 'clef_treble' 27 | elif child.sign.text == 'F': 28 | return 'clef_bass' 29 | elif type_ == 'key': 30 | key = int(child.fifths.text) 31 | if key < 0: 32 | return f'key_flat_{abs(key)}' 33 | elif key > 0: 34 | return f'key_sharp_{key}' 35 | else: 36 | return f'key_natural_{key}' 37 | elif type_ == 'time': 38 | times = [int(c.text) for c in child.contents if isinstance(c, Tag)] # excluding '\n' 39 | if times[1] == 2: 40 | return f'time_{times[0]*2}/{times[1]*2}' 41 | elif times[1] > 4: 42 | fraction = str(Fraction(times[0], times[1])) 43 | if int(fraction.split('/')[1]) == 2: # X/2 44 | return 
f"time_{int(fraction.split('/')[0])*2}/{int(fraction.split('/')[0])*2}" 45 | else: 46 | return 'time_' + fraction 47 | else: 48 | return f'time_{times[0]}/{times[1]}' 49 | 50 | def aggregate_notes(voice_notes): # notes to chord 51 | for note in voice_notes[1:]: 52 | if note.chord is not None: 53 | last_note = note.find_previous('note') 54 | last_note.insert(0, note.pitch) 55 | note.decompose() 56 | 57 | def note_to_tokens(note, divisions=8, note_name=True): # notes and rests 58 | beam_translations = {'begin': 'start', 'end': 'stop', 'forward hook': 'partial-right', 'backward hook': 'partial-left'} 59 | 60 | if note.duration is None: # gracenote 61 | return [] 62 | 63 | duration_in_fraction = str(Fraction(int(note.duration.text), divisions)) 64 | 65 | if note.rest: 66 | return ['rest', f'len_{duration_in_fraction}'] # for rests 67 | 68 | tokens = [] 69 | 70 | # pitches 71 | for pitch in note.find_all('pitch'): 72 | if note_name: 73 | if pitch.alter: 74 | alter_to_symbol= {'-2': 'bb', '-1': 'b', '0':'', '1': '#', '2': '##'} 75 | tokens.append(f"note_{pitch.step.text}{alter_to_symbol[pitch.alter.text]}{pitch.octave.text}") 76 | else: 77 | tokens.append(f"note_{pitch.step.text}{pitch.octave.text}") 78 | else: 79 | note_number = pretty_midi.note_name_to_number(pitch.step.text + pitch.octave.text) # 'C4' -> 60 80 | if pitch.alter: 81 | note_number += int(pitch.alter.text) 82 | tokens.append(f'note_{note_number}') 83 | 84 | # len 85 | tokens.append(f'len_{duration_in_fraction}') 86 | 87 | if note.stem: 88 | tokens.append(f'stem_{note.stem.text}') 89 | 90 | if note.beam: 91 | beams = note.find_all('beam') 92 | tokens.append('beam_' + '_'.join([beam_translations[b.text] if b.text in beam_translations else b.text for b in beams])) 93 | 94 | if note.tied: 95 | tokens.append('tie_' + note.tied.attrs['type']) 96 | 97 | return tokens 98 | 99 | def element_segmentation(measure, soup, staff=None): # divide elements into three sections 100 | voice_starts, voice_ends = {}, {} 101 | 
position = 0 102 | for element in measure.contents: 103 | if element.name == 'note': 104 | if element.duration is None: # gracenote 105 | continue 106 | 107 | voice = element.voice.text 108 | duration = int(element.duration.text) 109 | if element.chord: # rewind for concurrent notes 110 | position -= last_duration 111 | 112 | if element.staff and int(element.staff.text) == staff: 113 | voice_starts[voice] = min(voice_starts[voice], position) if voice in voice_starts else position 114 | start_tag = soup.new_tag('start') 115 | start_tag.string = str(position) 116 | element.append(start_tag) 117 | 118 | position += duration 119 | 120 | if element.staff and int(element.staff.text) == staff: 121 | voice_ends[voice] = max(voice_ends[voice], position) if voice in voice_ends else position 122 | end_tag = soup.new_tag('end') 123 | end_tag.string = str(position) 124 | element.append(end_tag) 125 | 126 | last_duration = duration 127 | elif element.name == 'backup': 128 | position -= int(element.duration.text) 129 | elif element.name == 'forward': 130 | position += int(element.duration.text) 131 | else: # other types 132 | start_tag = soup.new_tag('start') 133 | end_tag = soup.new_tag('end') 134 | 135 | start_tag.string = str(position) 136 | end_tag.string = str(position) 137 | 138 | element.append(start_tag) 139 | element.append(end_tag) 140 | 141 | # voice section 142 | voice_start = sorted(voice_starts.values())[1] if voice_starts else 0 143 | voice_end = sorted(voice_ends.values(), reverse=True)[1] if voice_ends else 0 144 | 145 | pre_voice_elements, post_voice_elements, voice_elements = [], [], [] 146 | for element in measure.contents: 147 | if element.name in ('backup', 'forward'): 148 | continue 149 | if element.name == 'note' and element.duration is None: # gracenote 150 | continue 151 | if staff is not None: 152 | if element.staff and int(element.staff.text) != staff: 153 | continue 154 | 155 | if voice_starts or voice_ends: 156 | if int(element.end.text) <= 
voice_start: 157 | pre_voice_elements.append(element) 158 | elif voice_end <= int(element.start.text): 159 | post_voice_elements.append(element) 160 | else: 161 | voice_elements.append(element) 162 | else: 163 | pre_voice_elements.append(element) 164 | 165 | return pre_voice_elements, voice_elements, post_voice_elements 166 | 167 | def measures_to_tokens(measures, soup, staff=None, note_name=True): 168 | divisions = 0 169 | tokens = [] 170 | for measure in measures: 171 | 172 | tokens.append('bar') 173 | if staff is not None: 174 | notes = [n for n in measure.find_all('note') if n.staff and int(n.staff.text) == staff] 175 | else: 176 | notes = measure.find_all('note') 177 | 178 | voices = list(set([n.voice.text for n in notes if n.voice])) 179 | for voice in voices: 180 | voice_notes = [n for n in notes if n.voice and n.voice.text == voice] 181 | aggregate_notes(voice_notes) 182 | 183 | if len(voices) > 1: 184 | pre_voice_elements, voice_elements, post_voice_elements = element_segmentation(measure, soup, staff) 185 | 186 | for element in pre_voice_elements: 187 | if element.name == 'attributes': 188 | attr_tokens, div = attributes_to_tokens(element, staff) 189 | tokens += attr_tokens 190 | divisions = div if div else divisions 191 | elif element.name == 'note': 192 | tokens += note_to_tokens(element, divisions, note_name) 193 | 194 | if voice_elements: 195 | for voice in voices: 196 | tokens.append('') 197 | for element in voice_elements: 198 | if (element.voice and element.voice.text == voice) or (not element.voice and voice == '1'): 199 | if element.name == 'attributes': 200 | attr_tokens, div = attributes_to_tokens(element, staff) 201 | tokens += attr_tokens 202 | divisions = div if div else divisions 203 | elif element.name == 'note': 204 | tokens += note_to_tokens(element, divisions, note_name) 205 | tokens.append('') 206 | 207 | for element in post_voice_elements: 208 | if element.name == 'attributes': 209 | attr_tokens, div = attributes_to_tokens(element, 
staff) 210 | tokens += attr_tokens 211 | divisions = div if div else divisions 212 | elif element.name == 'note': 213 | tokens += note_to_tokens(element, divisions, note_name) 214 | else: 215 | for element in measure.contents: 216 | if staff is not None: 217 | if element.name in ('attributes', 'note') and element.staff and int(element.staff.text) != staff: 218 | continue 219 | if element.name == 'attributes': 220 | attr_tokens, div = attributes_to_tokens(element, staff) 221 | tokens += attr_tokens 222 | divisions = div if div else divisions 223 | elif element.name == 'note': 224 | tokens += note_to_tokens(element, divisions, note_name) 225 | 226 | return tokens 227 | 228 | def load_MusicXML(mxml_path): # load MusicXML contents using BeautifulSoup 229 | soup = BeautifulSoup(open(mxml_path, encoding='utf-8'), 'lxml-xml', from_encoding='utf-8') # MusicXML 230 | for tag in soup(string='\n'): # eliminate line breaks 231 | tag.extract() 232 | 233 | parts = soup.find_all('part') 234 | 235 | return [part.find_all('measure') for part in parts], soup 236 | 237 | def MusicXML_to_tokens(soup_or_mxml_path, note_name=True): # use this method 238 | if type(soup_or_mxml_path) is str: 239 | parts, soup = load_MusicXML(soup_or_mxml_path) 240 | else: 241 | soup = soup_or_mxml_path 242 | for tag in soup(string='\n'): # eliminate line breaks 243 | tag.extract() 244 | 245 | parts = [part.find_all('measure') for part in soup.find_all('part')] 246 | 247 | if len(parts) == 1: 248 | tokens = ['R'] + measures_to_tokens(parts[0], soup, staff=1, note_name=note_name) 249 | tokens += ['L'] + measures_to_tokens(parts[0], soup, staff=2, note_name=note_name) 250 | elif len(parts) == 2: 251 | tokens = ['R'] + measures_to_tokens(parts[0], soup, note_name=note_name) 252 | tokens += ['L'] + measures_to_tokens(parts[1], soup, note_name=note_name) 253 | 254 | return tokens 255 | --------------------------------------------------------------------------------