└── README.md

/README.md:
--------------------------------------------------------------------------------
# Audio Set Ontology

Dan Ellis dpwe@google.com

v1 2017-03-06

The file [ontology.json](ontology.json) contains the current definition of
the [AudioSet](https://research.google.com/audioset/) ontology, a hierarchical
set of audio event classes.

The JSON file describes a list of sound entity objects. Each object contains
the fields:

* `id`: The machine identifier for this class, a short sequence like
  `/m/0dgw9r`. Where possible, these are based on Knowledge Graph IDs,
  which were originally used in
  [Freebase](https://en.wikipedia.org/wiki/Freebase).

* `name`: The Display Name that refers to the class. It should be unique and
  unambiguous. Several classes have one or more synonyms included in their
  display names, for example "Male speech, man speaking".

* `description`: A description of the class in a few lines.

* `citation_uri`: A pointer to any text used as the basis for the
  description. Frequently points to a Wikipedia page.

* `positive_examples`: A list of compact URLs to segments within YouTube files
  that provide confirmed positive examples of this class. Each entry is of
  the form `youtu.be/8uI9H5jGRV8?start=30&end=40`, meaning the example is the
  10 seconds starting 30 seconds into the YouTube video whose ID is
  `8uI9H5jGRV8`.

* `child_ids`: A list of the `id` fields for any classes that are children of
  this class in the class hierarchy.

* `restrictions`: A list that can include the following values:

  * `abstract` for a class that is principally a container within the
    hierarchy, but will not have any explicit examples for itself. "Human
    voice" is an abstract class. Abstract classes will always have children.

  * `blacklist` for classes that have been excluded from rating for the time
    being. These are classes that we found were too difficult for raters to
    mark reliably, or for which we had too much trouble finding candidates, or
    which we decided to drop from labeling for some other reason.

The ontology is made available by Google Inc. under a Creative Commons
Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) license.
--------------------------------------------------------------------------------
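
The fields described in the README above can be read with a few lines of Python. The following is a minimal sketch, not part of the original distribution. The helper names (`load_ontology`, `parse_example`, `parents_by_child`, `unrestricted`) and the `/x/...` entries in `SAMPLE` are hypothetical and exist only for illustration. It assumes that `ontology.json` is a top-level JSON list of objects with the fields listed above, and that the `start` and `end` values are given in seconds.

```python
import json
from urllib.parse import parse_qs, urlsplit


def load_ontology(path="ontology.json"):
    """Read the list of sound entity objects (assumed to be a top-level JSON list)."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def parse_example(url):
    """Split a compact example URL into (video_id, start_seconds, end_seconds)."""
    parts = urlsplit(url)
    # The video ID is the last path component, e.g. "youtu.be/8uI9H5jGRV8".
    video_id = parts.path.rsplit("/", 1)[-1]
    query = parse_qs(parts.query)
    # float() is a defensive choice; the README example uses whole seconds.
    return video_id, float(query["start"][0]), float(query["end"][0])


def parents_by_child(classes):
    """Invert `child_ids` into a mapping from a class id to its parent ids."""
    parents = {}
    for entry in classes:
        for child_id in entry.get("child_ids", []):
            parents.setdefault(child_id, []).append(entry["id"])
    return parents


def unrestricted(classes):
    """Keep classes marked neither `abstract` nor `blacklist`."""
    excluded = {"abstract", "blacklist"}
    return [c for c in classes if not excluded & set(c.get("restrictions", []))]


# Hypothetical entries that mimic the documented fields; the ids are made up.
SAMPLE = [
    {"id": "/x/parent", "name": "Example parent", "description": "",
     "citation_uri": "", "positive_examples": [],
     "child_ids": ["/x/child", "/x/dropped"], "restrictions": ["abstract"]},
    {"id": "/x/child", "name": "Example child", "description": "",
     "citation_uri": "",
     "positive_examples": ["youtu.be/8uI9H5jGRV8?start=30&end=40"],
     "child_ids": [], "restrictions": []},
    {"id": "/x/dropped", "name": "Example dropped class", "description": "",
     "citation_uri": "", "positive_examples": [],
     "child_ids": [], "restrictions": ["blacklist"]},
]

if __name__ == "__main__":
    for entry in unrestricted(SAMPLE):
        segments = [parse_example(u) for u in entry["positive_examples"]]
        print(entry["name"], parents_by_child(SAMPLE).get(entry["id"], []), segments)
```

To run the same helpers on the real data, replace `SAMPLE` with `load_ontology()`. The parent mapping stores a list per child because the README does not say whether a class can appear in the `child_ids` of more than one parent.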