File:Frame-level Instrument Recognition by Timbre and Pitch.pdf
From Wikimedia Commons, the free media repository
Jump to navigation
Jump to search
Size of this JPG preview of this PDF file: 424 × 600 pixels. Other resolutions: 170 × 240 pixels | 339 × 480 pixels | 543 × 768 pixels | 1,239 × 1,752 pixels.
Original file (1,239 × 1,752 pixels, file size: 2.3 MB, MIME type: application/pdf, 8 pages)
File information
Structured data
Captions
Summary
[edit]DescriptionFrame-level Instrument Recognition by Timbre and Pitch.pdf |
English: Instrument recognition is a fundamental task in music information retrieval, yet little has been done to predict the presence of instruments in multi-instrument music for each time frame. This task is important for not only automatic transcription but also many retrieval problems. In this paper, we use the newly released MusicNet dataset to study this front, by building and evaluating a convolutional neural network for making frame-level instrument prediction. We consider it as a multi-label classification problem for each frame and use frame-level annotations as the supervisory signal in training the network. Moreover, we experiment with different ways to incorporate pitch information to our model, with the premise that doing so informs the model the notes that are active per frame, and also encourages the model to learn relative rates of energy buildup in the harmonic partials of different instruments. Experiments show salient performance improvement over baseline methods. We also report an analysis probing how pitch information helps the instrument prediction task. |
Date | |
Source | Content available at arxiv.org (Dedicated link) (archive.org) |
Author | Yun-Ning Hung, Yi-Hsuan Yang |
Licensing
[edit]This file is licensed under the Creative Commons Attribution 4.0 International license.
- You are free:
- to share – to copy, distribute and transmit the work
- to remix – to adapt the work
- Under the following conditions:
- attribution – You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
File history
Click on a date/time to view the file as it appeared at that time.
Date/Time | Thumbnail | Dimensions | User | Comment | |
---|---|---|---|---|---|
current | 06:28, 11 November 2018 | 1,239 × 1,752, 8 pages (2.3 MB) | Acagastya (talk | contribs) | User created page with UploadWizard |
You cannot overwrite this file.
File usage on Commons
There are no pages that use this file.
Metadata
This file contains additional information such as Exif metadata which may have been added by the digital camera, scanner, or software program used to create or digitize it. If the file has been modified from its original state, some details such as the timestamp may not fully reflect those of the original file. The timestamp is only as accurate as the clock in the camera, and it may be completely wrong.
Software used | TeX |
---|---|
Conversion program | pdfTeX-1.40.17 |
Encrypted | no |
Page size | 595.276 x 841.89 pts (A4) |
Version of PDF format | 1.5 |
Hidden category: