However, this valuable information is often lost when turning audio into MIDI. Basic Pitch supports this right out of the box.

Speed: Basic Pitch is light on resources, and is able to run faster than real time on most modern computers (Bittner et al. 2022).

By combining these properties, Basic Pitch lets you take input from a variety of instruments and easily turn it into MIDI output, with a high degree of nuance and accuracy. The MIDI output can then be imported into a digital audio workstation for further adjustments.

Comparing nuance and accuracy using a guitar example. Bottom: The output of Basic Pitch.

Basic Pitch gives musicians and audio producers access to the power and flexibility of MIDI, whether they own specialized MIDI gear or not. So now they can capture their ideas whenever inspiration strikes and get a head start on their compositions using the instrument of their choice, whether that’s guitar, flugelhorn, or their own voice.

Easy peasy! Well…

Does better always have to mean bigger?

To build Basic Pitch, we trained a neural network to predict MIDI note events given audio input. In general, it’s hard to make systems that are both accurate and efficient. Usually in ML, to make things more accurate, the easiest way is to add more data and make our models bigger. When we look around at popular ML models today, we see a tendency toward computationally extreme solutions. Think OpenAI’s Jukebox with its billions of parameters, or DALL-E and its 12 billion parameters, or Meta’s OPT-175B, a 175-billion-parameter large language model. Large models with targeted use cases can have very good results: using a dataset with tons of piano audio will be effective at recognizing piano input.

But we wanted a model that could work with input from a variety of instruments and polyphonic recordings to create a tool that’s useful for piano virtuosos as well as shower crooners. Did that automatically mean building a mega model? Or, could we build a model that did more (recognize notes played by a much wider range of instruments), but that was also lighter, and just as accurate, as a heavier, more power-hungry solution?
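For readers who haven't worked with MIDI before, here is a rough sketch of what a "MIDI note event" carries and why nuance can get lost when audio is turned into MIDI. It is not how Basic Pitch works internally. It only shows the standard arithmetic for mapping a frequency to the nearest MIDI note number, with the leftover detuning expressed as a 14-bit pitch-bend value. The `NoteEvent` class, the function name, and the ±2 semitone bend range are assumptions made for illustration (the bend range depends on the receiving synth).

```python
import math
from dataclasses import dataclass

A4_HZ = 440.0                 # conventional reference tuning for A4
A4_MIDI = 69                  # MIDI note number assigned to A4
BEND_CENTER = 8192            # 14-bit pitch-bend value meaning "no bend"
BEND_RANGE_SEMITONES = 2.0    # assumed bend range; synth-dependent


@dataclass
class NoteEvent:
    """Hypothetical container for one transcribed note."""
    start_s: float    # onset time in seconds
    end_s: float      # offset time in seconds
    midi_note: int    # 0..127
    pitch_bend: int   # 0..16383, 8192 = centered


def hz_to_note_and_bend(freq_hz: float) -> tuple[int, int]:
    """Map a frequency to (nearest MIDI note, pitch-bend value)."""
    # Fractional MIDI pitch: 12 semitones per doubling of frequency.
    exact = A4_MIDI + 12 * math.log2(freq_hz / A4_HZ)
    note = round(exact)
    # Detuning from the nearest note, in semitones (about -0.5..0.5).
    offset = exact - note
    bend = BEND_CENTER + round(offset / BEND_RANGE_SEMITONES * 8192)
    # Clamp to the valid 14-bit range.
    return note, max(0, min(16383, bend))


# A note a quarter-tone sharp of A4, held for half a second.
note, bend = hz_to_note_and_bend(440.0 * 2 ** (0.25 / 12))
event = NoteEvent(start_s=0.0, end_s=0.5, midi_note=note, pitch_bend=bend)
```

A transcriber that keeps only `midi_note` would report a plain A4 here. The bend value is what preserves the fact that the note was played sharp.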