4.1 - Redefining overall structure

As seen in this updated mindmap below, there is a lot going on internally when making music, but there also is emotional influence of the listener, either intended or not, by the maker. This makes for a more wholesome structure of the research field and includes all the different parts that make music in itself an interesting thing to study. 

When looking into the technical details of making an artificial music generator there is a part which analyses data, implements learned details (which melodic and song structure are made up of)  and the actual generator part which uses the aforementioned learned details and rules to generate music. 

In a set up with GANs there is the possibility to generate more data with the encoder, while the decoder is fed this information to discriminate between. The encoder is therefore atuned to generate different types of subsets and learns better what the difference with the original data set is.

The sequential aspect of music makes it less suitable for an image implementation of GANs, however as sequentiality can be overcome with atuning the GAN to make its output sequential instead of predicting multiple possible variations each step, there might be a very good use for this type of algorithm. However, there is another possibility, which is optimising the next iteration in the future step of the GAN to optimise for a reinforcement learner (RL), which then atunes every step to be optimal. This optimisation then can be linked to the amount of expected return for emotional conveyance to the listener. Which will result, when using the right listener model, to an emotional music organiser, coined here as EMO.

Comments

Popular posts from this blog

4.0 - More data, more data!

2.0 - Literature draft