

Classifying Song Genre by Lyrics and Sounds
Jayen Ram (jjram@stanford.edu), Daniel Salz (dasalz@stanford.edu)


Motivation
Both of us are huge music fans, and we were inspired by the spam detection problem from the class to build a genre detection classifier using various techniques from the course. We started with binary rap classification and moved toward multi-class classification (3 genres).

Features

a. Lyrics
We counted occurrences of every word in the lyrics dataset (see Data below) and kept words that appeared a moderate number of times (1000 < x < 3000) and were not well-distributed across genres; a sketch of this filtering step appears after this list.

b. Sounds
Frequency: the features are the full 250,000 FFT values from the FFT dataset. Different voices, instruments, and beats are represented by different frequencies in an FFT vector.
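Below is a minimal sketch of the lyric feature-selection step described above, assuming the lyrics are available as plain strings; the 1000/3000 thresholds come from the poster, while the function and variable names are illustrative.

```python
from collections import Counter

def select_vocab(all_lyrics, low=1000, high=3000):
    """Keep words whose total count across all songs falls in (low, high)."""
    counts = Counter()
    for lyric in all_lyrics:              # all_lyrics: list of lyric strings, one per song
        counts.update(lyric.lower().split())
    return sorted(word for word, c in counts.items() if low < c < high)

def bag_of_words(lyric, vocab):
    """Count vector over the selected vocabulary for a single song."""
    counts = Counter(lyric.lower().split())
    return [counts[word] for word in vocab]
```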

Data

a. Lyrics
We used a dataset from Kaggle that has the lyrics to 55,000 songs. This dataset mapped artist to song to lyrics, and we classified artists individually.

b. Sounds
We downloaded all of the songs from the lyrics dataset and ran them through the fft function in MATLAB. We then kept only the first 2,000,000 indices (frequency ranges above this seemed redundant) and sampled every 8th value to reduce our input vector. We also had a separate dataset consisting of several song properties. A sketch of this preprocessing step follows below.
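The poster's FFT preprocessing was done with MATLAB's fft; the following is a rough NumPy equivalent, assuming each song has been decoded to a mono sample array at least 2,000,000 samples long. The truncation length and the every-8th-value subsampling are from the poster; taking the magnitude of the complex spectrum is an assumption.

```python
import numpy as np

def fft_features(samples, n_keep=2_000_000, stride=8):
    """FFT-based feature vector for one song: keep the first n_keep bins
    and every `stride`-th value, giving 250,000 features per song."""
    spectrum = np.abs(np.fft.fft(samples))   # magnitude spectrum (assumption; the fft itself returns complex values)
    return spectrum[:n_keep:stride]
```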

Figure 1. Spectrogram of Rap Song: "Big Poppa" by The Notorious B.I.G.

Lyrics Classification

a. Methodology
We applied Naive Bayes, an SVM with an RBF kernel, and linear regression to the feature vectors built from words appearing 1000 < x < 3000 times. Naive Bayes used smoothing with $\lambda = 1.0$, linear regression used the squared loss
$$L(x) = \tfrac{1}{2}\sum_i \left(w^\top x_i - y_i\right)^2,$$
and the SVM used an RBF kernel with squared loss. We implemented other variations of these models, but report only the most successful results. An illustrative setup appears after the results table below.

b. Results
The SVM performed best on both the binary and the multi-class task. Across the board, training and testing error were extremely similar. Certain words strongly indicated rap, country, or rock, which we cannot put on this poster.

c. Analysis
After inspecting the songs our predictors classified incorrectly, we found that the SVM separated country from rock better than Naive Bayes did. This was because very few words are particular to either country or rock. Naive Bayes was relatively effective for binary classification because certain words are found only in rap songs.

d. Future Work
We want to expand to a greater number of classes (beyond 3) and determine key words (as we have done for rap) that indicate each genre.

Model          Binary Train Acc.   Binary Test Acc.   3-class Train Acc.   3-class Test Acc.
SVM            94.4                93.2               73.1                 72.8
Naive Bayes    92.3                91.6               70.2                 69.8
Linear Reg     89.7                88.5               69.5                 66.4

Figure 2. Accuracy Table of Lyric Classifiers
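As a rough illustration, the three lyric classifiers could be set up with scikit-learn as follows; X and y are assumed to be the bag-of-words matrix and genre labels from the feature step, the smoothing (λ = 1.0) and RBF kernel mirror the poster, and RidgeClassifier with a tiny penalty stands in for the poster's least-squares linear regression.

```python
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import MultinomialNB
from sklearn.svm import SVC
from sklearn.linear_model import RidgeClassifier

def evaluate_lyric_models(X, y):
    """Train/test accuracy for the three lyric classifiers from the poster."""
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)
    models = {
        "SVM (RBF)":   SVC(kernel="rbf"),           # RBF kernel, as in the poster
        "Naive Bayes": MultinomialNB(alpha=1.0),    # Laplace smoothing, lambda = 1.0
        "Linear Reg":  RidgeClassifier(alpha=1e-6), # least-squares stand-in (assumption)
    }
    for name, model in models.items():
        model.fit(X_tr, y_tr)
        print(f"{name}: train {model.score(X_tr, y_tr):.3f}, test {model.score(X_te, y_te):.3f}")
```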

Sound Classification

a. Methodology
We implemented a neural network using batch Adam gradient descent and a softmax output layer,
$$\mathrm{softmax}(z)_i = \frac{e^{z_i}}{\sum_j e^{z_j}}.$$
The parameters were 2 hidden layers with 100 neurons each and a final learning rate of 0.05. A sketch of this architecture appears after this list.

b. Results
Initially the neural net achieved at most 76% accuracy, but after revisiting our dataset and model parameters it reached 92% accuracy on binary rap classification based purely on the FFT vector.

c. Analysis
After initially running the network on binary classification, we attempted to expand the problem to 8 classes (genres), but our accuracy dropped to the 20-30% range. After seeing these results, we decided to focus on binary classification with the neural net and were able to dramatically improve its prediction accuracy.

d. Future Work
We want to combine the confidence of the sound and lyrics models to improve overall accuracy, expand our feature vector to include song properties such as tempo, timbre, pitch, and duration, and expand the network to identify more genres.
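A minimal PyTorch sketch of the network described above: two hidden layers of 100 neurons, batch Adam, and a softmax output applied implicitly through the cross-entropy loss. The layer sizes and the 0.05 learning rate are from the poster; the ReLU activation, epoch count, and data-loader interface are assumptions.

```python
import torch
import torch.nn as nn

def make_genre_net(n_features, n_genres=2):
    """Two hidden layers of 100 neurons; the final layer outputs logits,
    and softmax(z)_i = exp(z_i) / sum_j exp(z_j) is applied inside the loss."""
    return nn.Sequential(
        nn.Linear(n_features, 100), nn.ReLU(),   # ReLU is an assumption; the poster does not name the activation
        nn.Linear(100, 100), nn.ReLU(),
        nn.Linear(100, n_genres),
    )

def train(model, loader, epochs=10, lr=0.05):
    """Batch Adam training; lr = 0.05 is the poster's final learning rate."""
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()              # cross-entropy over the softmax output
    for _ in range(epochs):
        for x, y in loader:                      # loader yields (FFT feature batch, genre label) pairs
            opt.zero_grad()
            loss = loss_fn(model(x), y)
            loss.backward()
            opt.step()
    return model
```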

[Spectrogram image: s3 spectrogram at M = 1024 samples for "Big Poppa"; x-axis: Time (s), y-axis: Frequency (Hz)]


[Plot: training cost vs. epoch iteration]

Figure 4. Cost with Learning Rate of 0.05

               Learning Rate 0.1   Learning Rate 0.05   Learning Rate 0.01   Learning Rate 0.005
Test Error     88.4%               92.2%                90.1%                86.8%
Train Error    90.3%               94.5%                91.3%                87.8%

Figure 3. Learning Rate vs Train & Test Error
