title FAQ for Musicthatmakesyoudumb user virgil ip 71.83.221.19 vol 1 lock ******** Answers to common questions about /link http://musicthatmakesyoudumb.virgil.gr/ Musicthatmakesyoudumb /box( /b See also: /b /link http://virgil.gr/42.html Booksthatmakeyoudumb FAQ /box) /html
Soca < Gospel < Jazz < Hip Hop < Pop < Oldies < Raggae < Alternative < Classical < R&B < Rap < Rock < Country < Classic Rock < Techno < Indie* /link http://musicthatmakesyoudumb.virgil.gr/musicdetails.php?music=Counting+Crows Counting Crows is an interesting one near the top of the list. However, when Princeton, Duke, Williams, and Upenn make up ~20% of the weight you'd expect a high average score. * Beethoven wins by a LAND SLIDE of about 120 points. Further, the difference between "Beethoven" and "Classical" is about 400 points. * Unbeknownst to me when beginning this study, the /link http://caltech.edu California Institute of Technology actually has the highest average SAT score of any institution in the world. As a Caltech Ph.D. student, this pleased me immensely. /link http://mit.edu MIT, at #6, can suck it. * /link http://musicthatmakesyoudumb.virgil.gr/musicdetails.php?music=Tool Tool is a little above average. /link http://musicthatmakesyoudumb.virgil.gr/musicdetails.php?music=Pink+Floyd Pink Floyd enjoys massive popularity across the board, but in the end comes out to only slightly above average. * Most of the musical artists are popular across wide SAT ranges. However, /link http://musicthatmakesyoudumb.virgil.gr/musicdetails.php?music=Jars+Of+Clay Jars of Clay is not. Their fans cluster quite regularly around the mean. Oddly enough the same goes for /link http://musicthatmakesyoudumb.virgil.gr/musicdetails.php?music=Casting+Crowns Casting Crowns. * My roommate opines: /i "I don't want to be smart if I have to listen to Counting Crows" * For more, see /link http://musicthatmakesyoudumb.virgil.gr/comments.php The Comments /list) */b Do people with SAT >1400 just not listen to music? They do listen to music -- just look at those high-tier schools at /link http://musicthatmakesyoudumb.virgil.gr/schools.php The Schools. /i Averages are being plotted. So even though high-scoring schools are included in this average, the average is pulled down by the less selective schools. */b Do you have a press photo? No I don't. But I do like /link http://virgil.gr/42.sizes.@VirgilGriffithFace.jpg?size1=L this one. */b Why did you make Booksthatmakeyoudumb and Musicthatmakesyoudumb? /list( -num * I made both of these projects as a demonstration to the data-mining and mashup community that you can take two data-sets that by themselves are "boring", and combine them into something interesting. * To further my quest be #1 on Google for the search term "virgil". /list) */b Do you actually believe that there's a connection between people's intelligence and what the listen to? It doesn't matter what I believe. The data says there undeniably is a connection between what students report is their favorite music and the selectivity of the school they attend. */b Why do you think people have a more diverse taste in books than in music? I don't think this, I know this. I know this from looking at the raw data. There were 3,164 distinct favorite books, but only 1,455 distinct favorite musics from the same sample size (10 entries x 1,352 schools). The only way that can happen is if schools have more homogeneous music tastes than in books. */b Could you do this for international schools? This could be done internationally, but I don't know of any central repositories for college entrance exam scores other than Collegeboard.com, which lists only the ACT and SAT for colleges in the United States. If Europe/Asia has a website listing the typical entrance exam scores for their schools, you could do the same analysis for those as well. If you do know of a website with entrance exam scores for other parts of the world, do send me an email and we'll see! */b Could you do the same study with movies or tv series? No, I can't do this study with movies or TV series because facebook no longer provides aggregate data about each network. */b Why isn't artist _______ in your list? To be listed in Booksthatmakeyoudumb or Musicthatmakesyoudumb, that book/music must have been listed on the top-10 most popular list for at least 10 schools. You must do this because you're averaging across schools instead of people. A sample size of 10 schools is large enough sample size for the result to be meaningful. Books/Musics with more than 10 schools have a more reliable average and a smaller standard error. */b How old are you? I am 26 years old at the time of this typing (2009-3-12). */b Correlation is not causation blah blah. That's true. However in this case correlation is enough -- the results are provocative (not to mention hilarity incarnate) regardless of whether A causes B or B causes A, or even an unknown C causes A and B. */b What is the SAT, ACT? The SAT and ACT are respectively the most and second most common college entrance exam in the United States. The ACT is scored from 1-36. The SAT was until very recently scored from 400-1600. A few years ago this was changed so that the SAT is now scored from 600-2400, however, most schools on collegeboard.com don't yet list statistics on the new 600-2400 scale. So I used the older 400-1600 scale. */b Is the facebook data from each person's profile. Isn't that a selection bias towards people that have their profiles visible to everyone? No, it is not. I collect the data from Facebook's "Network Stats" page for each college. This makes a selection bias towards people that list their favorite books, but there is no selection bias towards people that have their profile publicly viewable. If you list it on Facebook, Facebook counts it. And if your books make the top 10, it then I can (and do) count it. */b Why isn't school _______ in your database? I tried to include everyone. Really, I did. But if your school isn't listed, it's either because I couldn't find your school's network on Facebook, or Collegeboard didn't have reliable SAT/ACT scores for that school. */b What music on that list do you listen to? My favorite band is Daft Punk, but there's not on that list. My favorite music on the list is Tool, Radiohead, and Metallica. */b How is the weight of a "favorite music" calculated? Every school moves the average SAT of a "favorite music" in its top 10 closer to its own SAT score. How much does a school move the average? It depends on: 1) the # of undergrads at that school (big schools count more than small schools) and 2) where in the top-10 list the book appeared (being #1 at a school is worth more than being #10). If you want to know the full formula, it is: /tt schoolweight_i = #ugrads * (11-bookrank)/10 / /tt totalweight = sum{ schoolweights } This is a regular linear falloff and is the most reasonable one I could think of, however, I tried 4 other functions including exponential and logarithmic falloff and only the books in the middle changed much at all. The same 20 "favorite music" were always the smartest, and the same 20 "favorite music" were always the dumbest. */b How is the Adjusted Average SAT calculated? // What is m ? The Adjusted Average SAT is a True Bayesian Estimate -- it's the same way IMDB uses to calculate their /link http://www.imdb.com/chart/top Top 250 movies. In short, the true Bayesian estimate is the weighted average with an additional term 'm'. Increasing m takes books with a small number of samples (weight) and moves them towards the mean. The justification behind this is that if a book doesn't have very many samples we can't trust its mean as much as books with many samples. However, there's a problem with this -- what value should m be? You never know. IMDB arbitrarily sets m=1300. With Musicthatmakesyoudumb you can set m to be whatever you want and see the new rankings. However, since we are only looking at music favorites with a high sample size (>=10 samples and totalweight >=10,000), the raw weighted average is well representative and leaving m=0 is probably the right thing to do. Not that it matters much though, if you look at the musicdetails page you'll see that the rankings change very little with high m. */b Doesn't this violate Facebook's Terms of Service? No. It doesn't. Facebook prohibits the "use automated scripts to collect information from or otherwise interact with the Service or the Site". A friend of mine manually collected our books data. */b The data you scrape from Collegeboard doesn't give you the mean SAT/ACT. That's true. Technically it's the average of the 1st and 3rd quartile. Unfortunately, there was no data available on the raw means, and the average of the 1st and 3rd quartile isn't going to be the mean or median unless the distribution is roughly symmetric (which may be true for most schools, but probably won't be true for high-scoring schools). However, even in the worst cases the average of the 1st and 3rd quartile is still decently representative of the center of the distribution. And even if it wasn't, there's not much worry about -- the top/bottom of the rankings (which are pretty much all people care about anyway) are exceedingly robust (see above). /list)