From the queries and visualizations we developed in the Step four – analyzing the data and Step five – visualizing the data sections, we can now attempt to answer each of the three questions that prompted this project in the first place.
With our first question, we wanted to find counts of the different paste sites mentioned by URL in posts and comments. The q1.php
script and bar graph we made to visualize the data show that, at least in the test data, JSFiddle was the most commonly referenced of the six paste site URLs we looked at.
The second question was about whether paste site URLs were more prevalent in questions or answers. Our queries show that paste site URLs were about twice as likely to occur in questions as opposed to answers, but the numbers for both were very small, at least in our test set.
For the third question, we wanted to look for whether people were actually heeding the advice of Stack Overflow and posting code in addition to a paste site...