Personal project in which I scrape comments from the jensen.nl website and store them in CSV fromat. I started doing this because thorugh a class in my bachelor program I was introduced to data science in combination with political communicaiton and I immediately took interest. Furthermore the comments from this website removed after a ceratin period of time, by starting collecting now and worrying about the use later I am collecting as many comments as possible.
So far I've mainly looked at straight forward things. I have for example:
- identified the users with the most replies and their respective points (likes / score / etc.).
- Investigated the links that are posted underneath the videos. The only 'tradional media' website in the top 10 most posted links is youtube. All other 9 websites are either small personal websites such as Jensen, or platforms with fewer restrictions. This striked my as interesting, because it really showcases the online bubble that people create for themselves. Maybe the algorithms are not the only sources of evil in this matter.
- Identified the video that brought forth the most commotion (at least from the data I collected, as mentioned comments are removed after a certain period of time)