Energy Engineer (UnB) │ Data Scientist and Analytics (USP)
📧 E-mail │ 🎯 Linkedin │ GitHub
Project Notebook [BR 🇧🇷]
With the aim of applying knowledge in Text Mining, Sentiment Analysis, NLP, Machine Learning, Crowlers and Web Scrapping, data solutions were developed with themes relevant to the energy sector.
These solutions scrape data from the CNN Brasil website (with a focus on the international energy scenario); and from government websites of agencies such as ANP, ANEEL and MME (with a focus on the national energy scenario).
Once scraped, the data is manipulated and added to a dataframe, and finally presented via plotly and wordcloud, as shown in the figures below.
All notebooks were developed via jupyter notebook and are available in my GitHub repository (github.com/viniciusgribas).
The results obtained (listed in the comments) were very interesting! They allow us to extract insights into what is happening in Brazil and in the world in the energy theme.
Feel free to contact me if you have any feedback, interesting websites to scrape, or insights to share.
#energy #github #machinelearning #nlp #textmining #mme #cnn #aneel #anp
1️ - CNN-NEWS Results (energy):
- 💻 Notebook: https://tinyurl.com/ydapn5ct
- 📊 ScatterPlot Analysis: https://lnkd.in/djAaaB5M -📰 WordCloud Analysis: https://lnkd.in/duGg-s72
2️ - ANEEL Results:
- 💻 Notebook: https://lnkd.in/dW7Pm69N
- 📊 ScatterPlot Analysis: https://lnkd.in/d262_Bu2 -📰 WordCloud Analysis: https://lnkd.in/dTpESCA7
3️ - ANP Results:
- 💻 Notebook: https://lnkd.in/dzVhxQcg
- 📊 ScatterPlot Analysis: https://lnkd.in/dJXg-JnT -📰 WordCloud Analysis: https://lnkd.in/d98aiUQm
4️- MME Results:
- 💻 Notebook: https://lnkd.in/dZMcSkEv
- 📊 ScatterPlot Analysis: https://lnkd.in/dHKNSjsu -📰 WordCloud Analysis: https://lnkd.in/dzZUd8r7