Outbreak aims to address misinformation by extracting relevant information in news headlines and organizing this information in an accessible manner. By providing this information, users can determine where the disease is most prominent without having to read through numerous news articles.
Due to the pandemic, information about disease outbreaks is essential to remain safe. Fortunately, news articles provide information about COVID-19, but sifting through thousands of news articles poses a challenge for many.
Outbreak uses a combination of web scraping and unsupervised machine learning to capture news headlines and cluster them accordingly. Headlines are filtered based on location and manipulated to form a comprehensive map.
The backend was created using Python ML libraries and AWS's Lambda service, while the frontend utilizes HTML, CSS, and Javascript.
Our open-source platform makes it easy to check for updates and run the code yourself on your local machine.
Web scraping through hundreds of the most popular news sources and finding information about each pertinent article using Python libraries like Beautiful Soup.
Read MoreUsing unsupervised clustering machine learning algorithms to find specific points of disease outbreaks depending on the information given by the news headline.
Read MorePlotting the clustered data found from the ML algorithms onto a geographic map using libraries like Basemap and Geonamescache.
Read MoreOutbreak allows for extremely quick processing so that individuals can search through specific keywords almost instantaneously.
Outbreak doesn’t require significant or comprehensive knowledge to operate and can be used with little to no instruction.
Outbreak allows the user to input any keyword that they want to find trends for, regardless of whether it relates to diseases or not.
Outbreak utilizes several clustering algorithms in order to identify distinct geographic locations based on the news headlines given.
Outbreak constantly updates its site to provide the most up-to-date trends on significant keywords like the coronavirus.
Outbreak provides current information about the state of relevant diseases in today’s society, including COVID-19.