In this repository you will find a series of projects about data extraction and analysis made with Python. Specifically, inside the folders you will find data extraction projects from APIs as well as different analysis techniques, such as sentiment analysis and natural language processing. All available projects are listed below, along with a summary of their content.
-
AB Test for an ad-tech company.
Analysis project on the results of a 30-day AB test for a fictitious ad-tech company in order to find out which advertising strategy is the best to implement based on a series of metrics contrasted by hypothesis testing. The notebook explains methods and strategies to clean and explore the data, as well as what kind of tests to perform in order to decide the type of test statistic to use, parametric or non-parametric, based on the collected data.
-
Data extraction project of all products in an Amazon product category. It includes explanations on which libraries and methods to use to access the information of all the products of several pages in Amazon, as well as a way to save them into a dataframe.
-
Sentiment Analysis of Elon Musk Tweets.
Extraction and analysis project in which I show how to obtain all tweets written by a user (in this case Elon Musk), as well as information about the date of publication and other basic statistics, using the Twitter API. Additionally, I show how to perform a deep text cleanup, as well as a brief sentiment and text analysis.
-
This data mining project shows step by step how to make calls to the Youtube API in order to acquire information about the videos of a channel and their respective statistics. At the same time, the project includes a section showing how to import the obtained results to a cloud database (in this case AWS) with Python.
Feel free to follow me for updates and reach out to me at martacasdelg@gmail.com if you have any suggestions.