AbusiveLanguage2020 is an open source dataset with over 1.5M tweets labeled for abusive language.
-
Updated
Sep 23, 2021 - Jupyter Notebook
AbusiveLanguage2020 is an open source dataset with over 1.5M tweets labeled for abusive language.
Labelling dataset with Snorkel and TextBlob, building model with Scikit-Learn (SVM), wiring up a web app using Flask.
Weak Labeling of Fake News Articles with Snorkel and Snuba
Interactive Label Generation Tool for Time Series Data
In this project, we are using Snorkel Python to work with ML algorithms with an unlabeled text dataset.
Utilizing the snorkel machine learning model to label biomimicry papers. Snorkel uses weak supervision to label large amounts of training data using programmatic labeling functions based on keyword rules.
This is a POC project of a system that aim to label unlabeled streaming data using Snorkel
Personal project that wants to predict the sexual content on reggaeton songs lyrics using natural language processing
snorkel tutorial docker image
Snorkel MeTaL: A framework for training models with multi-task weak supervision
Fine-grained semantic indexing of biomedical literature (a Weakly-Supervised approach)
A curated list of awesome Weak-Supervision-Sequence-Labeling (WSSL) papers, methods & resources.
Data labeling using weak supervision
Convertir máscara Snorkel a CPAP y a máscara de Equipo de Prevención Individual (EPIs)
Process flow to generate labels on Text data using Snorkel and maintain DB to repurpose unlabelled data
Semi-supervised labelling of news snippets to extract cleantech news
"Snorkel with Turtle" is a realtime labeling system based on Snorkel
Add a description, image, and links to the snorkel topic page so that developers can more easily learn about it.
To associate your repository with the snorkel topic, visit your repo's landing page and select "manage topics."