The dataset provides German-language natural language data for training risk classification models. It contains labelled text data for both risks and chances.

michael-eble/german-nlp-dataset-risk-management

German NLP dataset for classification models in risk management

The dataset provides German-language natural language data for training binary risk classification models. It contains labelled text data for both risks and chances.

License: CC-BY-4.0, see https://choosealicense.com/licenses/cc-by-4.0/

Purpose and characteristics of the data set

  • Goal: Provide natural language text data for training binary classification models in risk management applications
  • Motivation: Few publicly available data sets cover both the German language and risk classes
  • Each record in the data set carries a binary label: 1 = the issue is a risk, 0 = the issue is a chance
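
The labelling scheme above can be sketched in Python as follows. This is a minimal illustration, not the repository's own code: the `Record` type, its field names `text` and `label`, and the two German sentences are hypothetical and are not taken from the data set. Only the mapping 1 = risk, 0 = chance comes from the description above.

```python
from typing import NamedTuple

# Label scheme from the README: 1 = risk, 0 = chance.
LABEL_NAMES = {1: "risk", 0: "chance"}


class Record(NamedTuple):
    text: str   # hypothetical field name
    label: int  # hypothetical field name


def label_name(record: Record) -> str:
    """Return the readable class name; raise on labels outside {0, 1}."""
    try:
        return LABEL_NAMES[record.label]
    except KeyError:
        raise ValueError(f"unexpected label: {record.label!r}") from None


# Illustrative sentences only, not taken from the data set.
examples = [
    Record("Der Lieferant könnte kurzfristig ausfallen.", 1),
    Record("Ein neuer Absatzmarkt könnte erschlossen werden.", 0),
]
names = [label_name(r) for r in examples]
```

Rejecting unknown labels early keeps a mislabelled or malformed row from silently entering a binary training set.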

Current status of the data set

  • Number of text data records labelled as "risk": 503
  • Number of text data records labelled as "chance": 503
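
Because both classes currently hold 503 records, a train/test split can keep that balance by splitting each class separately. The sketch below assumes the labels have already been loaded into a list of integers; the function name `stratified_split`, the 20 % test fraction, and the seed are illustrative choices, not part of the data set.

```python
import random
from collections import Counter


def stratified_split(labels, test_fraction=0.2, seed=0):
    """Return (train_idx, test_idx), splitting each class separately."""
    rng = random.Random(seed)
    by_class = {}
    for i, y in enumerate(labels):
        by_class.setdefault(y, []).append(i)
    train, test = [], []
    for idxs in by_class.values():
        rng.shuffle(idxs)
        n_test = round(len(idxs) * test_fraction)
        test.extend(idxs[:n_test])
        train.extend(idxs[n_test:])
    return sorted(train), sorted(test)


# Stand-in labels matching the reported class counts: 503 risks, 503 chances.
labels = [1] * 503 + [0] * 503
train_idx, test_idx = stratified_split(labels)
test_counts = Counter(labels[i] for i in test_idx)
```

With equal class sizes, each class contributes the same number of records to the test set, so accuracy on the held-out data is not skewed toward either label.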