TripClick

Synopsis

TripClick is a large-scale dataset of click logs in the health domain, obtained from user interactions with the Trip Database health web search engine. The click log dataset comprises approximately 5.2 million user interactions, collected between 2013 and 2020. The dataset is accompanied by an IR evaluation benchmark and the files required to train deep learning IR models.

Files and Folders

.
├── 1_TripClick_Logs_Dataset
│   └── logs.tar.gz
├── 2_TripClick_IR_Benchmark
│   └── benchmark.tar.gz
└── 3_TripClick_DL_Training_Package
    ├── dlfiles_runs_test.tar.gz
    └── dlfiles.tar.gz
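
Since every package ships as a `.tar.gz` archive, unpacking can be scripted. The following is a minimal sketch, assuming the archives have already been downloaded into the folder layout shown above; the function name `extract_archives` and the output folder are hypothetical and not part of the TripClick distribution.

```python
import tarfile
from pathlib import Path


def extract_archives(root: str, dest: str) -> list[Path]:
    """Extract every .tar.gz found under `root` into `dest`, mirroring the folder layout."""
    root_path, dest_path = Path(root), Path(dest)
    extracted = []
    # sorted() collects all matches up front, so `dest` may safely live inside `root`.
    for archive_path in sorted(root_path.rglob("*.tar.gz")):
        # e.g. 1_TripClick_Logs_Dataset/logs.tar.gz -> dest/1_TripClick_Logs_Dataset/logs
        relative = archive_path.relative_to(root_path)
        target = dest_path / relative.parent / archive_path.name[: -len(".tar.gz")]
        target.mkdir(parents=True, exist_ok=True)
        # Use the safer "data" extraction filter where the running Python supports it.
        kwargs = {"filter": "data"} if hasattr(tarfile, "data_filter") else {}
        with tarfile.open(archive_path, "r:gz") as archive:
            archive.extractall(target, **kwargs)
        extracted.append(target)
    return extracted
```

Calling `extract_archives(".", "extracted")` from the repository root would unpack each archive into its own subfolder under the hypothetical `extracted/` directory and return the list of folders it created.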

Research and Usecases

Our own research and experiments with these datasets.

License Information

@breuert signed a request form.

The provided datasets are intended for non-commercial research purposes to promote advancement in the field of natural language processing, information retrieval and related areas, and are made available free of charge without extending any license or other intellectual property rights. In particular:

Any parts of the datasets cannot be publicly shared or hosted (with exception for aggregated findings and visualizations);

The datasets can only be used for non-commercial research purposes;

The statistical models or any further resources created based on the datasets cannot be shared publicly without the permission of the data owners. These include for instance the weights of deep learning models trained on the provided data.

Upon violation of any of these terms, my rights to use the dataset will end automatically. The datasets are provided “as is” without warranty. The side granting access to the datasets is not liable for any damages related to use of the dataset.

Data Source

The data can be retrieved from this site.

Publications