pyterrier

Here are 13 public repositories matching this topic...

giulio-derasmo / Search-Engine-Evaluation-and-Near-Duplicate-Detection

Uses the PyTerrier library to build a search engine and solve near-duplicate detection tasks.

python search-engine data-mining lsh locality-sensitive-hashing search-engine-optimization near-duplicate-detection pyterrier

Updated Sep 20, 2022
Jupyter Notebook

soldni / pyterrier_sentence_transformers

Create PyTerrier compatible dense indices using any sentence_transformers model

information-retrieval transformers terrier pyterrier

Updated Jan 7, 2023
Python

Trojan13 / packgaabwir2022

This repository contains the code for a research project that implements and evaluates local word embeddings based on co-authorship and citations for query expansion in PyTerrier on the TREC-Covid dataset.

local word2vec irds cord-19-dataset pyterrier queryexpansion

Updated Feb 28, 2023
Jupyter Notebook

SelmaDM / Pyterrier

Information retrieval techniques using PyTerrier

python indexing ranking referral-link pyterrier pytreceval

Updated Mar 17, 2023
Jupyter Notebook

mihirs16 / multi-stage-retrieval-using-rm3-and-t5

Multi-stage Retrieval using SPLADE or RM3 and T5.

search api search-engine flask machine-learning information-retrieval transformers ranking query-expansion zero-shot-learning reranking pyterrier

Updated Apr 24, 2023
Python

nicolo-urbani / FactFinder

Fact Finder - a Fact Search Engine

search information-retrieval searchengine pyterrier

Updated May 5, 2024
Jupyter Notebook

Bosy-Ayman / Twitter-Search-Engine

This project creates a basic search engine for text documents, covering data collection, preprocessing, indexing, query processing, query expansion, UI development, and performance evaluation. Its goal is to efficiently retrieve relevant information from the document collection.

information-retrieval nltk seach-engine pyterrier

Updated May 12, 2024
Jupyter Notebook

toninf / dense_retrieval

Quality and speed comparison of Word2vec, Sentence-BERT, BM25, and an IVFFlat index

word2vec sentence-embeddings faiss pyterrier

Updated Jun 7, 2024
Jupyter Notebook

youssef-saaed / Simple-Search-Engine-for-Information-Retrieval

This repository hosts the implementation of a Simple Search Engine designed for efficient information retrieval. The project encompasses several stages from data collection to evaluation, ensuring a comprehensive approach to search and retrieval.

nlp search-engine information-retrieval pyterrier

Updated Jun 9, 2024
Jupyter Notebook

Unusuala1l2e3x4 / Emotion-Classification-of-Tweets-with-Search-Engine

nlp search-engine natural-language-processing information-retrieval scikit-learn ngrams emotion-classification pyterrier

Updated Jun 19, 2024
Jupyter Notebook

mb-emektar / wiki-search

This project builds an ad-hoc information retrieval system over the DPRWiki100 dataset with PyTerrier. NLTK handles query processing, including tokenization and stemming. BM25 is used for ranking, with optimizations to improve performance. The system features a minimalistic tkinter-based user interface.

information-retrieval nltk tkinter bm25 pyterrier