feat(cli serve): add a new reranker BCEReranking #25

bjwswang · 2024-03-01T07:04:10Z

What type of PR is this?

What this PR does / why we need it

Which issue(s) this PR fixes

Fixes #

Special notes for your reviewer

Abirdcfly · 2024-03-01T08:36:09Z

libs/cli/kubeagi_cli/serve/reranking.py

            if isinstance(result, float):
                result = [result]
            return result
        else:
            return None
+
+
+def select_reranking(model_name_or_path: str) -> BaseReranking:


It looks like if we use transformers directly, the same code could serve different rerank models?

    import torch
    from transformers import AutoTokenizer, AutoModelForSequenceClassification

    # init model and tokenizer
    tokenizer = AutoTokenizer.from_pretrained('maidalun1020/bce-reranker-base_v1')
    model = AutoModelForSequenceClassification.from_pretrained('maidalun1020/bce-reranker-base_v1')

    device = 'cuda'  # if no GPU, set "cpu"
    model.to(device)

    # get inputs
    inputs = tokenizer(sentence_pairs, padding=True, truncation=True, max_length=512, return_tensors="pt")
    inputs_on_device = {k: v.to(device) for k, v in inputs.items()}

    # calculate scores
    scores = model(**inputs_on_device, return_dict=True).logits.view(-1,).float()
    scores = torch.sigmoid(scores)

    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained('BAAI/bge-reranker-large')
    model = AutoModelForSequenceClassification.from_pretrained('BAAI/bge-reranker-large')
    model.eval()

    pairs = [['what is panda?', 'hi'], ['what is panda?', 'The giant panda (Ailuropoda melanoleuca), sometimes called a panda bear or simply panda, is a bear species endemic to China.']]
    with torch.no_grad():
        inputs = tokenizer(pairs, padding=True, truncation=True, return_tensors='pt', max_length=512)
        scores = model(**inputs, return_dict=True).logits.view(-1, ).float()
        print(scores)

FlagEmbedding/BCEEmbedding provide interfaces for loading reranking models, and they use transformers under the hood as well. By using FlagEmbedding/BCEEmbedding, we do not need to care much about the base dependencies.

Signed-off-by: bjwswang <bjwswang@gmail.com>

github-actions bot requested a review from wangxinbiao March 1, 2024 07:04

bjwswang force-pushed the main branch from 8c1dc2c to f79e4bf Compare March 1, 2024 07:56

bjwswang mentioned this pull request Mar 1, 2024

feat(worker): add RunnerKubeAGI to host reranking models kubeagi/arcadia#784

Merged

Abirdcfly reviewed Mar 1, 2024

feat(worker): add a new core-library-cli runner to host reranking models

39a334a

Signed-off-by: bjwswang <bjwswang@gmail.com>

bjwswang force-pushed the main branch from f79e4bf to 39a334a Compare March 1, 2024 08:54

bjwswang merged commit c164466 into kubeagi:main Mar 1, 2024
2 of 3 checks passed
