Support SweRank code re-ranking by aaryanshroff · Pull Request #338 · castorini/rank_llm

aaryanshroff · 2026-02-04T20:26:52Z

Pull Request Checklist

Reference Issue

Please provide the reference to issue this PR is addressing (# followed by the issue number). If there is no associated issue, write "N/A".

ref:

Checklist Items

Before submitting your pull request, please review these items:

Have you followed the contributing guidelines?
Have you verified that there are no existing Pull Requests for the same update/change?
Have you updated any relevant documentation or added new tests where needed?

PR Type

What kind of change does this PR introduce?

Add get_content() to Candidate dataclass to centralize the doc key lookup logic (text/segment/contents/content/body/passage). Update _convert_doc_to_prompt_content and _create_prompt_code to use it, fixing a bug where the CODE path used doc.get("content") missing the "contents" key.

Removes get_content() from Candidate and reverts _convert_doc_to_prompt_content back to taking doc: Dict directly. Fixes the only real bug: the final else branch now uses doc.get("passage", "") instead of doc["passage"] to avoid KeyError on malformed docs. Co-Authored-By: Claude Sonnet 4.6 <[email protected]>

aaryanshroff · 2026-02-19T19:33:08Z

src/rank_llm/rerank/listwise/rank_listwise_os_llm.py

 TEMPLATES = files("rank_llm.rerank.prompt_templates")


+class RerankType(Enum):


SweRank uses a different prompt truncation/creation algorithm than RankLLM. This enum lets us choose which one to use based on the task.

aaryanshroff · 2026-02-19T19:34:21Z

src/rank_llm/rerank/prompt_templates/swerank_github_issue_template.yaml

@@ -0,0 +1,11 @@
+method: "singleturn_listwise"


From Lucas's PR: #323

aaryanshroff · 2026-02-19T19:39:41Z

src/rank_llm/rerank/listwise/rank_listwise_os_llm.py

            token_str = " > ".join([f"[{i+1}]" for i in range(current_window_size)])

-        _output_token_estimate = len(self._tokenizer.encode(token_str)) - 1
+        _output_token_estimate = len(self._tokenizer.encode(token_str)) + 2


SweRank output was getting interrupted, leading to nonsense results

aaryanshroff · 2026-02-19T19:41:38Z

src/rank_llm/rerank/listwise/rank_listwise_os_llm.py

+        else:
+            return re.sub(r"\[(\d+)\]", r"(\1)", s)
+
+    def _create_prompt_code(


Pretty much identical to https://github.com/SalesforceAIResearch/SweRank/blob/694af0999318d7b673cd9b27c11025cd0bf9080c/src/reranker/utils/rank_listwise_os_llm.py#L296

aaryanshroff · 2026-02-19T19:42:06Z

src/rank_llm/rerank/listwise/rank_listwise_os_llm.py

+            return messages, self.get_num_tokens(messages)
+        return prompt, self.get_num_tokens(prompt)
+
+    def _create_prompt_text(


Old / RankLLM-style truncation algorithm

ronakice · 2026-02-19T19:43:05Z

@claude review

claude · 2026-02-19T19:43:22Z

Claude Code is working…

I'll analyze this and get back to you.

View job run

ronakice · 2026-03-03T01:46:05Z

@codex review

chatgpt-codex-connector · 2026-03-03T01:46:10Z

To use Codex here, create a Codex account and connect to github.

Raptors65 and others added 18 commits November 5, 2025 12:07

Add SweRank support

3d57870

Merge remote-tracking branch 'origin/main' into swerank-support

4b15c80

Merge remote-tracking branch 'upstream/main' into swerank-support

1904814

Add swerank rerank demo

5472a6d

Remove outdated usage example

eaa00cd

Increase context size

8db7783

Add find_best_gpu util

df4afab

Allow script to demo both swe-bench and loc-bench

1b898cc

Truncate using SweRank logic

293dee9

Clean up

a8a8819

Set RerankType to CODE in swerank demo script

abd9f0e

DRY

551ae0c

Fix bug in applying permuatation (move inside loop)

1884fb4

Increase context size (paper does not match Swerank repo)

95263ca

Lint fix

0ce5f76

pre-commit

79ef5b1

aaryanshroff commented Feb 19, 2026

View reviewed changes

Reduce output token estimate

bb8c83a

aaryanshroff commented Feb 19, 2026

View reviewed changes

ronakice marked this pull request as ready for review February 19, 2026 19:42

aaryanshroff changed the title ~~Swerank support~~ Support SweRank code re-ranking Feb 19, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support SweRank code re-ranking#338

Support SweRank code re-ranking#338
aaryanshroff wants to merge 19 commits intocastorini:mainfrom
aaryanshroff:swerank-support

aaryanshroff commented Feb 4, 2026 •

edited

Loading

Uh oh!

aaryanshroff Feb 19, 2026

Uh oh!

aaryanshroff Feb 19, 2026

Uh oh!

aaryanshroff Feb 19, 2026

Uh oh!

aaryanshroff Feb 19, 2026

Uh oh!

aaryanshroff Feb 19, 2026

Uh oh!

ronakice commented Feb 19, 2026

Uh oh!

claude bot commented Feb 19, 2026

Uh oh!

ronakice commented Mar 3, 2026

Uh oh!

chatgpt-codex-connector bot commented Mar 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		TEMPLATES = files("rank_llm.rerank.prompt_templates")


		class RerankType(Enum):

Conversation

aaryanshroff commented Feb 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Checklist

Reference Issue

Checklist Items

PR Type

Uh oh!

aaryanshroff Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

aaryanshroff Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

aaryanshroff Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

aaryanshroff Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

aaryanshroff Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

ronakice commented Feb 19, 2026

Uh oh!

claude bot commented Feb 19, 2026

Uh oh!

ronakice commented Mar 3, 2026

Uh oh!

chatgpt-codex-connector bot commented Mar 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

aaryanshroff commented Feb 4, 2026 •

edited

Loading