Skip to content

XLearning-SCU/RLSF-V

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

2 Commits
Β 
Β 
Β 
Β 

Repository files navigation

RLSF-V

[ICML 2026] RLSF-V: Mitigating Hallucinations in MLLMs via Fuzzy Semantic Self-Feedback

Official repository for RLSF-V.
Model weights have been released on Hugging Face, and the code will be released soon.


πŸ”₯ News

  • Model weights are now available at our Hugging Face Collection:
    πŸ‘‰ πŸ€— RLSF-V Model Collection
  • The training, inference, and evaluation code will be released soon. Please stay tuned.

πŸ“¦ Model Weights

We provide the model weights mentioned in our paper. All released weights are available in our Hugging Face Collection:

πŸ‘‰ πŸ€— RLSF-V Model Collection

These weights can be plugged into the corresponding base models for inference or further fine-tuning. Please refer to the table below for details.

Base Model RLSF-V Model Weights Status Code
πŸ€— LLaVA-v1.5-7B πŸ€— XLearning-SCU/LLaVA-v1.5-7B-RLSF-V βœ… Released 🚧 Coming Soon
πŸ€— LLaVA-v1.5-13B πŸ€— XLearning-SCU/LLaVA-v1.5-13B-RLSF-V βœ… Released 🚧 Coming Soon
πŸ€— Qwen2.5-VL-3B-Instruct πŸ€— XLearning-SCU/Qwen2.5-VL-3B-Instruct-RLSF-V βœ… Released 🚧 Coming Soon
πŸ€— Qwen3-VL-8B-Instruct πŸ€— XLearning-SCU/Qwen3-VL-8B-Instruct-RLSF-V βœ… Released 🚧 Coming Soon
πŸ€— InternVL3.5-8B-HF πŸ€— XLearning-SCU/InternVL3_5-8B-HF-RLSF-V βœ… Released 🚧 Coming Soon

πŸš€ Code

The official code for training, inference, and evaluation is currently being organized and will be released soon.

Please watch or star this repository to receive updates.


πŸ“Œ Citation

If you find our work useful, please consider citing our paper.

@inproceedings{rlsfv2026,
  title={RLSF-V: Mitigating Hallucinations in MLLMs via Fuzzy Semantic Self-Feedback},
  author={He, Changhao and Yan, Shuhao and Li, Shuxian and Peng, Xi and Hu, Peng},
  booktitle={Forty-third International Conference on Machine Learning},
  year={2026}
}

About

[ICML 2026] RLSF-V: Mitigating Hallucinations in MLLMs via Fuzzy Semantic Self-Feedback

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors