Skip to content

Roadmap for LightRFT v0.1.2 #28

@puyuan1996

Description

@puyuan1996

🗺️ Roadmap for LightRFT v0.1.2

Expected Release: Feb. 2026

✨ New Features

♻️ Refactoring & Optimization

  • Core Logic (Loss & Filtering)
  • Architecture & Interfaces
    • Dataset & Reward: Refactor Dataset and Reward modules for better modularity (refactor(sunjx): refactor dataset and reward module #13).
    • Model Interface: Standardize generate methods and hyperparameters across all models (aligning with grm_vl).
    • Token Alignment: Unify token interfaces between Actor and Reward Model to minimize conversion overhead.
    • Critic: Refactor and enhance Critic model implementation.
  • Data Pipeline
    • Dataclasses: Unify dataset return formats using Dataclasses to simplify Trainer/ExpMaker.
    • Logic Separation: Remove strategy logic from Datasets and standardize batch padding locations.
  • Performance
    • Optimize efficiency for entropy and logit calculations.

⚙️ Compatibility & Dependencies

  • Configuration
    • LoRA Simplification: Drastically simplify LoRA configuration.
      • Implementation: Restrict entry-level arguments to only use_lora and lora_rank. Move all other detailed parameters into the specific LoRA initialization function.
    • DeepSpeed: Clarify ds_config handling and integration within Model initialization.
  • Dependencies
    • vLLM: Add support for the latest version of vLLM.

🐛 Bug Fixes & Maintenance

  • Fixes
    • Fix issues related to fire library usage.
  • Code Style
    • (Ongoing improvements)

📚 Documentation

  • Tutorials & Best Practices
    • GSM8K: Create a comprehensive, step-by-step tutorial for the simplest GSM8K demo.
    • Best Practices: Add 2-3 articles expanding on best practices for training and configuration.
    • LoRA Example: Add a Geo3K LoRA training demo to showcase the new simplified LoRA workflow.
  • Tools & Deployment
    • Project Assistant: Develop an LLM Q&A Assistant for the project (referencing the SGLang Cookbook implementation).
  • Content Updates
    • (Placeholder for general updates)

Before submitting a new issue...

  • Make sure you already searched for relevant issues and discussions, and this feature hasn't been requested before.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workingdocumentationImprovements or additions to documentationenhancementNew feature or requestpolishPolish algorithms, tests or configsrefactorCleanup, formatting, or restructuring of existing code.styleCode or comments formatting

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions