Intel_LLM

This repository contains the files of the project titled "Running GenAI on Intel AI Laptops and Simple LLM Inference on CPU and fine-tuning of LLM Models using Intel® OpenVINO™".

Project Description

This project leverages the TinyLlama model and optimizes it using Intel® OpenVINO™ to create a responsive chatbot. The chatbot is deployed using Gradio for an easy-to-use web interface. The project includes scripts to convert the model to OpenVINO format and compress it for better performance.

How to Run

Step 1: Clone the Repository

First, clone the repository to your local machine:

git clone git@github.com:adilzubair/Bitmasters_Intel_LLM.git
cd Bitmasters_Intel_LLM

Step 2: Install Dependencies

To install the necessary dependencies, run:

python setup.py

Step 3: Convert and Compress the Model

Before running the chatbot, you need to convert the TinyLlama model to the OpenVINO format and optionally compress it for better performance.

To convert and compress the model, run:

python convert_model.py

Step 4: Ensure openvino_model Directory Exists

Make sure the openvino_model directory is created and contains the converted model files. The convert_model.py script will handle this for you.

Step 5: Running the Chatbot

After converting the model, you can run the chatbot using:

python chatbot.py

Usage

The chatbot interface is powered by Gradio. You can adjust the advanced settings such as temperature, top-p, top-k, and repetition penalty to control the behavior of the model's responses. Advanced Options

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.gitignore		.gitignore
Intel_LLM_Project_1c.pdf		Intel_LLM_Project_1c.pdf
Intel_Unnati_Idea_Submission_Bitmasters (1).pptx		Intel_Unnati_Idea_Submission_Bitmasters (1).pptx
README.md		README.md
chatbot.py		chatbot.py
convert_model.py		convert_model.py
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Intel_LLM

Project Description

How to Run

Step 1: Clone the Repository

Step 2: Install Dependencies

Step 3: Convert and Compress the Model

Step 4: Ensure openvino_model Directory Exists

Step 5: Running the Chatbot

Usage

1.Temperature: Controls the randomness of the model's output. Higher values result in more random responses.

2.Top-p: Controls the cumulative probability of token selection.

3.Top-k: Limits the number of token choices to the top k tokens.

4.Repetition Penalty: Penalizes repeated tokens to promote more diverse outputs.

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors 3

Uh oh!

Languages

adilzubair/Bitmasters_Intel_LLM

Folders and files

Latest commit

History

Repository files navigation

Intel_LLM

Project Description

How to Run

Step 1: Clone the Repository

Step 2: Install Dependencies

Step 3: Convert and Compress the Model

Step 4: Ensure openvino_model Directory Exists

Step 5: Running the Chatbot

Usage

1.Temperature: Controls the randomness of the model's output. Higher values result in more random responses.

2.Top-p: Controls the cumulative probability of token selection.

3.Top-k: Limits the number of token choices to the top k tokens.

4.Repetition Penalty: Penalizes repeated tokens to promote more diverse outputs.

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors 3

Uh oh!

Languages

Packages