| title | NIELIT Ropar Assistant |
|---|---|
| emoji | 🎓 |
| colorFrom | red |
| colorTo | purple |
| sdk | gradio |
| sdk_version | 6.1.0 |
| app_file | app.py |
| pinned | true |
| suggested_hardware | zero-a10g |
| license | mit |
| short_description | A fine-tuned Llama-3.2-1B for NIELIT Ropar (CPU Optimized). |
| thumbnail | https://cdn-uploads.huggingface.co/production/uploads/6474405f90330355db146c76/fxsvfNs1T9jIyZkxWMCnd.png |
A domain-adapted Small Language Model (SLM) tailored for NIELIT Ropar, optimized for edge inference on CPU hardware.
General-purpose Large Language Models (LLMs) often lack the granular, domain-specific context required for specialized organizational tasks. For institutions like NIELIT Ropar, relying on generic models leads to hallucinations about institution-specific data (e.g., fee structures, faculty details).
This project solves that challenge by engineering a domain-adapted SLM. By fine-tuning Llama-3.2-1B and quantizing it for CPU inference, we created a lightweight, privacy-focused assistant that delivers accurate, verifiable answers on fees, faculty, and coursework without relying on expensive external APIs.
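As one illustration of the SFT data preparation step, institutional Q&A pairs can be rendered into the Llama-3 instruct chat template before training. The helper below is a minimal sketch, not the project's actual pipeline: the function name and the placeholder Q&A content are assumptions for illustration only.

```python
# Sketch: shaping Q&A pairs into the Llama-3 instruct template for SFT.
# The question/answer content below is a placeholder, not real NIELIT data.

def to_llama3_chat(question: str, answer: str) -> str:
    """Render one Q&A pair using the Llama-3 special tokens."""
    return (
        "<|begin_of_text|><|start_header_id|>user<|end_header_id|>\n\n"
        f"{question}<|eot_id|>"
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
        f"{answer}<|eot_id|>"
    )

# Build a tiny illustrative dataset of formatted training strings
pairs = [("What courses are offered at NIELIT Ropar?", "...")]
dataset = [to_llama3_chat(q, a) for q, a in pairs]
print(dataset[0][:50])
```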
- 🤖 Base Model: Meta Llama-3.2-1B (Instruct)
- ⚡ Fine-Tuning: Supervised Fine-Tuning (SFT) using Unsloth (LoRA adapters, 2x speedup).
- 📉 Quantization: Converted to GGUF format (`q4_k_m`) via `llama.cpp` for optimized CPU performance.
- 🌐 Deployment: Hosted on Hugging Face Spaces via Gradio + `llama-cpp-python`.
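A rough back-of-envelope estimate shows why the quantized model runs comfortably on CPU. The figures below are assumptions: Llama-3.2-1B has roughly 1.2B parameters, and `q4_k_m` averages on the order of 4.5 bits per weight (it mixes 4-bit and 6-bit blocks).

```python
# Back-of-envelope size estimate for a q4_k_m GGUF of a ~1.2B-parameter model.
# Both numbers are approximations, not measured values.
params = 1.2e9           # assumed parameter count for Llama-3.2-1B
bits_per_weight = 4.5    # rough average for q4_k_m mixed quantization

size_gb = params * bits_per_weight / 8 / 1e9  # bits -> bytes -> GB
print(f"Estimated model file size: ~{size_gb:.2f} GB")
```

Under these assumptions the weights come out well under 1 GB, small enough to load on a typical laptop without a GPU.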
| Resource | Link |
|---|---|
| 🔴 Live Demo | Hugging Face Space |
| 💻 GitHub Repo | lovnishverma/NIELIT-Assistant |
| 📦 Model Weights | Hugging Face Model Repo |
| 📓 Training Code | Google Colab Notebook |
| 📝 Technical Blog | Medium Article |
You can run this model locally on your laptop (CPU-only) using Python.
1. Install Dependencies

```bash
pip install llama-cpp-python huggingface_hub
```
2. Run Inference Script

```python
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

# Download the GGUF model automatically
model_path = hf_hub_download(
    repo_id="LovnishVerma/nielit-ropar-GGUF",
    filename="nielit-ropar.q4_k_m.gguf"
)

# Initialize the model (CPU)
llm = Llama(model_path=model_path, n_ctx=2048, verbose=False)

# Chat
output = llm(
    "Q: What courses are offered at NIELIT Ropar? A:",
    max_tokens=128,
    stop=["Q:", "\n"],
    echo=True
)
print(output['choices'][0]['text'])
```

Developed by Lovnish Verma, Project Engineer at NIELIT Ropar
