  • Lahore, Pakistan

SohaibAli9/README.md

Hi, I'm Sohaib 👋

I build AI systems that turn messy real-world documents into reliable structured data — and agents that act on it.

Independent AI engineer focused on intelligent document processing (IDP) and applied LLM / agent systems. I take projects from "a folder of 10,000 ugly PDFs" to clean, validated, production-grade data — and from "we wish this ran itself" to an agent that does.

What I do

🗂 Intelligent Document Processing — High-accuracy extraction from hard documents: scanned forms, technical drawings, financial statements, claims. Staged pipelines (classify → extract → validate against ground truth), corpus-scale batch processing with resume + caching, delivered as Excel / CSV / JSON.

🤖 Agentic & LLM systems — Tool-using agents, local & air-gapped LLM deployments, RAG, and evaluation harnesses. I measure accuracy and cost — not vibes.

⚙️ Shipping — Prototype to a real URL: Docker, observable pipelines, clean deploys. I deliver working software, not notebooks.
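
As a rough illustration of the staged pipeline idea above (classify → extract → validate against ground truth, with resume + caching), here is a minimal Python sketch. Everything in it is an assumption made for illustration: the function names, the toy "key: value" extractor, and the on-disk JSON cache layout are hypothetical and are not code from any project listed here.

```python
import hashlib
import json
from pathlib import Path


def classify(text: str) -> str:
    """Hypothetical classifier: a keyword check standing in for a real model."""
    return "invoice" if "invoice" in text.lower() else "other"


def extract(text: str, doc_type: str) -> dict:
    """Toy extractor: collect 'key: value' lines into a dict of fields."""
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip().lower()] = value.strip()
    return {"doc_type": doc_type, "fields": fields}


def validate(record: dict, expected: dict) -> dict:
    """Compare extracted fields with ground-truth values and list mismatches."""
    mismatches = sorted(
        key for key, value in expected.items()
        if record["fields"].get(key) != value
    )
    return {**record, "valid": not mismatches, "mismatches": mismatches}


def process_corpus(docs: dict, ground_truth: dict, cache_dir: Path) -> dict:
    """Run classify -> extract -> validate over a corpus.

    Extraction results are cached on disk, keyed by a hash of the document
    content, so an interrupted batch can resume without redoing finished work.
    Validation is cheap, so it is re-run on every pass.
    """
    cache_dir.mkdir(parents=True, exist_ok=True)
    results = {}
    for name, text in docs.items():
        key = hashlib.sha256(text.encode("utf-8")).hexdigest()
        cache_file = cache_dir / f"{key}.json"
        if cache_file.exists():
            # Resume path: reuse the cached extraction.
            record = json.loads(cache_file.read_text(encoding="utf-8"))
        else:
            record = extract(text, classify(text))
            cache_file.write_text(json.dumps(record), encoding="utf-8")
        results[name] = validate(record, ground_truth.get(name, {}))
    return results
```

Calling `process_corpus` a second time with the same cache directory skips the extraction stage for every document it has already seen. Keying the cache on content rather than file name is one possible design choice: renamed duplicates then hit the cache as well.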

Selected work

| Project | What it is |
| --- | --- |
| axon | Self-contained edge-intelligence agent for air-gapped infra. Local LLM (Qwen3), tool-calling, single-package deploy. |
| gerberview | Browser-native Gerber / PCB viewer. Rust → WASM + WebGL, 60 fps, zero backend; nothing leaves your machine. |
| tradestation-backtest-toolkit | Automated backtesting + statistical analysis pipeline for quant research. |

Toolbelt

Python · TypeScript · Rust · LLM APIs (Claude · Gemini · GPT) · LangChain · PyMuPDF / OCR · PyTorch · Docker · Azure · WASM / WebGL

Let's talk

Got documents you wish were data, or a workflow you wish were an agent?

📫 sohaibali999@gmail.com · 💼 LinkedIn

Popular repositories

  1. SDBS-School-DataBase-System (Public)

    My second-semester final project for OOP. It manages classes, students, teachers, and everything related to them.

    C++

  2. SohaibAli9 (Public)

    Config files for my GitHub profile.

  3. Eruditus (Public)

    Web Project

    1

  4. StudentAnalysis (Public)

    Python

  5. DS_CrowdS_Source_01 (Public)

    Java

  6. Ghost-FYP (Public)

    Java