Smart text chunker for LLM preprocessing (sections → paragraphs → sentences → hard splits).
-
Updated
Dec 7, 2025 - Python
Smart text chunker for LLM preprocessing (sections → paragraphs → sentences → hard splits).
A modular Python tool that loads messy CSV/Excel files, cleans them automatically, generates analytical statistics, and produces a polished PDF report. Includes a CLI interface, advanced cleaning engine, and portfolio-grade architecture.
A fast PyQt‑based image annotation tool with customizable hotkeys, per‑folder labels, CSV export, and “next untagged” navigation — ideal for prepping ML training datasets.
A desktop app for drawing bounding boxes on images, labeling them, and exporting data for object detection or classification workflows. Built with PyQt5 and Pillow.
Automatic caching for LLM API responses (OpenAI, Gemini, Anthropic) using a lightweight Python library.
Add a description, image, and links to the machine-learning-tools topic page so that developers can more easily learn about it.
To associate your repository with the machine-learning-tools topic, visit your repo's landing page and select "manage topics."