No description
  • TypeScript 98.2%
  • CSS 1.2%
  • HTML 0.6%
Find a file
Vincenzo Fabiano 33de74e035 Update
- Migliorato tutto il sistema
2026-01-16 04:07:10 +01:00
components Update 2026-01-16 04:07:10 +01:00
contexts Init 2026-01-14 16:21:29 +01:00
hooks Update 2026-01-16 04:07:10 +01:00
i18n Update 2026-01-16 04:07:10 +01:00
public Refactoring 2026-01-15 17:34:33 +01:00
services Update 2026-01-16 04:07:10 +01:00
utils Update 2026-01-16 04:07:10 +01:00
.gitignore Init 2026-01-14 16:21:29 +01:00
App.tsx Update 2026-01-16 04:07:10 +01:00
Created Refactoring 2026-01-15 17:34:33 +01:00
index.css Update 2026-01-16 04:07:10 +01:00
index.html Update 2026-01-16 04:07:10 +01:00
index.tsx Update 2026-01-16 04:07:10 +01:00
metadata.json Refactoring 2026-01-15 17:34:33 +01:00
package-lock.json Update 2026-01-16 04:07:10 +01:00
package.json Update 2026-01-16 04:07:10 +01:00
README.md Update 2026-01-16 04:07:10 +01:00
Smart E-commerce_ Integrazione AI e nuove esperienze d'acquisto_Proposta progettuale.pdf Init 2026-01-14 16:21:29 +01:00
tsconfig.json Refactoring 2026-01-15 17:34:33 +01:00
types.ts Update 2026-01-16 04:07:10 +01:00
vite.config.ts Refactoring 2026-01-15 17:34:33 +01:00

LLM OCR Studio

This is a modern web interface for OCR using Vision LLMs.

🚀 Quick Start

  1. Install dependencies:
npm install
npm run dev

Open your browser at http://localhost:3000.

2. Configure LLM

The application runs entirely in the browser but needs an OpenAI-compatible API to process images. The interface will warn you if the configuration is missing.

Click the Settings icon in the top right to configure:

  • LM Studio (Local):
    • Base URL: http://localhost:1234/v1
    • Model: Enter the ID of the model loaded in LM Studio (e.g., llama-3.2-vision).
    • API Key: Leave empty.
  • OpenRouter: https://openrouter.ai/api/v1 + API Key + Model Name.
  • Any OpenAI Compatible Endpoint.

3. Usage

  1. Trascina un PDF nell'area di upload.
  2. Puoi trascinare più file o aggiungerne altri nella schermata di revisione (Batch processing).
  3. The app converts files to images (Client-side).
  4. Sends images to the configured Vision LLM.
  5. Returns formatted Markdown.

🛠 Requisiti di Sistema

  • Node.js: v18+
  • API Access: Access to a Vision model (e.g., GPT-4o, Gemini 1.5, Qwen-VL via LMStudio).