GeoChat is the first grounded Large Vision Language Model tailored specifically to Remote Sensing (RS) scenarios. Unlike general-domain models, GeoChat excels at handling high-resolution RS imagery, ...
This project implements a full pipeline for detecting a document in an image and applying a perspective transformation to generate a clean, top-down scanned result, similar to mobile document scanner ...
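
The perspective-transformation step can be illustrated with a minimal sketch. It assumes the four document corners have already been detected and are ordered top-left, top-right, bottom-right, bottom-left. The description does not say which library the project uses, so this sketch relies only on the Python standard library. The function names (`solve_linear`, `homography`, `warp_point`), the corner coordinates, and the 400x560 output size are all hypothetical and chosen for illustration.

```python
def solve_linear(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("degenerate corner configuration")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def homography(src, dst):
    """Return the 3x3 matrix H mapping each src corner (x, y) to dst (u, v).

    H is normalised so that its bottom-right entry is 1, which leaves
    eight unknowns; four point pairs give the eight equations needed.
    """
    a, b = [], []
    for (x, y), (u, v) in zip(src, dst):
        a.append([x, y, 1, 0, 0, 0, -u * x, -u * y])
        b.append(u)
        a.append([0, 0, 0, x, y, 1, -v * x, -v * y])
        b.append(v)
    h = solve_linear(a, b)
    return [h[0:3], h[3:6], [h[6], h[7], 1.0]]


def warp_point(h, x, y):
    """Apply H to one point, dividing by the projective coordinate w."""
    w = h[2][0] * x + h[2][1] * y + h[2][2]
    return ((h[0][0] * x + h[0][1] * y + h[0][2]) / w,
            (h[1][0] * x + h[1][1] * y + h[1][2]) / w)


# Hypothetical detected corners (pixels) and the top-down target rectangle.
corners = [(120, 80), (520, 110), (560, 640), (90, 600)]
target = [(0, 0), (400, 0), (400, 560), (0, 560)]
H = homography(corners, target)
```

Producing the scanned image would then mean resampling every output pixel through the inverse mapping (`homography(target, corners)`). Image libraries typically provide this resampling as a single call, which is likely preferable to a hand-written loop in practice.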