41 Commits

Author SHA1 Message Date
showdown 6b13586154 wip 2025-05-19 20:45:50 +02:00
showdown 7192f4bc18 fix: update tool call content formatting and enhance mouse click description 2025-05-19 20:42:18 +02:00
showdown 36cfeffe9c fix: correct center-point calculation in OCR results 2025-05-19 20:35:11 +02:00
showdown 7f5296b2ef fix: increase wait time in search_pc function for improved reliability 2025-05-19 20:33:54 +02:00
showdown e5b3ea8b57 fix: logger 2025-05-19 20:30:01 +02:00
showdown ff7c362cfe fix 2025-05-19 20:28:01 +02:00
showdown b035bee682 fix 2025-05-19 20:21:30 +02:00
showdown c2fb041285 fix: truncate message content and image data to improve processing efficiency 2025-05-19 20:20:00 +02:00
showdown 4369611610 fix: add debug logging and visual indicators for OCR results 2025-05-19 20:19:36 +02:00
showdown 93a01b792b fix: update OCR result coordinates to center position 2025-05-19 20:11:56 +02:00
showdown 3d5f71ec84 fix 2025-05-19 17:19:24 +02:00
showdown 20f05ca991 fix: emphasize priority in search_pc function description 2025-05-19 17:09:21 +02:00
showdown 859e1c2f0b fix: missing bracket 2025-05-19 17:07:03 +02:00
showdown d9a9eba4c7 updated win func 2025-05-19 17:05:38 +02:00
showdown b89051a37f fix 2025-05-19 17:02:48 +02:00
showdown 72a876410c more context to gpt 2025-05-19 16:51:46 +02:00
showdown 46a5bce956 refactor: Update function descriptions for clarity and consistency 2025-05-19 16:41:02 +02:00
showdown e639e1edd3 refactor: Rename press_windows_key to windows_key for consistency 2025-05-19 16:33:59 +02:00
showdown 9bd15d45c5 feat: Add functionality to press Windows key and update function registry 2025-05-19 16:32:09 +02:00
showdown 105ab4a04b feat: wip: give OCR+positions 2025-05-19 16:10:02 +02:00
showdown 5be7f9aadb feat: Add OCR functionality to process method; integrate Tesseract for text extraction from screenshots 2025-05-19 15:59:46 +02:00
showdown 20764d5d19 fix: Simplify click position extraction for screenshot crosshair in tool execution 2025-05-19 13:43:04 +02:00
showdown 158529a2bd fix: Parse tool call arguments as JSON for improved handling in process method 2025-05-19 13:41:25 +02:00
showdown b583094e20 fix: Enhance screenshot functionality; add crosshair drawing and save screenshot to file 2025-05-19 13:39:26 +02:00
showdown d7c4f9b0cb fix: Update image handling in process method; ensure only the last two messages retain images and improve debugging output 2025-05-19 13:27:16 +02:00
showdown 035252c146 fix: Enhance logging for tool calls in process method; handle potential errors in next steps assignment 2025-05-19 13:21:15 +02:00
showdown 892f41f78a fix: Shorten image data in message copies for better debugging; update logging to reflect changes 2025-05-19 13:17:51 +02:00
showdown 0af7dc7699 fix: bug 2025-05-19 13:14:38 +02:00
showdown 2bcddedca5 fix: Adjust message handling in process method; ensure correct image assignment and add next steps output 2025-05-19 13:13:28 +02:00
showdown b881f04acc fix: Update process method return type and handle image attribute correctly; improve error handling 2025-05-19 13:10:46 +02:00
showdown 670066100f feat: Implement logging functionality; add logger configuration and retrieval 2025-05-19 13:05:36 +02:00
showdown 52c455b20c fix: Remove unused PyQt5 and tkinter overlay code; simplify click indicator function 2025-05-19 12:58:34 +02:00
showdown a4e078bc19 tempfix: remove mouse overlay 2025-05-19 12:51:59 +02:00
showdown 1925a77d85 Add screenshot re-execution logic in AIProcessor; append outputs from tool calls 2025-05-19 09:34:21 +02:00
showdown e573ecb553 Add confirmation function and re-execution logic in AIProcessor; clean up web server request handling 2025-05-19 09:30:58 +02:00
showdown f7feb12946 Add screenshot functionality and new commands for wait and reprompt 2025-05-19 09:15:08 +02:00
showdown 66330bfc73 Implement click indicator with red circle display; update server run parameters 2025-05-19 09:00:39 +02:00
showdown 41f7d0e210 Refactor mouse button handling to use string literals instead of ButtonType constants; add debug print for screenshot action in web server 2025-05-19 08:53:46 +02:00
showdown 84d65cb505 Add Pillow dependency and implement screenshot functionality in web server 2025-05-19 01:18:54 +02:00
showdown 7e612c1af7 Add initial implementation of AI agent with mouse and keyboard control features 2025-05-19 00:48:14 +02:00
showdown ed34ebca6a Initial commit 2025-05-18 17:32:16 +00:00