5be7f9aadb
feat: Add OCR functionality to process method; integrate Tesseract for text extraction from screenshots
Showdown76py2025-05-19 15:59:46 +02:00
20764d5d19
fix: Simplify click position extraction for screenshot crosshair in tool execution
Showdown76py2025-05-19 13:43:04 +02:00
158529a2bd
fix: Parse tool call arguments as JSON for improved handling in process method
Showdown76py2025-05-19 13:41:25 +02:00
b583094e20
fix: Enhance screenshot functionality; add crosshair drawing and save screenshot to file
Showdown76py2025-05-19 13:39:26 +02:00
d7c4f9b0cb
fix: Update image handling in process method; ensure only the last two messages retain images and improve debugging output
Showdown76py2025-05-19 13:27:16 +02:00
035252c146
fix: Enhance logging for tool calls in process method; handle potential errors in next steps assignment
Showdown76py2025-05-19 13:21:15 +02:00
892f41f78a
fix: Shorten image data in message copies for better debugging; update logging to reflect changes
Showdown76py2025-05-19 13:17:51 +02:00
2bcddedca5
fix: Adjust message handling in process method; ensure correct image assignment and add next steps output
Showdown76py2025-05-19 13:13:28 +02:00
b881f04acc
fix: Update process method return type and handle image attribute correctly; improve error handling
Showdown76py2025-05-19 13:10:46 +02:00
1925a77d85
Add screenshot re-execution logic in AIProcessor; append outputs from tool calls
Showdown76py2025-05-19 09:34:21 +02:00
e573ecb553
Add confirmation function and re-execution logic in AIProcessor; clean up web server request handling
Showdown76py2025-05-19 09:30:58 +02:00
f7feb12946
Add screenshot functionality and new commands for wait and reprompt
Showdown76py2025-05-19 09:15:08 +02:00
66330bfc73
Implement click indicator with red circle display; update server run parameters
Showdown76py2025-05-19 09:00:39 +02:00
41f7d0e210
Refactor mouse button handling to use string literals instead of ButtonType constants; add debug print for screenshot action in web server
Showdown76py2025-05-19 08:53:46 +02:00
84d65cb505
Add Pillow dependency and implement screenshot functionality in web server
Showdown76py2025-05-19 01:18:54 +02:00
7e612c1af7
Add initial implementation of AI agent with mouse and keyboard control features
Showdown76py2025-05-19 00:48:14 +02:00