Wednesday, June 24, 2026

Still working on Side-scroller game


 I have switched to using the PI agent to work on the game.  I'm still using the Qwen3.6 model through Ollama.  It seems to work a little better through Pi.  I have still been trying to get it to work continuously.  I tried a few extensions and finally ended up making my own that seems to do what I want.

With no extensions I was running into two problems keeping the agent working continuously.  The first issue is that the agent would just stop every so often and I would have to restart it by just saying something like "Continue".  I found an extension for the pi agent that would auto continue when this happens and that seemed to work fine.

That extension didn't solve the second problem that I was running into, however.  Sometimes the agent would just get stuck either going through the same thoughts over and over, or just outputting something very simple, like a br tag, over and over again.  To solve this I ended up making my own extension that solves both problems.  I used pi to make the extension and then modified it slightly by hand.

The way my extension works is, anytime the agent stops it just sends the message "Continue".  It also has a timeout that can be set to some number of seconds.  I set it to 1800 sec. or 30 minutes.  This means every 30 minutes the agent is stopped, and then the extension sends "Continue" and it starts working again.

This seems to keep the agent working well, and stops it often enough that it doesn't get lost thinking about something unimportant.  It seems to be making progress on the game again.  The game is here https://github.com/mattdyer/side-scroller.  I have it writing unit tests and acceptance tests for all parts of the game.  It has written a lot of code, but the game isn't playable yet.

Thursday, June 18, 2026

Opencode side scroller game

I have the Qwen 3.6 coder model working in Opencode on a side scrolling game.  The code is here https://github.com/mattdyer/side-scroller.  We'll see how far it gets.  I installed a goal extension in opencode, but it doesn't seem to be working.  It is supposed to keep running the model until the goal is complete, but so far I have to manually keep re-running it.  In the repo you can see a PLAN.md.  I have a goal set to implement all the steps in the plan.  The model sees the goal and keeps working on it, but it just stops every hour or so and I have to start it again.  Progress has been pretty slow so far.  I might try playing with some of the settings for the goal plugin and see if I can get it working.  Here is the plugin that I installed. https://github.com/prevalentWare/opencode-goal-plugin  Maybe there is a better plugin that I could try.  I'll look around.

Tuesday, June 16, 2026

Combining an Opencode coding model and a ComfyUI image generation model

 


I setup ComfyUI as I posted about previously.  I also have Opencode setup with some local models.  For this I used Qwen 3.6 coder.

I had the model help me write a python script that can call the ComfyUI image generation api endpoint.  After you have a workflow setup in ComfyUI you can download json that can be posted to ComfyUI to run that workflow.  The python script allows passing the image prompt and the dimensions as command line arguments.

Then I asked the model to use the python script and generate an html image gallery with six pictures of nature scenes.  The model was able to come up with 6 prompts, run them through the script and save the images.  It then built a simple html page that shows the images.  You can see this in gallery.html in the repo.

I then setup another workflow in ComfyUI to remove backgrounds from images so you can make pngs with transparent backgrounds.  I took that script and the original one and moved them into a skills folder with a SKILL.md explaining how to use them.

Then I asked the coding agent to make a simple solar system simulation with a star and 4 planets.  I told it to use the new skill to generate images with transparent backgrounds.  This took a few tries and some rearranging of the scripts and the skill before it worked, but the end result looks good.  You can see it in simulation.html in the repo.


Here is the repo with all the code I talked about in this article https://github.com/mattdyer/comfyuiapi

Thursday, June 11, 2026

ComfyUI

 Over the last few days I have been playing with ComfyUI.  I've never tried it before, but I have seen it mentioned many times.  I have mostly been generating images with Qwen-Image 2512.  It works well and I'm able to run it on my laptop.  It is slow.  It takes about 20 min. to generate an image, but it has been fun to mess around with.  Here are a couple of examples of things I generated.


Prompt: dragon: red and black scales, wings, left side of image, facing towards center

female warrior: black hair, messy hair, blue eyes, holding sword right side of image, facing towards center, black armor

background: castle with fires burning on top of towers





Prompt: a space scene with a sun and planets, asteroids shooting across


You can do different resolutions and aspect ratios.  It seems to work fine with simple prompts, but it will follow specific instructions if you provide them.


EDIT: I got this to start using my video card and it is much much faster.  What was taking about 20 minutes now takes 30-40 sec.  I have a GeForce 5060 laptop version.

Monday, June 8, 2026

Choosing a Direction with AI


 Recently I feel blocked.  There is so much that can be done with AI that I don't know where to start.  I am playing with a local image model currently (https://github.com/ideogram-oss/ideogram4), but it is really slow on my computer.  It is fun, but there isn't much to explore there as far as coding.  I have seen some things recently where people took old hardware, an mp3 player in one example, and had AI write new firmware for it.  I have had some interest in trying to write a basic operating system for a computer.  Maybe I could get AI to help me with that.  I think I may have tried that once already, but AI wasn't quite good enough yet.  It probably is now if I use a paid model.  I need to decide what to do there also.  I'm not really making any money on any side projects that I have worked on, so I don't know how many AI accounts I can pay for.  I'm currently paying for Github copilot, but they just changed the billing, so I don't know if it is worth it anymore.  I might switch to something else.

I have run some local coding models using opencode and had decent results. I've also been playing with paperclip AI.  It doesn't seem to work very well with local models, but it works really well with Anthropic models.  I have built a few different projects.  None of them are production ready yet, but I think I could get them there using that interface.

The image at the top was generated with edeogram4 on my laptop.

Still working on Side-scroller game

 I have switched to using the PI agent to work on the game.  I'm still using the Qwen3.6 model through Ollama.  It seems to work a littl...