TLDR
I'm aware this section is still long.
I pasted the whole Midjourney user manual into ChatGPT and asked it to create a webapp that would make it easier to write Midjourney prompts. The conversation between me and ChatGPT went more or less like this:
There are quite a lot of parameters to the Midjourney command. There are also quite a few very good websites about the options you can use to get good lighting, a specific style, a particular medium, the look of a known artist or a rendering engine, and a lot of other options which affect the final image.
While there are a lot of resources on the net, I could not find a single place that lists all of these. Some websites only talk about lighting, others talk about rendering engines. This is why I thought it would be nice to create a webapp with as many options as possible, which builds the prompt for you through a basic web UI. However, there's a lot of research work here. Also, copy-pasting data from the Midjourney documentation into an IDE to create combo controls for the web is no fun either.
So I decided to put ChatGPT to the test. I copy-pasted the whole Midjourney manual into ChatGPT and explained that I wanted to build a webapp that lets me input the parameters described in the manual in order to build a prompt for Midjourney. And amazingly enough, it did it! Much like working with a human programmer, I described what I wanted to achieve, I provided the raw data, and I got the code. Quite impressive, I'd say.
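To give an idea of what such a webapp does under the hood, here is a minimal sketch in TypeScript. It is not the code ChatGPT produced for me. The `PromptOptions` shape and the `buildPrompt` function are made up for illustration, and I'm assuming the `--ar` and `--seed` flags work as I remember them from the manual, so check the manual before relying on them.

```typescript
// Hypothetical shape of the values collected from the form controls.
interface PromptOptions {
  subject: string;        // free-text description of the image
  modifiers?: string[];   // e.g. lighting, medium, artist, rendering engine
  aspectRatio?: string;   // e.g. "16:9", emitted as --ar (assumed flag name)
  seed?: number;          // emitted as --seed (assumed flag name)
}

// Assemble the text of an /imagine command from the selected options.
function buildPrompt(options: PromptOptions): string {
  // Subject first, then the comma-separated modifiers; empty entries are skipped.
  const textParts = [options.subject, ...(options.modifiers ?? [])]
    .map((part) => part.trim())
    .filter((part) => part.length > 0);

  // Parameters go at the end of the prompt.
  const flags: string[] = [];
  if (options.aspectRatio) flags.push(`--ar ${options.aspectRatio}`);
  if (options.seed !== undefined) flags.push(`--seed ${options.seed}`);

  return ["/imagine prompt:", textParts.join(", "), ...flags]
    .filter((piece) => piece.length > 0)
    .join(" ");
}

const examplePrompt = buildPrompt({
  subject: "a lighthouse on a cliff",
  modifiers: ["golden hour lighting", "oil painting"],
  aspectRatio: "16:9",
  seed: 42,
});
console.log(examplePrompt);
// /imagine prompt: a lighthouse on a cliff, golden hour lighting, oil painting --ar 16:9 --seed 42
```

In a real page, each combo control would simply feed one entry of `modifiers` or one of the flag fields.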
Test the web application here - midjourney-prompt-generator.html
Broadly, pretty similar to working with a human software developer.
What I did above was simply copy-paste all the text from the Midjourney manual here - https://midjourney.gitbook.io/docs/user-manual#parameters-to-imagine
So far, so good. I enjoyed that it always summarises what we have discussed so far before proceeding. That's something I try to do as well when interacting with humans.
Now it really went a bit too far. It emulates the human condition a little too well. "Depending on the complexity of the task and my workload, it might take me a few hours to a few days to finish".
Really ???
It's really nice having a companion able to do this by itself:
That was my first try at asking ChatGPT to process some external documentation, understand it, then write a webapp around it. Exactly how I would have done it working with a human programmer. And it really worked.
The first iteration:
One of the final versions:
Was it useful having ChatGPT? Definitely, a lot! It's so nice to ask someone "give me the top 25 well known painters as an HTML select input, Bootstrap styled".
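For reference, this is roughly the kind of markup such a request boils down to, sketched here as a small TypeScript helper. The `renderSelect` and `escapeHtml` names are mine, the three painters are just a stand-in sample (not the list ChatGPT gave me), and I'm assuming Bootstrap 5, where the class for a styled select is `form-select`.

```typescript
// Escape the characters that are significant in HTML text and attribute values.
function escapeHtml(text: string): string {
  return text
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;");
}

// Render a <select> element as an HTML string.
// "form-select" is the Bootstrap 5 class (assumption: the page uses Bootstrap 5).
function renderSelect(id: string, placeholder: string, choices: string[]): string {
  const optionLines = choices.map(
    (choice) => `  <option value="${escapeHtml(choice)}">${escapeHtml(choice)}</option>`
  );
  return [
    `<select id="${escapeHtml(id)}" class="form-select">`,
    `  <option value="">${escapeHtml(placeholder)}</option>`,
    ...optionLines,
    `</select>`,
  ].join("\n");
}

// Illustrative sample only, not the actual list used in the webapp.
const samplePainters = ["Vincent van Gogh", "Claude Monet", "Pablo Picasso"];
console.log(renderSelect("artist", "Any artist", samplePainters));
```

The empty first option lets the user leave the artist out of the prompt altogether.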
Things I hated about the process
- The process is slow. ChatGPT types slowly, and it's so verbose. There's no way to interrupt it: once it starts on a topic, you have to wait a minute or more for it to finish. Yeah, I know it's rude to interrupt someone, but still... Asking it to be less verbose or to type faster does not work.
- Just like a real human, it loses focus if the code is too long. It simply stops typing, and I have to ask it to resume: "please go on", "why did you stop?". Really annoying, as sometimes it might miss a line of code or two. Specifically asking "don't miss any lines!" seems to help, as it repeats the last few lines of code when it resumes. This seems to be a technical limitation for now; I could not convince it to output the whole code in a single piece.
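When a resumed reply repeats the last few lines of the previous one, the two pieces can be stitched together mechanically instead of by hand. Here is a minimal TypeScript sketch of that idea. The `mergeChunks` name is hypothetical, and it assumes the repeated lines match exactly, which will not hold if the first reply was cut off in the middle of a line.

```typescript
// Join two chunks of code, dropping the lines at the start of `continuation`
// that repeat the lines at the end of `firstPart`.
function mergeChunks(firstPart: string, continuation: string): string {
  const head = firstPart.split("\n");
  const tail = continuation.split("\n");
  const maxOverlap = Math.min(head.length, tail.length);

  // Try the longest possible overlap first, then shrink it line by line.
  for (let size = maxOverlap; size > 0; size--) {
    const headEnd = head.slice(head.length - size);
    const tailStart = tail.slice(0, size);
    if (headEnd.every((line, i) => line === tailStart[i])) {
      return [...head, ...tail.slice(size)].join("\n");
    }
  }

  // No repeated lines found: concatenate the chunks as they are.
  return [...head, ...tail].join("\n");
}

const firstReply = "<form>\n  <label>Style</label>\n  <select>";
const resumedReply = "  <label>Style</label>\n  <select>\n  </select>\n</form>";
console.log(mergeChunks(firstReply, resumedReply));
```

It is still worth reading the merged result, since a line that happens to appear at both the end of one chunk and the start of the next would be treated as a repeat.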
At some point it became very annoying to wait for it to render the code again and again. I started asking it to give me only the updates, and I made the changes myself. Later on in the process I only asked it how to do certain things and wrote the code myself.
thanks for your effort to show the whole process. greetings from austria
Woow, that was very insighful. on what you can do... as a noob in coding I lack the "grammar" to ask stuff to ChatGPT but this post is gold in that sense: it gives me direction! Thanks!
Let's see what happened next in your experiment. Very interesting!