One major problem (which it me also shows it is not scraping examples) is the every submission of the prompt results in a different completion.
Some work, some don't.
Maybe your "temperature" is too high?
Yeah, I need to play around with that.
But I've also found that adding a requirement like:
append markdown list: each requirement and fully explain detail how met
Not only only improves the quality of what it generated, it also tells me how I specified things incorrect.
It does not require full text, since the prompt size is limited (too big is rejected, I'm setting the highest possible for size) I've found you can really get creative on terse requirements to reduce the size, and since it is explaining what it understood you can refine it as needed.