vibe coding lead to a very lazy and wrong application of constrained decoding - simply cutting off the llm after a closing brace, bad bad bad - also I'm not sure why it was trying to deliver json at all, this plays to all of the weaknesses of gen ai, bad llm bad
checkpoint 2 - we are in a very good position right now, audio transcription is working, gradio page looks good, local llms are working, returning the consultation, the json that will be sent to the mcp client, and a debugging panel