Hey @Exnav29 , thanks a lot for your feedback – those are really valid points.
I actually used real-life emails to refine the sentiment analysis prompt. Since I implemented this for Mike last week, we agreed to review it together with Sarah in about two weeks to see how accurate the urgency scoring is. She’ll flag any incorrect cases directly in Slack (e.g., with a thumbs down), so I can iterate on the setup if needed. That said, when we first connected it to their Gmail account, it processed 38 existing emails and assigned the urgency correctly, which gives me a lot of confidence in the current setup.
The idea of a separate review log is a good one, but in this case Slack effectively acts as the log. Each message includes the confidence score, so after a couple of weeks I should have a good sense of how well the scoring holds up.