You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I've been playing around with ChatGPT (api) with many variations of prompts. It is spitting out complete nonsense reasoning and breach predictions. Has any benchmarking been done for the compliance breach prediction pipeline? If so, I would love to look at the documentation.
Short answer: Nope, no benchmarking done
The consensus of using this comes from the prototype breach reports that @lepisma was generating last month. The idea was that we want a system that provides some mechanism which is better than the current method (randomly looking through calls or relying on client feedback). Thus large number of FPs is not a concern. Providing baseline accuracy numbers and tuning this system for better performance were kept as future tasks.
Nonetheless, I'm curious what is 'completely nonsense' reasoning. Lets schedule a call for that
I've been playing around with ChatGPT (api) with many variations of prompts. It is spitting out complete nonsense reasoning and breach predictions. Has any benchmarking been done for the compliance breach prediction pipeline? If so, I would love to look at the documentation.
@harshithere
The text was updated successfully, but these errors were encountered: