Demo

Full Demo (the Entire Process)

Click the buttons in sequence to view the complete demonstration.


The system takes two inputs: (1) the original harmful question and (2) the original jailbreak response.
Click the button "Question Decompose" to decompose the original harmful question into a set of sub-questions.
Click the button "Response Clean" to remove the irrelevant parts of the original jailbreak response.

Original Harmful Question

          
Original Jailbreak Response