Originally from Star Trek, this simulation puts trainees in a rescue mission where every choice leads to failure.
It's all about testing ethics and stress management under pressure.
Captain Kirk made history by secretly reprogramming the test so he could win, showing it's okay to challenge "no-win" scenarios.
The way these tests are set up sometimes lets AIs find shortcuts instead of actually reasoning things out.
Since code and answers can be found online, developers may need to make future tests more private or unpredictable, which could affect how reliable AI is in real-world tasks like code review or search.
Contact to : xlf550402@gmail.com
Copyright © boyuanhulian 2020 - 2023. All Right Reserved.