The aggressive fringe of the o1 fashions is their potential to excel at reasoning. That is because of the coaching strategy that taught them to assume earlier than responding and, in consequence, allowed them to attempt completely different methods earlier than touchdown on a solution.
Subsequently, the o1 mannequin may be very robust in math, science, and coding. For instance, it scored 83% on the Worldwide Arithmetic Olympiad (IMO) qualifying examination. For comparability, GPT-4o appropriately solved solely 13% of issues. In consequence, when fixing a tough or complicated STEM drawback, this mannequin is your best choice.
Despite the fact that this efficiency may lead you to consider it’s a higher different in all case eventualities, that isn’t the case. As a result of o1 is an early mannequin, it lacks key ChatGPT options, akin to web shopping and accepting media uploads, and in consequence, for now, GPT-4o stays the higher possibility for many requests.
ChatGPT Plus and Groups customers can toggle between the completely different fashions relying on the immediate, so they do not need to decide to utilizing one or the opposite. To make it simpler to determine which to make use of, OpenAI is engaged on a function that enables ChatGPT to decide on the perfect mannequin robotically based mostly on person prompts.