It’s still an LLM, right? I’m going to have to take issue with your use of the word ‘reasoning’ here.
OK, it just spits out predicted tokens, but it does so in answer to what you asked and is sensitive to the context you provided. Its predictions are arranged such that, when you decode them into language, they present evidence or arguments of the kind used in thinking or argumentation. It also forms conclusions and inferences and produces results to problems, if you allow me to recycle from a dictionary definition of “reasoning”. It’s not perfect, obviously you can’t cram a huge amount into a 16b distillation, and it certainly can get things wrong, but you have to squint not to see reasoning when you ask it to guesstimate something or solve a mathematical problem. It is an LLM, but there’s reasoning coming out?