Ok, you have a moderately complex math problem you needed to solve. You gave the problem to 6 LLMS all paid versions. All 6 get the same numbers. Would you trust the answer?

  • Aatube@kbin.melroy.org
    link
    fedilink
    arrow-up
    5
    ·
    28 days ago

    this is a really weird premise. doing the same thing on 6 models is just not worth it especially when wolfram alpha exists and is far more trustable and speedy

    • FaceDeer@fedia.io
      link
      fedilink
      arrow-up
      3
      ·
      28 days ago

      If the LLMs are part of a modern framework I would expect that they should be calling out to Wolfram Alpha (or a similar specialized math-solver) via an API to get the answer for you, for that matter.

      • GrammarPolice@lemmy.world
        link
        fedilink
        arrow-up
        2
        ·
        28 days ago

        Finally an intelligent comment. So many comments in here that don’t realize most LLM’s are bundled with calculators that just do the math.

        • FaceDeer@fedia.io
          link
          fedilink
          arrow-up
          2
          ·
          27 days ago

          Anti-AI sentiment is extremely strong in every part of the Fediverse I’ve seen so far, usually my comments get downvoted heavily even when I’m just describing factual details of how it works. I expect a lot of people simply don’t bother after a while.