• dantheclamman@lemmy.worldOP
    link
    fedilink
    English
    arrow-up
    6
    ·
    9 months ago

    We are barreling towards this issue. StackOverflow for example has crashing viewer numbers. But an AI isn’t going to help users navigate and figure out a new python library for example, without data to train on. I’ve already had AIs straight up hallucinate about functions in R that actually don’t exist. It seems to happen primarily in the newer libraries, probably with fewer posts on stackexchange about them

    • GenderNeutralBro@lemmy.sdf.org
      link
      fedilink
      English
      arrow-up
      3
      arrow-down
      2
      ·
      9 months ago

      AI isn’t going to help users navigate and figure out a new python library for example

      Current AI will not. Future AI should be able to as long as there is accurate documentation. This is the natural direction for advancement. The only way it doesn’t happen is if we’ve truly hit the plateau already, and that seems very unlikely. GPT-4 is going to look like a cheap toy in a few years, most likely.

      And if the AI researchers can’t crack that nut fast enough, then API developers will write more machine-friendly documentation and training functions. It could be as ubiquitous as unit testing.

      • FaceDeer@kbin.social
        link
        fedilink
        arrow-up
        0
        ·
        9 months ago

        Current AI can already “read” documentation that isn’t part of its training set, actually. Bing Chat, for example, does websearches and bases its answers in part on the text of the pages it finds. I’ve got a local AI, GPT4All, that you can point at a directory full of documents and tell “include that in your context when answering questions.” So we’re we’re already getting there.

        • GenderNeutralBro@lemmy.sdf.org
          link
          fedilink
          English
          arrow-up
          1
          ·
          9 months ago

          Getting there, but I can say from experience that it’s mostly useless with the current offerings. I’ve tried using GPT4 and Claude2 to give me answers for less-popular command line tools and Python modules by pointing them to complete docs, and I was not able to get meaningful answers. :(

          Perhaps you could automate a more exhaustive fine-tuning of an LLM based on such material. I have not tried that, and I am not well-versed in the process.

          • sanguine_artichoke@midwest.social
            link
            fedilink
            English
            arrow-up
            1
            ·
            9 months ago

            What about Github Copilot? It has tons of material available for training. Of course, it’s not necessarily all bug-free or well written.