• 3 Posts
  • 30 Comments
Joined 1 year ago
cake
Cake day: July 16th, 2023

help-circle






  • many words should run into the same issue, since LLMs generally use less tokens per word than there are letters in the word. So they don’t have direct access to the letters composing the word, and have to go off indirect associations between “strawberry” and the letter “R”

    duckassist seems to get most right but it claimed “ouroboros” contains 3 o’s and “phrasebook” contains one c.



  • is-number is a one-line function. (though it’s debatable if a function that complex should be compressed to one line)

    You may have heard of a similar if more extreme “microdependency” called is-even. When you use an NPM package, you also need all the dependencies of that package, and the dependencies of those dependencies recursively. Each package has some overhead, eventually leading to this moment in time.





  • sus@programming.devtoScience Memes@mander.xyzElsevier
    link
    fedilink
    English
    arrow-up
    3
    ·
    edit-2
    3 months ago

    I wonder if it’s common for those steganography techniques to have some mechanism for defeating the fairly simple strategy of getting 2 copies of the file from different sources, and looking at the differences between them to expose all the watermarks.

    (I’d think you would need sections of watermark that are the same for any 2 or n combinations of copies of the data, which may be pretty easy to do in many cases, though the difference makes detecting the general watermarking strategy massively easier for the un-watermarkers)