AniworldAI

Artist

  • fanketchup 2

Copyright

  • original 1.4M

General

  • 4koma 109k
  • animal ears 1.5M
  • blindfold 26k
  • cat ears 358k
  • cat girl 131k
  • chinese text 42k
  • comic 641k
  • english text 335k
  • greyscale 604k
  • holding 1.9M
  • holding paper 13k
  • humanization 12k
  • long hair 5.4M
  • meta 1.3k
  • monochrome 752k
  • paper 38k
  • parody 102k
  • short hair 2.8M
  • simple background 2.4M
  • simplified chinese text 2.8k
  • speech bubble 395k
  • white background 2.0M

Meta

  • chinese commentary 247k
  • commentary 2.5M
  • highres 6.9M
  • translated 613k

Information

  • ID: 9793920
  • Uploader: march happy
  • Date: 6 months ago
  • Approver: NiceLittleDan
  • Size: 256 KB .jpg (1080x1920)
  • Source: twitter.com/FanKetchup/status/1954493099536953592
  • Rating: Sensitive
  • Score: 3
  • Favorites: 3
  • Status: Active

original drawn by fanketchup

Artist's commentary

    I drew some pictures to briefly explain why LLMs can't tell how many fingers are in an image.

  • Comments
  • march happy
    6 months ago

    Explanation:

    Imagine you show a picture to the AI. Instead of "seeing" it the way you do, the AI turns the picture into a long list of numbers that summarize its patterns, colors, shapes, and features.
    This list is called an embedding; it works like a "fingerprint" of the image.
    The LLM itself doesn't look at the raw pixels; it only reads that fingerprint and works from there.
    So if the fingerprint didn't record something clearly (like exactly how many fingers there are), the LLM has no reliable way to recover it and can only guess.
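
To make the "fingerprint" idea concrete, here is a minimal toy sketch in Python. It is not how a real image encoder works: the "embedding" below is just the average brightness of a few vertical strips, and every name in it (`make_hand`, `count_fingers`, `toy_embedding`) is made up for illustration. The only point is that a compressed summary can be identical for two pictures with different finger counts, so anything that reads only the summary cannot tell them apart.

```python
def make_hand(width, height, finger_columns):
    """Build a tiny black-and-white image: 1 where a 'finger' is drawn, else 0."""
    return [[1 if x in finger_columns else 0 for x in range(width)]
            for _ in range(height)]


def count_fingers(image):
    """Count fingers from the raw pixels: each run of lit columns is one finger."""
    count, previous = 0, 0
    for value in image[0]:
        if value == 1 and previous == 0:
            count += 1
        previous = value
    return count


def toy_embedding(image, n_regions=2):
    """Toy 'fingerprint': average brightness of n_regions vertical strips.

    Assumption for illustration only; real embedding models are learned
    neural networks, not simple averages.
    """
    height, width = len(image), len(image[0])
    strip = width // n_regions
    embedding = []
    for r in range(n_regions):
        lit = sum(image[y][x]
                  for y in range(height)
                  for x in range(r * strip, (r + 1) * strip))
        embedding.append(lit / (strip * height))
    return embedding


# Four thin fingers versus two thick fingers, same amount of "ink" per strip.
four_thin = make_hand(16, 4, {1, 5, 9, 13})
two_thick = make_hand(16, 4, {1, 2, 9, 10})

print(count_fingers(four_thin), toy_embedding(four_thin))
print(count_fingers(two_thick), toy_embedding(two_thick))
```

With the raw pixels the two hands are easy to tell apart, but both produce the same toy fingerprint, so a reader that only gets the fingerprint has lost the count. Real embeddings keep far more information than this, yet the same kind of loss can happen for fine details.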

    It’s a human hand. Fair skin, fingers slender, nails trimmed without nail polish, blood vessels faintly visible...
    Description complete!
    LLM-chan! The user sent in a picture and let me help you see it...

    LLM, please listen to the question!

    <prompt> You are a catgirl assistant. Based on the image description provided by the user, answer the following question... </prompt>

    Image Embedding Model: Because LLMs can't natively see a picture, image embedding turns the picture into a bunch of numbers that capture its gist, but not every tiny detail.

    How many fingers did the user put in?
