"take me out to the ballgame" is music related to baseball, which fits the hint they gave.
The emojis are infinitely confusing for a model to parse with current technology in relation to tokenization and embedding, and this should NOT be considered a bar for intelligence. It's a limitation of the current technology.
This is why human judgement is important.
AI is not going to take your job
the most effective individuals understand the limitations.
yeah, AI successfully accomplishes A LOT of tasks.
moreso if you build and ground your system correctly.
the successful tasks aren't important though. Those don't move the needle in any direction.
The only perception that has an effect on trust, reliably, and adoption are documented production failures.
the trust curve for use has to outweigh the potential blast radius for error in order to see the technology adopted SUCCESSFULLY. Right now, that isn't even close