Gossips Today
Technology & Innovation

Will Smith eating spaghetti and other weird AI benchmarks that took off in 2024

By gossipstoday | January 1, 2025

When a company releases a new AI video generator, it’s not long before someone uses it to make a video of actor Will Smith eating spaghetti.

It’s become something of a meme as well as a benchmark: Seeing whether a new video generator can realistically render Smith slurping down a bowl of noodles. Smith himself parodied the trend in an Instagram post in February.

Google Veo 2 has done it.

We are now eating spaghett at last. pic.twitter.com/AZO81w8JC0

— Jerrod Lew (@jerrod_lew) December 17, 2024

Will Smith and pasta is but one of several bizarre “unofficial” benchmarks to take the AI community by storm in 2024. A 16-year-old developer built an app that gives AI control over Minecraft and tests its ability to design structures. Elsewhere, a British programmer created a platform where AI plays games like Pictionary and Connect 4 against each other.

It’s not like there aren’t more academic tests of an AI’s performance. So why did the weirder ones blow up?

Image Credits: Paul Calcraft

For one, many of the industry-standard AI benchmarks don’t tell the average person very much. Companies often cite their AI’s ability to answer questions on Math Olympiad exams, or figure out plausible solutions to PhD-level problems. Yet most people — yours truly included — use chatbots for things like responding to emails and basic research.

Crowdsourced industry measures aren’t necessarily better or more informative.

Take, for example, Chatbot Arena, a public benchmark many AI enthusiasts and developers follow obsessively. Chatbot Arena lets anyone on the web rate how well AI performs on particular tasks, like creating a web app or generating an image. But raters tend not to be representative — most come from AI and tech industry circles — and cast their votes based on personal, hard-to-pin-down preferences.

The Chatbot Arena interface. Image Credits: LMSYS

Ethan Mollick, a professor of management at Wharton, recently pointed out another problem with many AI industry benchmarks in a post on X: they don’t compare a system’s performance to that of the average person.

“The fact that there are not 30 different benchmarks from different organizations in medicine, in law, in advice quality, and so on is a real shame, as people are using systems for these things, regardless,” Mollick wrote.

Weird AI benchmarks like Connect 4, Minecraft, and Will Smith eating spaghetti are most certainly not empirical — or even all that generalizable. Just because an AI nails the Will Smith test doesn’t mean it’ll generate, say, a burger well.

Note the typo; there’s no such model as Claude 3.6 Sonnet. Image Credits: Adonis Singh

One expert I spoke to about AI benchmarks suggested that the AI community focus on the downstream impacts of AI instead of its ability in narrow domains. That’s sensible. But I have a feeling that weird benchmarks aren’t going away anytime soon. Not only are they entertaining — who doesn’t like watching AI build Minecraft castles? — but they’re easy to understand. And as my colleague Max Zeff wrote about recently, the industry continues to grapple with distilling a technology as complex as AI into digestible marketing.

The only question in my mind is, which odd new benchmarks will go viral in 2025?

