Close Menu
Gossips Today
  • Tech & Innovation
  • Healthcare
  • Personal Finance
  • Lifestyle
  • Travel
  • Business
  • Recipes
What's Hot

This Airline Route to Europe Was Just Revived After a 16-year Pause—and I Snagged a Seat On the First Flight

What to know about the CBO—the office calling out Trump’s tax bill

Bolttech closes Series C at $147M with a $2.1B valuation to bolster its embedded insurance offerings

Facebook X (Twitter) Instagram
Wednesday, June 4
Gossips Today
Facebook X (Twitter) Instagram
  • Tech & Innovation

    Bolttech closes Series C at $147M with a $2.1B valuation to bolster its embedded insurance offerings

    June 4, 2025

    Windsurf says Anthropic is limiting its direct access to Claude AI models

    June 4, 2025

    Adobe launches beta version of its Photoshop app on Android

    June 3, 2025

    Valla raises $2.7M to make legal recourse more accessible to employees

    June 3, 2025

    For the love of God, stop calling your AI a co-worker

    June 2, 2025
  • Healthcare

    Trump administration names national coordinator for health IT

    June 4, 2025

    Trump administration rescinds Biden-era guidance protecting access to emergency abortions

    June 4, 2025

    Share of physicians working in private practice continues to fall: AMA

    June 3, 2025

    HHS releases more detailed 2026 budget disclosing scope of cuts

    June 3, 2025

    Abortion providers could face more prosecution under Trump, experts say

    June 2, 2025
  • Personal Finance

    16 Budgeting Tips to Manage Your Money Better

    May 28, 2025

    How to Stick to a Budget

    May 20, 2025

    4 Steps to Navigate Marriage and Debt

    May 11, 2025

    Buying a Fixer-Upper Home: What to Know

    May 10, 2025

    How to Talk to Your Spouse About Money

    May 10, 2025
  • Lifestyle

    16 Father’s Day Gift Ideas He (or You) Will Love

    June 4, 2025

    The Getup: Sand

    May 25, 2025

    Your Summer Style Starts Here: 17 Memorial Day Sale Picks to Grab Now + 4 Getups

    May 24, 2025

    3 Fixes If You Hate the Way Your Pants Fit (That Have Nothing to Do with Your Waist Size)

    May 14, 2025

    On Sale Now: 9 Nike Sneakers Under $100 You’ll Want to Wear All Summer

    May 10, 2025
  • Travel

    This Airline Route to Europe Was Just Revived After a 16-year Pause—and I Snagged a Seat On the First Flight

    June 4, 2025

    This Caribbean Island Is Famous For Beautiful Beaches and All-inclusive Resorts—and It’s the Birthplace of Reggae and Jerk Cooking

    June 4, 2025

    This Popular European Country Just Got a Heightened Travel Advisory Over Terrorism—What Travelers Should Know

    June 3, 2025

    I Bought a Home in One of the Hottest U.S. Neighborhoods—Here's What It's Like to Live There

    June 3, 2025

    This Country Will Now Fine Airline Passengers for Standing Up Too Early After Landing

    June 2, 2025
  • Business

    What to know about the CBO—the office calling out Trump’s tax bill

    June 4, 2025

    Sarah Spain on the future of sports media and women’s leagues

    June 4, 2025

    China’s critical mineral export ban gets pushback from global auto industry

    June 3, 2025

    5 read-it-later alternatives now that Pocket is shutting down

    June 3, 2025

    Trump administration asks Supreme Court to allow federal layoffs

    June 2, 2025
  • Recipes

    one-pan ditalini and peas

    May 29, 2025

    eggs florentine

    May 20, 2025

    challah french toast

    May 6, 2025

    charred salt and vinegar cabbage

    April 25, 2025

    simplest brisket with braised onions

    April 2, 2025
Gossips Today
  • Tech & Innovation
  • Healthcare
  • Personal Finance
  • Lifestyle
  • Travel
  • Business
  • Recipes
Technology & Innovation

OpenAI launches program to design new ‘domain-specific’ AI benchmarks

gossipstodayBy gossipstodayApril 9, 2025No Comments2 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Openai launches program to design new 'domain specific' ai benchmarks
Share
Facebook Twitter LinkedIn Pinterest Email

OpenAI thinks AI benchmarks are broken. Now the company is launching a program to fix how AI models are scored.

The new OpenAI Pioneers Program will focus on creating evaluations for AI models that “set the bar for what good looks like,” as OpenAI phrased it in a blog post.

“As the pace of AI adoption accelerates across industries, there is a need to understand and improve its impact in the world,” the company continued in its post. “Creating domain-specific evals are one way to better reflect real-world use cases, helping teams assess model performance in practical, high-stakes environments.”

As the recent controversy with the crowdsourced benchmark LM Arena and Meta’s Maverick model illustrate, it’s tough to know, these days, precisely what differentiates one model from another. Many widely-used AI benchmarks measure performance on esoteric tasks, like solving doctorate-level math problems. Others can be gamed, or don’t align well with most people’s preferences.

Through the Pioneers Program, OpenAI hopes to create benchmarks for specific domains like legal, finance, insurance, healthcare, and accounting. The lab says that, in the coming months, it’ll work with “multiple companies” to design tailored benchmarks and eventually share those benchmarks publicly, along with “industry-specific” evaluations.

“The first cohort will focus on startups who will help lay the foundations of the OpenAI Pioneers Program,” OpenAI wrote in the blog post. “We’re selecting a handful of startups for this initial cohort, each working on high-value, applied use cases where AI can drive real-world impact.”

Companies in the program will also have the opportunity to work with OpenAI’s team to create model improvements via reinforcement fine tuning, a technique that optimizes models for a narrow set of tasks, OpenAI says.

The big question is whether the AI community will embrace benchmarks whose creation was funded by OpenAI. OpenAI has supported benchmarking efforts financially before, and designed its own evaluations. But partnering with customers to release AI tests may be seen as an ethical bridge too far.

benchmarks Design domainspecific launches OpenAI program
Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleMedicaid cuts unpopular with Trump voters: poll
Next Article BTC price: Bitcoin, Ethereum, crypto coins rally on 90-day Trump tariff pause
admin
gossipstoday
  • Website

Related Posts

Bolttech closes Series C at $147M with a $2.1B valuation to bolster its embedded insurance offerings

June 4, 2025

Windsurf says Anthropic is limiting its direct access to Claude AI models

June 4, 2025

Adobe launches beta version of its Photoshop app on Android

June 3, 2025
Leave A Reply Cancel Reply

Demo
Trending Now

This Airline Route to Europe Was Just Revived After a 16-year Pause—and I Snagged a Seat On the First Flight

What to know about the CBO—the office calling out Trump’s tax bill

Bolttech closes Series C at $147M with a $2.1B valuation to bolster its embedded insurance offerings

Trump administration names national coordinator for health IT

Latest Posts

This Airline Route to Europe Was Just Revived After a 16-year Pause—and I Snagged a Seat On the First Flight

June 4, 2025

What to know about the CBO—the office calling out Trump’s tax bill

June 4, 2025

Bolttech closes Series C at $147M with a $2.1B valuation to bolster its embedded insurance offerings

June 4, 2025

Subscribe to News

Subscribe to our newsletter and stay updated with the latest news and exclusive offers.

Advertisement
Demo
Black And Beige Minimalist Elegant Cosmetics Logo (4) (1)
Facebook X (Twitter) Pinterest Vimeo WhatsApp TikTok Instagram

Categories

  • Tech & Innovation
  • Health & Wellness
  • Personal Finance
  • Lifestyle & Productivity

Company

  • About Us
  • Contact Us
  • Advertise With Us

Services

  • Privacy Policy
  • Terms & Conditions
  • Disclaimer

Subscribe to Updates

© 2025 Gossips Today. All Right Reserved.

Type above and press Enter to search. Press Esc to cancel.