The floodgates have opened for building AI reasoning models on the cheap.
Researchers at Stanford and the University of Washington have developed a model that performs comparably to OpenAI o1 and DeepSeek R1 models in math and coding — for less than $50 of cloud compute credits.
What's more, the model was trained on only 1,000 questions, and training took just 26 minutes on 16 Nvidia H100 GPUs. Stanford researcher Niklas Muennighoff said in an email to Mashable that the cost is an estimate based on the GPU runtime and the number of H100 GPUs used.
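To see how an estimate like that comes together, here is a rough back-of-the-envelope sketch in Python. The GPU count and runtime come from the figures above; the hourly rental prices are hypothetical placeholders, not numbers from the researchers or any specific cloud provider.

```python
def estimate_training_cost(num_gpus: int, minutes: float, usd_per_gpu_hour: float) -> float:
    """Estimate cloud cost in USD as GPU-hours times an hourly rental rate."""
    gpu_hours = num_gpus * minutes / 60.0
    return gpu_hours * usd_per_gpu_hour


# Figures reported for s1: 16 H100 GPUs for 26 minutes (about 6.93 GPU-hours).
NUM_GPUS = 16
MINUTES = 26

# Hypothetical per-GPU hourly prices, chosen only for illustration.
for rate in (2.0, 4.0, 6.0):
    cost = estimate_training_cost(NUM_GPUS, MINUTES, rate)
    print(f"${rate:.2f}/GPU-hour -> ${cost:.2f}")

# Highest hourly rate at which the run would still cost $50 or less.
break_even_rate = 50.0 / (NUM_GPUS * MINUTES / 60.0)
print(f"Break-even rate for a $50 run: ${break_even_rate:.2f}/GPU-hour")
```

Under these assumed rates the run stays below $50 as long as an H100 rents for roughly $7 per hour or less.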
The AI industry of late is all about how new approaches to the pre- and post-training process can massively cut computing costs, as evidenced by DeepSeek's disruptive impact. On top of that, developers can now build on existing AI models at little or no cost through APIs, open-source access, and even closed-source models by distilling their data, bringing costs down even more.
According to the team's research paper, which was published last Friday, s1 was trained on a dataset consisting of "1,000 carefully curated questions paired with reasoning traces and answers distilled from Gemini Thinking Experimental." Google's Gemini Thinking Experimental model is accessible with daily limits through AI Studio. While it's a closed-source model, that clearly hasn't stopped researchers from making use of its responses.
Next, the researchers took an "off the shelf" pretrained model from the Alibaba-owned lab Qwen and performed supervised fine-tuning on the curated dataset. Then, the team created a token budget to control the amount of compute spent when testing the model. If s1 went over budget on thinking tokens, it was cut off and forced to generate whatever answer it had come up with. If the researchers wanted the model to spend more "test-time compute" on a problem, they would simply tell the model to "wait," which extended its thinking time and led to more accurate results.
By controlling the amount of time and compute spent on a problem, the researchers were able to show how increased thinking time leads to improved performance.
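The token-budget idea described above can be sketched in a few lines of Python. This is a toy illustration, not the researchers' code: `next_token` is a hypothetical stand-in for a call to a language model, and the `<end_think>` marker and `toy_model` function are invented for the example.

```python
from typing import Callable, List

END_THINK = "<end_think>"  # hypothetical marker the model emits when it wants to stop thinking
WAIT = "Wait"              # appended in place of the stop marker to prompt more thinking


def think_with_budget(next_token: Callable[[List[str]], str],
                      max_tokens: int,
                      num_waits: int = 0) -> List[str]:
    """Collect thinking tokens under a hard cap, optionally extending thinking.

    next_token is a hypothetical model call: it receives the tokens so far
    and returns the next one.
    """
    trace: List[str] = []
    waits_left = num_waits
    while len(trace) < max_tokens:  # hard cap: stop once the budget is spent
        token = next_token(trace)
        if token == END_THINK:
            if waits_left == 0:
                break  # the model finished within budget
            # The model tried to stop early: swap the stop for "Wait".
            waits_left -= 1
            trace.append(WAIT)
            continue
        trace.append(token)
    return trace  # the caller would now force the model to give its answer


def toy_model(trace: List[str]) -> str:
    """Toy stand-in: emits two 'step' tokens per round of thinking, then tries to stop."""
    since_wait = 0
    for token in reversed(trace):
        if token == WAIT:
            break
        since_wait += 1
    return "step" if since_wait < 2 else END_THINK


print(think_with_budget(toy_model, max_tokens=10))               # stops on its own
print(think_with_budget(toy_model, max_tokens=10, num_waits=1))  # one extra round of thinking
print(think_with_budget(toy_model, max_tokens=1))                # cut off by the budget
```

Raising `num_waits` buys more thinking tokens, while lowering `max_tokens` cuts the trace short, which mirrors the two controls the article describes.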
s1 is one example of the open-source reasoning models that have been developed for a fraction of the cost of flagship models from Google and OpenAI. In January, UC Berkeley researchers released an open-source reasoning model called Sky-T1 that cost $450, "demonstrating that it is possible to replicate high-level reasoning capabilities affordably and efficiently," per its blog post. There's also the open-source rStar-Math reasoning model from Microsoft Research Asia and Tulu 3 from the nonprofit research institute Ai2, and Hugging Face has its own initiative to replicate DeepSeek's R1.
As high-quality models become cheaper and more accessible, we're starting to see a power shift from the few AI heavy hitters to the many.