Best Podcast Episodes About Nvidia
Everything podcasters are saying about Nvidia — curated from top podcasts
Updated: Apr 26, 2026 – 53 episodes
Listen to the Playlist
Ridealong has curated the best and most interesting podcasts and clips about Nvidia.
Top Podcast Clips About Nvidia
“… was using i mean it's from super microsoft co-founder rather not micro strategy super microsoft co-founder smuggling 2.5 billion dollars worth of nvidia chips to china through a middleman and then they were doing like fake paperwork and using a hairdryer to take off the serial numbers and replace them with the model numbers i mean and and then talking about it on social media there was a video going around where he was talking about it any take on the brazen insanity of this jake and people are talking about it in”
“… asked chat gpt what's a corny name for an inference thing infranciania it sounds like a spell from harry potter or something so what do we think of the ban i don't know if you saw this uh jake the ban on chips in china and then the micro strategy ceo was using i mean it's from super microsoft co-founder rather not micro strategy super microsoft co-founder smuggling 2.5 billion dollars worth of nvidia chips to china through a middleman and then they were doing like fake paperwork and using a hairdryer to take off the serial numbers and replace them with the model numbers i mean and and then talking about it on social media there was a video going around where he was talking about it any take on the brazen insanity of this jake and people are talking about it in”
View more
Ridealong summary
In a shocking turn of events, a Microsoft co-founder was caught smuggling $2.5 billion worth of Nvidia chips to China, using fake paperwork and even a hairdryer to alter serial numbers. This brazen act has sparked discussions about the implications of chip bans and the lengths individuals will go to circumvent them. The audacity of this scheme, shared on social media, highlights the growing tensions in the global tech market.
“I'm old enough to remember when a trillion dollar market cap was a big deal. And now here we are, AI is booming, and NVIDIA CEO Jensen Huang has kicked off the company's annual GTC conference with a massive prediction that the company will see a trillion dollars in revenue between now and 2027. At every GTC, Jensen's keynote, which is planned but not fully scripted, is the big event. This one was no exception. It was two and a half hours long, totally jam-packed with big announcements. We got confirmation of the new Grok-powered server focused on inference. The new …”
“I'm old enough to remember when a trillion dollar market cap was a big deal. And now here we are, AI is booming, and NVIDIA CEO Jensen Huang has kicked off the company's annual GTC conference with a massive prediction that the company will see a trillion dollars in revenue between now and 2027. At every GTC, Jensen's keynote, which is planned but not fully scripted, is the big event. This one was no exception. It was two and a half hours long, totally jam-packed with big announcements. We got confirmation of the new Grok-powered server focused on inference. The new rack-mounted system will combine 256 Grok chips with 72 NVIDIA Rubin GPUs, delivering 35 times the inference efficiency of current generation Blackwell chips, with the system expected to ship in the second half of this year. Jensen also unveiled a new Gen.AI system that can enhance video game graphics on the fly. Called DLSS5, the technology …”
View more
Ridealong summary
Nvidia's trillion-dollar revenue projection is a bold signal of unprecedented demand and growth potential, positioning the company alongside giants like Walmart and Amazon.
Nvidia's forecast of a trillion dollars in revenue by 2027 signals unparalleled growth driven by massive demand for AI technologies.
OpenClaw represents a pivotal shift in AI, making agents viable and prompting a new era of digital opportunities, but also raises concerns about security and the fundamental restructuring of software businesses.
OpenClaw represents AI's second moment, making agents viable for business, but raises concerns about security and the fundamental shift in software and digital business operations.
Nvidia's forecast of reaching a trillion dollars in revenue by 2027 signals unprecedented growth driven by massive computing demand, positioning it alongside giants like Walmart and Amazon.
Nvidia's forecast of reaching a trillion dollars in revenue by 2027 signals unparalleled growth driven by soaring demand for AI computing and inference.
OpenClaw represents a transformative shift in AI capabilities, but its broad access to personal data is both useful and terrifying.
“… blackwell in addition to ruben they also teased fineman already even though ruben is months to years away from actually being deployed at scale so nvidia is essentially 18 months give or take a few ahead of what the current reality looks like. And I think this is really important to note, is currently with the bleeding edge of AI, we're running Blackwell right now. And we just started running Blackwell. And Blackwell has about 12 months of improvements to be made before we start to feel the effects of Rubin. By the time we feel the effects of Rubin, which is that 10x performance per watt …”
“… it's so exciting because there's such a clear path to going to where i think every ai lab wants to go yes getting to that agi level and beyond and this chart that we're showing on screen here is a beautiful example of this because in addition to blackwell in addition to ruben they also teased fineman already even though ruben is months to years away from actually being deployed at scale so nvidia is essentially 18 months give or take a few ahead of what the current reality looks like. And I think this is really important to note, is currently with the bleeding edge of AI, we're running Blackwell right now. And we just started running Blackwell. And Blackwell has about 12 months of improvements to be made before we start to feel the effects of Rubin. By the time we feel the effects of Rubin, which is that 10x performance per watt improvement, they already have Feynman ready to go and to be deployed into these data centers. And already we have two incremental steps, two exponential steps ahead of where we currently sit. and it's hard to imagine that with the build-out that's happening with the performance per watt increase that we're seeing from all these chipsets that we're not …”
View more
Ridealong summary
Nvidia's trillion-dollar revenue projection is grounded in their strategic foresight and readiness to deploy future chip architectures like Feynman, ensuring sustained AI growth.
Nvidia's DLSS 5 technology is a game-changer for graphics, but faces backlash from the gaming community who view it as AI slop rather than a genuine enhancement.
Nvidia's strategic advancements in AI chip technology position it for exponential growth, suggesting that the company's future is already secured through its innovative roadmap.
Nvidia's aggressive roadmap with AI chips like Blackwell, Rubin, and Feynman positions it for exponential growth, making it a frontrunner in the AI sector.
Nvidia's future AI chip architectures are set to drive exponential growth in AI computing, solidifying its dominance and justifying its trillion-dollar valuation.
“… of the progress we've seen in 2025 was that supported context sizes are longer. So it really depends, but there are even like open weight LLMs like Nvidia Nemotron that can do up to one million tokens. Of course, it's going to be more expensive. You need more GPU power for that. But I think even like, you know, Chachapiti Online, the version, I think it can do 100,000 to 100,000 tokens. And I think that's about the size. I mean, I might be wrong, but I think it's about the size of one of the Harry Potter books, the first one or something. It's a long context. And so for many people, this is …”
“… answer, basically. It's not perfect because you are chunking the document and it's not the full context. You have always little chunks. But one of the I wouldn't say breakthroughs, because it's more like a continuous development, but one of the parts of the progress we've seen in 2025 was that supported context sizes are longer. So it really depends, but there are even like open weight LLMs like Nvidia Nemotron that can do up to one million tokens. Of course, it's going to be more expensive. You need more GPU power for that. But I think even like, you know, Chachapiti Online, the version, I think it can do 100,000 to 100,000 tokens. And I think that's about the size. I mean, I might be wrong, but I think it's about the size of one of the Harry Potter books, the first one or something. It's a long context. And so for many people, this is actually sufficient. So you don't need any specific fancy application around the LLM to process that. You just put it in there. And there is the problem of it's called like the needle in the haystack problem where what people found, though, I mean, there are multiple problems, but there's also something related to attention sinks where the LLM kind of …”
View more
Ridealong summary
Using Retrieval Augmented Generation (RAG) can transform how organizations handle proprietary data with LLMs. By chunking documents into manageable pieces and querying them, companies can effectively retrieve relevant information without overwhelming the system. This method not only saves costs but also enhances the accuracy of responses, making it a game-changer for fields like law and healthcare.
“… of Supermicro stock, still risked it all, and now he's facing 30 years in federal prison. That is crazy. who's charged with smuggling billions in NVIDIA servers to China, use Southeast Asian's Shell Company to funnel two and a half billion in servers to Chinese buyers, 500 million worth shipped in just three weeks in spring of 2025. That's a lot. Two and a half billion in servers feels like enough for like a frontier training run. Like that's a big, big, that's a big push. Built thousands of fake dummy servers to fool U.S. compliance auditors, caught on surveillance camera using a hairdryer to …”
“… to enforce our export control laws to protect that advantage. So the company, SMIC, Supermicro Computer Inc. They got a co-founder. They caught him red-handed. James Wally, he was arrested today or yesterday. He personally holds half a billion dollars of Supermicro stock, still risked it all, and now he's facing 30 years in federal prison. That is crazy. who's charged with smuggling billions in NVIDIA servers to China, use Southeast Asian's Shell Company to funnel two and a half billion in servers to Chinese buyers, 500 million worth shipped in just three weeks in spring of 2025. That's a lot. Two and a half billion in servers feels like enough for like a frontier training run. Like that's a big, big, that's a big push. Built thousands of fake dummy servers to fool U.S. compliance auditors, caught on surveillance camera using a hairdryer to swap serial number stickers. And so OX Gigi says, this man is a billionaire and was removing labels with a hairdryer personally. You're simply not grinding hard enough. There's always a grind set lesson in any story like this. Let's take this over to LinkedIn. It's also notable because there's the export tax or tip that you're trying to bring chips …”
View more
Ridealong summary
A billionaire co-founder of Supermicro faces 30 years in prison for attempting to smuggle $2.5 billion worth of NVIDIA servers to China. He was caught using fake documents and even a hairdryer to swap serial numbers on dummy servers to evade U.S. export laws. This shocking case highlights the lengths individuals will go to for profit, risking everything in the process.
“… charles liang of super micro just standing my personal opinion and this this is an opinion but okay my opinion is that there is no way that nvidia couldn't have known this was happening i i just think it's like actually impossible that they were not aware of gpu smuggling but wouldn't they don't these gpus have some mechanism of phoning home oh for sure they can be they can be identified through the internet 100 i mean yeah they they have the firmware has identifying information in it for the gpus you don't need the serial number on the sticker to know what it is you know you don't have …”
“… uh serial numbers and the stickers then using a a middle in logistics company to basically repackage and forward them and this story this story's just kind of disappeared already yeah like there is a picture where jensen huang is standing next to co-founder charles liang of super micro just standing my personal opinion and this this is an opinion but okay my opinion is that there is no way that nvidia couldn't have known this was happening i i just think it's like actually impossible that they were not aware of gpu smuggling but wouldn't they don't these gpus have some mechanism of phoning home oh for sure they can be they can be identified through the internet 100 i mean yeah they they have the firmware has identifying information in it for the gpus you don't need the serial number on the sticker to know what it is you know you don't have to physically see it to know what it is and so yeah if this is like connecting to some kind of server somewhere or whatever it may be doing uh if in any way that's linked back to nvidia they would know about it if there's ever a service request i would assume at some point someone figures out and maybe it's a okay we'll kind of you know blind eye …”
View more
Ridealong summary
NVIDIA likely knew about the ongoing GPU smuggling operations, raising questions about their corporate oversight. As high-value GPUs were trafficked, the implications for the tech industry and media coverage reveal a troubling narrative of complicity and ignorance. This episode dives into the murky waters of corporate accountability and the hidden dynamics behind data center construction.
“… can't be $100 forever, and Trump will probably backpedal in the next few weeks ahead of the Trump. So let's recap a few of the key stories around NVIDIA. We just came off of GTC, and there's a lot going on at the company. I mean, it's a huge company. Maybe it'd be good to start with just next generation chips, changes to strategy, what people are actually buying. Maybe that means Grace CPU standalone sales or the development with the Grok partnership. What's sticking out just on the actual AI product side to you that you're most excited about? Well, inference demand is exploding, driven by the …”
“with the Iran war. Things will eventually subside. Oil can't be $100 forever, and Trump will probably backpedal in the next few weeks ahead of the Trump. So let's recap a few of the key stories around NVIDIA. We just came off of GTC, and there's a lot going on at the company. I mean, it's a huge company. Maybe it'd be good to start with just next generation chips, changes to strategy, what people are actually buying. Maybe that means Grace CPU standalone sales or the development with the Grok partnership. What's sticking out just on the actual AI product side to you that you're most excited about? Well, inference demand is exploding, driven by the AI agents and coding assistants. I met with Ian Buck, I met with dozens of engineers at Meta, Google, NVIDIA, and all of them are seeing crazy inference demand and AI compute shortages. So across the board, people are in crazy clamoring need for AI. And we're, I mean, yeah, you're seeing that from talking to engineering leaders at big tech …”
View more
Ridealong summary
Nvidia is perfectly positioned to capitalize on the exploding demand for AI inference and computing, with strategic moves like acquiring Grok and securing supply agreements ahead of time.
“… it opens new horizons, new vistas, new avenues and new opportunities. And no one is experiencing this more. No firm is experiencing this more than NVIDIA. Last year, the CFO said we've got about 500 billion dollars of committed orders. This is effectively promised revenue. It's orders that investors and the markets can see. And it's a really good measure of the health of a company like NVIDIA. The semiconductors are highly cyclical businesses. They're prone to booms and they're prone to busts. And being able to see an order book out like that is a real sign of health. Well, this year we learned …”
“… which is what produces manufactured intelligence, that thing that we call artificial intelligence. Well, that's also effectively infinite. And we're experiencing this within exponential view that the more we use, the more we need to use because it opens new horizons, new vistas, new avenues and new opportunities. And no one is experiencing this more. No firm is experiencing this more than NVIDIA. Last year, the CFO said we've got about 500 billion dollars of committed orders. This is effectively promised revenue. It's orders that investors and the markets can see. And it's a really good measure of the health of a company like NVIDIA. The semiconductors are highly cyclical businesses. They're prone to booms and they're prone to busts. And being able to see an order book out like that is a real sign of health. Well, this year we learned at GTC that the scale of the committed orders was now a trillion dollars for Blackwell's, for the new Vera Rubin products, just out through to 2027. And the thing to understand is, yes, NVIDIA maintains a dominant market share, but it's not 100% of the market. There's also competition and supply from Google's TPUs, from Amazon's Tranium, from AMD …”
View more
Ridealong summary
The demand for AI and compute is effectively infinite, driving Nvidia's growth and showcasing its market leadership with a trillion dollars in committed orders.
Nvidia's trillion-dollar order book reflects the infinite demand for compute power, underscoring its market health and dominance despite competition.
The shift to reasoning models and increased AI usage is driving a million-fold expansion in compute demand, underscoring Nvidia's critical role in meeting this explosive growth.
“… Bloomberg and software and whether AI will eat software. And I want to talk a bit more about that in respect to what you learned this week at the NVIDIA conference. Before we get into those specific topics, what was it like? Describe the scene. The energy at NVIDIA GTC is very unique and it's kind of electric. It takes over the entire city of San Jose, 30,000 people. I interviewed the mayor of San Jose last year about this. And he was like, the coffee shops make their entire year's rent in the course of a couple days. Of course, that means if you want to get a cup of coffee, you are waiting in …”
“… that makes updates around your schedule not in the middle of it. They don't build tech for tech's sake. They build it for you. Find technology for the way you work at dell.com slash xps. Built for you. So before the break we were talking about Bloomberg and software and whether AI will eat software. And I want to talk a bit more about that in respect to what you learned this week at the NVIDIA conference. Before we get into those specific topics, what was it like? Describe the scene. The energy at NVIDIA GTC is very unique and it's kind of electric. It takes over the entire city of San Jose, 30,000 people. I interviewed the mayor of San Jose last year about this. And he was like, the coffee shops make their entire year's rent in the course of a couple days. Of course, that means if you want to get a cup of coffee, you are waiting in line for an hour. You're waiting in line for an hour for everything, to get into panels, to get into buildings, to get coffee, to get a granola bar. You're out there in the wilderness. You have to fight to get a sandwich. The four trillion company doesn't provide any free coffee and sandwiches No I mean it like it wild though I mean it does like …”
View more
Ridealong summary
OpenClaw represents a paradigm shift in AI interaction, moving from real-time agent collaboration to asynchronous, autonomous task execution.
OpenClaw represents a paradigm shift in AI interaction, moving from real-time agent prompts to asynchronous, autonomous task execution.
“… And so we were down, we were down there for a while and, and, uh, we clawed our way back slowly, but we carried CUDA on G-Force. I always say that NVIDIA is the house that G-Force built because it was GeForce that took CUDA out to everybody. Researchers, scientists, they discovered CUDA on GeForce because they were all, you know, many of them were gamers.”
“… $8 billion or something, $6-7 billion or something like that. after we launched CUDA, I recognized that it was going to add so much cost, but it was something we believed in. You know, our market cap went down to like one and a half billion dollars. And so we were down, we were down there for a while and, and, uh, we clawed our way back slowly, but we carried CUDA on G-Force. I always say that NVIDIA is the house that G-Force built because it was GeForce that took CUDA out to everybody. Researchers, scientists, they discovered CUDA on GeForce because they were all, you know, many of them were gamers.”
View more
Ridealong summary
AI's rapid growth is not hindered by power constraints due to advancements in energy efficiency and extreme co-design, which will continue to drive down token costs.
“So this is leading to a transformation of the AI industry. In fact, NVIDIA CEO Jensen Huang thinks that we're at an inflection point. So let's talk about what this means for the AI economy and why Jensen Huang thinks that the rise of AI agents could add an extra $500 billion to NVIDIA's revenues. So we're starting to see the rise of agentic AI. So let's talk about why investors are paying attention. See, for the last few years, AI has been about training large language models. That required tens of thousands of GPUs. …”
“So this is leading to a transformation of the AI industry. In fact, NVIDIA CEO Jensen Huang thinks that we're at an inflection point. So let's talk about what this means for the AI economy and why Jensen Huang thinks that the rise of AI agents could add an extra $500 billion to NVIDIA's revenues. So we're starting to see the rise of agentic AI. So let's talk about why investors are paying attention. See, for the last few years, AI has been about training large language models. That required tens of thousands of GPUs. It consumed enormous amounts of energy. These giant data centers packed with GPUs ran 24-7 for weeks or months at a time to train these models. Companies like OpenAI, Anthropic, Meta, and Google have been spending tens of billions of dollars on training AI models for the last few years. But now the focus is shifting from training AI models to …”
View more
Ridealong summary
The rise of AI agents and the shift to inference could significantly boost Nvidia's revenues by $500 billion, as these developments demand more computing power and specialized hardware.
AI agent orchestration could add an extra $500 billion to NVIDIA's revenues, marking a significant shift in the AI economy towards inference over training.
The rise of agentic AI and the shift to inference will exponentially increase demand for compute, potentially adding $500 billion to NVIDIA's revenues.
The rise of agentic AI and the shift to inference will significantly boost Nvidia's revenues, potentially adding $500 billion.
The shift from AI model training to inference is set to significantly boost Nvidia's revenue, with agentic AI driving demand for more computing power and specialized hardware.
The rise of AI agents and the shift to inference will significantly boost Nvidia's revenues, marking a pivotal transformation in the AI industry.
The rise of AI agents and the shift to inference will significantly boost Nvidia's revenues, potentially adding $500 billion, as the demand for compute power and specialized hardware increases.
The shift from AI training to inference is set to significantly boost Nvidia's revenues, with agentic AI driving demand for new types of computing power.
“… Jensen Wang has actually been pretty consistent. He's actually been cautious not to guide too far into the future. Up until just six months ago, NVIDIA was only guiding one quarter at a time. Six months ago, he took a departure to say, you know, I think there's going to be half a trillion dollars of Blackwell and Rubin by the end of 26 or in the next 18 months, which is slightly past 26. And he pointed to that and said, look, here I am. Five months later, I'm telling you that number is too low. I need to take that to a trillion dollars if I extend it to 2027. So he's being thoughtful and he, …”
“… people. Elon Musk has a long, illustrious track record of over-promising and compressing timelines, even when he knows it's going to take a lot longer. Even though he usually delivers on the promises eventually, the timeline gets extended a lot. Jensen Wang has actually been pretty consistent. He's actually been cautious not to guide too far into the future. Up until just six months ago, NVIDIA was only guiding one quarter at a time. Six months ago, he took a departure to say, you know, I think there's going to be half a trillion dollars of Blackwell and Rubin by the end of 26 or in the next 18 months, which is slightly past 26. And he pointed to that and said, look, here I am. Five months later, I'm telling you that number is too low. I need to take that to a trillion dollars if I extend it to 2027. So he's being thoughtful and he, at least in the way he's communicated and clarified that statement, feels like he's under promising. He clarified this doesn't include CPUs or the Grok chips or the networking equipment or Ruben Ultra. So he actually said it's more than a trillion. So from his perspective, this is a number that he has high visibility into because he has these orders …”
View more
Ridealong summary
Despite NVIDIA's CEO Jensen Huang projecting over $1 trillion in revenue from data centers by 2027, investors remain unconvinced. This skepticism arises from concerns about the market's future growth and the reality of returns on massive investments in data infrastructure. The conflicting signals suggest a cautious outlook on the data center buildout, even as AI promises transformative effects.
“… and yeah yeah yeah yeah um but i don't know tool making is a special special industry. Well, speaking of Dylan Patel, Semi Analysis was featured at NVIDIA GTC. The inference king has been crowned. NVIDIA won a massive belt, and it looks like Jensen's holding it up. He is, in fact, standing in front of an LED wall or projector, but a beautiful thing to see the Semi Analysis logo on the big screen. It's all WWE. The entire world is WWE. NVIDIA Extreme co-design revolutionized token cost. The GBNBL72 is the inference king with 50x higher performance per watt on inference X by semi-analysis. 35x …”
“… in an order for a tool or is he going to do terra asml like because there like the supply chain is what 10 000 companies i mean historically they've done they've gone super super early in the supply chain right they've been like you know mining stuff and yeah yeah yeah yeah um but i don't know tool making is a special special industry. Well, speaking of Dylan Patel, Semi Analysis was featured at NVIDIA GTC. The inference king has been crowned. NVIDIA won a massive belt, and it looks like Jensen's holding it up. He is, in fact, standing in front of an LED wall or projector, but a beautiful thing to see the Semi Analysis logo on the big screen. It's all WWE. The entire world is WWE. NVIDIA Extreme co-design revolutionized token cost. The GBNBL72 is the inference king with 50x higher performance per watt on inference X by semi-analysis. 35x lower cost. Very, very exciting. And congrats to Jensen for becoming the inference king and winning the inference max award or inference X as it's now known. Jensen is also confirming what we see in our GPU availability data. There is an epic scramble for compute B200 basically unavailable availability for GH, Grace Hopper, 200, H200, and A100 also …”
View more
Ridealong summary
Nvidia's DLSS 5 technology is set to revolutionize visual fidelity in video games, making them more realistic than ever before.
Nvidia's dominance in AI inference is solidified as it achieves unprecedented performance and cost efficiency, underscoring its pivotal role in meeting the surging demand for AI computing.
Ridealong summary
DLSS 4.5 introduces a revolutionary feature that allows gamers to target their monitor's refresh rate for frame generation, enhancing performance consistency. This change addresses a critical flaw in previous versions where frame generation was unpredictable, leading to potential lag. With this update, gamers can finally enjoy smoother gameplay that meets their display's capabilities.
“… what the numbers were for a memory at sk hynex and samsung and so forth but um if you look at so think about how the neocloud business works and how NVIDIA works with that or how the RL environment business works and how Anthropic works with that. In both those cases, NVIDIA is purposely trying to fracture the complementary industry to make sure that they have as much leverage as possible. So they're giving, you know, allocation to random neoclouts to make sure that there's not one person that has all the compute. Similarly, Anthropic or OpenAI, when they're working with the data providers, they …”
“that um uh or around that area and then i i forget what the numbers were for a memory at sk hynex and samsung and so forth but um if you look at so think about how the neocloud business works and how NVIDIA works with that or how the RL environment business works and how Anthropic works with that. In both those cases, NVIDIA is purposely trying to fracture the complementary industry to make sure that they have as much leverage as possible. So they're giving, you know, allocation to random neoclouts to make sure that there's not one person that has all the compute. Similarly, Anthropic or OpenAI, when they're working with the data providers, they say, no, we're going to just seed a huge industry of these things so that we're not locked into any one supplier for data environments. And I wonder why on the three nanometer process, that's going to be Tranium 3, that's going to be TPU v7, other accelerators potentially. And why is TSMC just giving it all up to NVIDIA rather than, you know, trying …”
View more
Ridealong summary
NVIDIA's aggressive supply chain strategy gives it an edge over competitors like Amazon and Google, securing the majority of TSMC's three-nanometer chip production. By committing early and signaling high demand, NVIDIA has positioned itself uniquely in the AI chip market, while rivals face delays and shifting priorities. This strategic maneuvering could redefine market dynamics in the tech industry.
“… That's where I'm wondering, yeah. It's something to try. It's something to try, no doubt. And back to chips. China's ByDance gets access to top NVIDIA AI chips, according to the Washington Street Journal. So they are seeing that ByDance is assembling significant computing power outside of China using these chips. They are supposedly working with the Southeast Asian firm Aulani Cloud and they plan to deploy approximately 500 NVIDIA Blackwell computing systems in Malaysia totaling 36,000 B200 chips. So that's 500 computing systems, presumably meaning racks, that amounts to tens of thousands of …”
“… different companies' agentic needs seems like a very much feasible future. Yeah, definitely not questioning the existence of a market, right? I'm just sort of skeptical of their positioning, as you said, to like, are they the ones to capture this? Yeah. That's where I'm wondering, yeah. It's something to try. It's something to try, no doubt. And back to chips. China's ByDance gets access to top NVIDIA AI chips, according to the Washington Street Journal. So they are seeing that ByDance is assembling significant computing power outside of China using these chips. They are supposedly working with the Southeast Asian firm Aulani Cloud and they plan to deploy approximately 500 NVIDIA Blackwell computing systems in Malaysia totaling 36,000 B200 chips. So that's 500 computing systems, presumably meaning racks, that amounts to tens of thousands of actual GPUs. So yeah, it's, I suppose, giving us an indication that the company, ByDance being a massive company that has their products used outside of China, no doubt, is able to make use of these chips in other countries.”
View more
Ridealong summary
ByteDance is gaining a significant edge by accessing NVIDIA's top-tier AI chips, enabling them to build impressive computing power outside China. They are collaborating with Aulani Cloud in Malaysia to deploy 500 Blackwell computing systems, totaling 36,000 GPUs. This strategic expansion highlights ByteDance's ambition to enhance its AI capabilities on a global scale.
“It is a big week for NVIDIA as their GTC developer conference kicks off in San Jose. CEO Jensen Huang was scheduled to deliver his keynote on Monday morning, so we'll likely know more by the time this episode goes out. In the lead-up to the event, much of the speculation was around a new chip system developed in collaboration with Grok. That is G-R-O-Q, not G-R-O-K. Grok with a Q is the one that is not an Elon Musk company. NVIDIA acquired the chip-making startup in …”
“It is a big week for NVIDIA as their GTC developer conference kicks off in San Jose. CEO Jensen Huang was scheduled to deliver his keynote on Monday morning, so we'll likely know more by the time this episode goes out. In the lead-up to the event, much of the speculation was around a new chip system developed in collaboration with Grok. That is G-R-O-Q, not G-R-O-K. Grok with a Q is the one that is not an Elon Musk company. NVIDIA acquired the chip-making startup in December and are expected to announce the first collaborative product this week. The information described the new product as integrating Grok's language processing chips into NVIDIA's rack-scale servers. If that's the case, this will be NVIDIA's first attempt to directly address inference demand. Until now, NVIDIA's chips have been world-leading in AI …”
View more
Ridealong summary
NVIDIA is transforming from a chip company into a full-stack AI infrastructure platform, with agent orchestration as a key component of its strategy.
NVIDIA is transforming from a chip company into a full-stack AI infrastructure platform, positioning itself as a leader in AI training, inference, and agent orchestration.
NVIDIA is transitioning from a chip company to a full-stack AI infrastructure platform, integrating Grok's inference chips and expanding its supply chain beyond Taiwan.
NVIDIA is transforming from a chip company into a full-stack AI infrastructure platform, positioning itself as a leader in AI training, inference, and agent orchestration.
NVIDIA is transitioning from a chip company to a full-stack AI infrastructure platform, positioning itself as a leader in AI advancements and infrastructure.
NVIDIA is transforming from a chip company into a full-stack AI infrastructure platform, addressing the rising demand for AI inference with new Grok-integrated products.
“… that we have a new model out there. introducing Nemotron 3 Super, an open hybrid mama transformer. MOE for agentic reasoning. This was announced by NVIDIA. Nemotron 3 Super has 120 billion total parameters with 12 billion active parameters per inference. It has apparently a 1 million token context window, although I do want to mention when we say 1 million token context window, as we have GPU 5.4 and I think Sonnet 4.6. On paper, it's about 1 million in practice as you get into the upper end maybe the model starts being very stupid The model compares favorably to other open source models although …”
“I figured with Fro-In projects and open source, we have one story here and it actually is related to tools in that we have a new model out there. introducing Nemotron 3 Super, an open hybrid mama transformer. MOE for agentic reasoning. This was announced by NVIDIA. Nemotron 3 Super has 120 billion total parameters with 12 billion active parameters per inference. It has apparently a 1 million token context window, although I do want to mention when we say 1 million token context window, as we have GPU 5.4 and I think Sonnet 4.6. On paper, it's about 1 million in practice as you get into the upper end maybe the model starts being very stupid The model compares favorably to other open source models although probably not all of them The charts that they provide, just compare it to GPT-OSS 120 and QAN 3.5, 122. So in that size class, it is on the benchmarks doing well, and the throughput is way, way higher. So that's one of the cool things with both an MOE model and a Mamba model.”
View more
Ridealong summary
Nvidia's Nemotron 3 Super is a game-changer in AI with its impressive parameters and high throughput, setting a new benchmark for open-source models.
Nvidia's Nemotron 3 Super is a breakthrough in AI with its massive parameter count and impressive throughput, positioning it as a leader in open-source AI models.
“… Wow. Wow. Okay, Tyler undefeated. Brutal. I'm a pangram truster. Trust the pangram. Trust the pangram. China's ByteDance got access to the top NVIDIA AI chips. Uh-oh. TikTok parent pushing global expansion plans to tap Blackwell processors that are barred for export to China. They're just flexing on us at this point. How did this leak? Fight Dance is working with a Southeast Asian company called Alani Cloud on plans to use some 500 Blackwell computing systems, totaling around 36,000 B200 chips. Is that a lot of chips? That's not as much as Elon's talking about, and I don't think that's at …”
“… is, you know, in the same style. Let me see. How do I scan for AI? I need to create an account. Let's see. Let's see, John. I'm trying. Let's see. Send me the link. Okay, I got it. I got it. What's my role? Okay, it says 100% human written. I got roasted. Wow. Wow. Okay, Tyler undefeated. Brutal. I'm a pangram truster. Trust the pangram. Trust the pangram. China's ByteDance got access to the top NVIDIA AI chips. Uh-oh. TikTok parent pushing global expansion plans to tap Blackwell processors that are barred for export to China. They're just flexing on us at this point. How did this leak? Fight Dance is working with a Southeast Asian company called Alani Cloud on plans to use some 500 Blackwell computing systems, totaling around 36,000 B200 chips. Is that a lot of chips? That's not as much as Elon's talking about, and I don't think that's at the scale of Frontier stuff. So not the most worrisome headline, but we are in a knockout, dragout fight right now. How many B200s would you provision if you were at a frontier lab right now, Tyler? I mean, I have no idea, but in the next paragraph it says, I mean, this is like what a 25x increase”
View more
Ridealong summary
ByteDance, the parent company of TikTok, has gained access to top NVIDIA AI chips, raising eyebrows in the tech community. This move is part of their ambitious global expansion plans, collaborating with Alani Cloud to utilize 500 advanced Blackwell computing systems. The implications of this access could shift the balance in the ongoing tech competition, especially concerning AI capabilities.
“… hypothetical, Like clearly the markets did react and a lot of names sold off But in a world where you believe that narrative you would think that NVIDIA would be going up But you saying that there are other factors at play that are sort of tamping down the excitement in the market broadly? I mean, there's no doubt. Just like tariffs a year ago, NVIDIA had 30% drawdown when their business was actually flying, the actual funnel of the business. I think the same thing is happening here with the Iran war. Things will eventually subside. Oil can't be a hundred dollars for forever and trump will …”
“… in $100 oil, this stuff is unsustainable and probably subside. Okay, so because I like the deep-seek analogy, and I feel like the market half-digested the agentic coding narrative and the Citrini article, whether you thought it went too far or was too hypothetical, Like clearly the markets did react and a lot of names sold off But in a world where you believe that narrative you would think that NVIDIA would be going up But you saying that there are other factors at play that are sort of tamping down the excitement in the market broadly? I mean, there's no doubt. Just like tariffs a year ago, NVIDIA had 30% drawdown when their business was actually flying, the actual funnel of the business. I think the same thing is happening here with the Iran war. Things will eventually subside. Oil can't be a hundred dollars for forever and trump will probably backpedal in the next few weeks ahead of the trump so let's uh let's recap a few of the key stories around nvidia we just came off of gtc and there's a lot going on uh at the company i mean it's a huge company uh maybe it'd be good to start with just uh next generation chips changes to strategy what people are actually buying maybe that means …”
View more
Ridealong summary
AI agent orchestration is driving explosive inference demand, with Nvidia strategically positioned to capitalize on this trend by securing supply agreements ahead of time.
Inference demand is exploding, driven by AI agents and coding assistants, highlighting Nvidia's strategic foresight in securing supply agreements ahead of this surge.
Top Podcasts About Nvidia
TBPN
12 episodes
The AI Daily Brief: Artificial Intelligence News and Analysis
4 episodes
Better Offline
4 episodes
The Rundown
3 episodes
Intelligent Machines (Audio)
3 episodes
Prof G Markets
2 episodes
Brad & Will Made a Tech Pod.
2 episodes
Last Week in AI
2 episodes
Stories Mentioning Nvidia
Best Podcasts on Hyperscaler AI Spending
New analyst reports indicate that the five largest US tech infrastructure providers (Amazon, Microsoft, Google, Meta, and Oracle) have committed to spending between $660 and $690 billion on CapEx in 2026. This near-doubling of 2025 spending levels is driven primarily by the construction of data centers and the acquisition of AI chips to support training clusters, raising questions among investors about the timeline for return on investment.
