Conversation with Bibo Xu: How agent conversations are evolving with Google AI
Bibo Xu is a Product Manager at Google DeepMind and leads Gemini’s multimodal modeling. This video dives into Google AI’s journey from basic voice commands to advanced dialogue systems that comprehend not just what is said, but also tone, emotion, and visual context. Check out this conversation to gain a deeper understanding of the challenges and opportunities in integrating diverse AI capabilities when creating universal assistants. Resources: Chapters: 0:00 - Intro 1:43 - Introducing Bibo Xu 2:40 - Bibo’s Journey: From business school to voice AI 3:59 - The genesis of Google Assistant and Google Home 6:50 - Milestones in speech recognition technology 13:30 - Shifting from command-based AI to natural dialogue 19:00 - The power of multimodal AI for human interaction 21:20 - Real-time multilingual translation with LLMs 25:20 - Project Astra: Building a universal assistant 28:40 - Developer challenges in multimodal AI integration 29:50 - Unpacking the "can't see" debugging story 35:10 - The importance of low latency and interruption 38:30 - Seamless dialogue and background noise filtering 40:00 - Redefining human-computer interaction 41:00 - Ethical considerations for humanlike AI 44:00 - Responding to user emotions and frustration 45:50 - Politeness and expectations in AI conversations 49:10 - AI as a catalyst for research and automation 52:00 - The future of AI assistants and tool use 52:40 - AI interacting with interfaces 54:50 - Transforming the future of work and communication 55:19 - AI for enhanced writing and idea generation 57:13 - Conclusion and future outlook for AI development Subscribe to Google for Developers → https://goo.gle/developers Speakers: Bibo Xu, Christina Warren, Ashley Oldacre Products Mentioned: Google AI, Gemini, Generative AI, Android, Google Home, Google Voice, Project Astra, Gemini Live, Google DeepMind
--------
57:42
--------
57:42
What is Vets Who Code: Teaching veterans and leveraging AI
Learn about Jerome Hardaway's incredible journey from military service to self-taught software engineer and founder of Vets Who Code. This video delves into how he built a thriving community, teaching veterans to code and secure jobs in tech. Discover his unique "crawl, walk, run" learning methodology and how he integrates modern AI tools like Gemini and JetBrains into his curriculum, preparing developers for the evolving landscape of software engineering and data science. Chapters: 0:00 - Introduction to Jerome Hardaway's journey 1:17 - Vets Who Code: Building a community 2:15 - The "crawl, walk, run" learning process 8:49 - The impact of structured learning 11:00 - The vision for teaching veterans to code 16:10 - Measuring success and defining a "coder" 18:09 - Leveraging AI with Gemini for performance 19:01 - Customizing learning paths with AI 26:42 - Data-driven curriculum and job market trends 29:29 - Quickfire questions: Automating schedules with AI 33:35 - The next problems to solve: Workforce changes and AI 35:29 - The future of AI agents Resources: VetsWhoCode https://vetswhocode.io/ Jerome on Threads https://www.threads.com/@jeromehardaway Vets Who Code GitHub https://github.com/Vets-Who-Code Watch more People of AI → https://goo.gle/PAI Subscribe to Google for Developers → https://goo.gle/developers Speaker: Christina Warren, Jerome Hardaway Products Mentioned: Google AI, Gemini, Generative AI
--------
36:00
--------
36:00
Creative storytelling with AI: The making of Ancestra
In this episode of People of AI , we take you behind the scenes of "ANCESTRA," a groundbreaking film that integrates generative artificial intelligence into its core. Hear from the director Eliza McNitt and key collaborators from the Google DeepMind team about how they leveraged AI as a new creative tool, navigated its capabilities and limitations, and ultimately shaped a unique cinematic experience. Understand the future role of AI in filmmaking and its potential for developers and storytellers. Chapters: 0:00 - Introduction to Ancestra: AI in filmmaking 3:38 - The Origin Story of ANCESTRA 5:35 - Google DeepMind and Primordial Soup collaboration 11:47 - Veo and the creative process 20:21 - Behind the scenes: Making the film 28:47 - Generating videos: Gemini and Veo tools 38:11 - AI as a creative tool, not a replacement 47:41 - AI's impact and the future of the film industry 53:51 - Generative models: A new kind of camera 57:46 - Rapid fire & conclusion Resources: Ancestra → https://goo.gle/4mVScNW Making of ANCESTRA → https://goo.gle/3JVJil1 Veo 3 → https://goo.gle/4mWn3Kz Veo 3 Documentation → https://goo.gle/46qqFOV Veo 3 Cookbook → https://goo.gle/3VMVFSZ Google Flow → https://goo.gle/3VMVR4F Watch more People of AI → https://goo.gle/PAI Subscribe to Google for Developers → https://goo.gle/developers #PeopleofAI Speaker: Christina Warren, Ashley Oldacre, Eliza McNitt, Ben Wiley, Corey Matthewson, Products Mentioned: Google AI, Gemini, Veo 2, Veo 3
--------
1:01:40
--------
1:01:40
The evolved developer with Muhammad Farooq
Meet Muhammad Farooq, a machine learning engineer and a YouTuber on the channel @engineerprompt, where he explains AI concepts and builds his own projects. In this episode of People of AI, hosts Ashley and Christina chat with Muhammad about his journey in tech, his open source project localGPT, and how he's helping organizations deploy scalable AI solutions. Chapters: 0:00 Intro 3:15 From Ph.D to YouTube Creator 4:50 First viral YouTube hit 8:20 RAG and why it matters 11:20 The origins of localGPT 15:20 What has changed with building in the last 2 years 18:35 How AI tools and models have matured 19:50 Becoming a technical manager to LLMs 22:20 What agentic coding has enabled Muhammad to build in localGPT 2.0 27:45 Better maintaining open source projects by leveraging AI 28:50 What to watch out for when building with agentic systems 31:20 Advice for junior developers 35:40 Differences between how junior devs and senior devs use AI tools 39:19 Specs-driven development 41:20 Advice for experimenting with AI tools 42:50 The problem with overly-agreeable LLMs 45:10 Why long-context windows are useful for coding 47:30 Advice for someone launching a tech-focused YouTube channel 51:40 Rapid Fire Questions Resources: @engineerprompt YouTube Channel → https://goo.gle/4m8HCmn localGPT → https://goo.gle/45qwiuH localGPT 2.0 - Building the Best Private RAG System → https://goo.gle/4fwvyJb Muhammad’s website → https://goo.gle/47aHaj9 Muhammad on X → https://goo.gle/4lraJjz Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity → https://goo.gle/4oqVGct Watch more People of AI → https://goo.gle/PAI Subscribe to Google for Developers → https://goo.gle/developers #PeopleofAI Speaker: Christina Warren, Ashley Oldacre Products Mentioned: AI
--------
55:31
--------
55:31
Season 5 - Shaping the agentic future with Clement Farabet
Join hosts Ashley Oldacre and Christinia Warren as they kick off Season 5 of the People of AI podcast with their first guest, Clement Farabet, VP of Research at Google DeepMind. They discuss the evolution of AI, from early neural networks to the latest advancements in large language models and AI agents. Learn how research and product teams work together to bring developers the best tools to build with and are setting the stage for an agentic future. Resources: Clement Farabet → https://goo.gle/3UrbHkO “Attention is all you need” Paper → https://goo.gle/4kTmrmM Google DeepMind Models →https://goo.gle/4f1doPq AI Studio → https://goo.gle/4kSvjJu Agentic systems → https://goo.gle/41aqOmq Watch more People of AI → https://goo.gle/PAI
People of AI is a podcast showcasing inspiring people with interesting stories in the world of Artificial Intelligence (AI) and its subset, Machine Learning (ML). The podcast will interview leaders, practitioners, researchers and learners in the field of AI/ML and invite them to share their stories, what they are building, lessons learned along the way, and excitement for the AI/ML industry.