While AI capabilities have advanced dramatically, key gaps remain. The scientific community's understanding of frontier AI systems lags behind rapidly advancing capabilities. Knowledge of how these systems are trained is concentrated within the top research labs, limiting both the public discourse on AI and people's abilities to use AI effectively. And, despite their potential, these systems remain difficult for people to customize to their specific needs and values. To bridge the gaps, we're building Thinking Machines Lab to make AI systems more widely understood, customizable and generally capable. We are scientists, engineers, and builders who've created some of the most widely used AI products, including ChatGPT and Character.ai, open-weights models like Mistral, as well as popular open source projects like PyTorch, OpenAI Gym, Fairseq, and Segment Anything. Our MissionWe are focused on three key objectives:
Science is better when sharedScientific progress is a collective effort. We believe that we'll most effectively advance humanity's understanding of AI by collaborating with the wider community of researchers and builders. We plan to frequently publish technical blog posts, papers, and code. We think sharing our work will not only benefit the public, but also improve our own research culture. AI that works for everyoneEmphasis on human-AI collaboration. Instead of focusing solely on making fully autonomous AI systems, we are excited to build multimodal systems that work with people collaboratively. More flexible, adaptable, and personalized AI systems. We see enormous potential for AI to help in every field of work. While current systems excel at programming and mathematics, we're building AI that can adapt to the full spectrum of human expertise and enable a broader spectrum of applications. Solid foundations matterModel intelligence as the cornerstone. In addition to our emphasis on human-AI collaboration and customization, model intelligence is crucial and we are building models at the frontier of capabilities in domains like science and programming. Ultimately, the most advanced models will unlock the most transformative applications and benefits, such as enabling novel scientific discoveries and engineering breakthroughs. Infrastructure quality as a top priority. Research productivity is paramount and heavily depends on the reliability, efficiency, and ease of use of infrastructure. We aim to build things correctly for the long haul, to maximize both productivity and security, rather than taking shortcuts. Advanced multimodal capabilities. We see multimodality as critical to enabling more natural and efficient communication, preserving more information, better capturing intent, and supporting deeper integration into real-world environments. Learning by doingResearch and product co-design. Products enable iterative learning through deployment, while great products and research strengthen each other. Products keep us grounded in reality and guide us to solve the most impactful problems. Empirical and iterative approach to AI safety. The most effective safety measures come from a combination of proactive research and careful real-world testing. We plan to contribute to AI safety by:
We believe that methods developed for present day systems, such as effective red-teaming and post-deployment monitoring, provide valuable insights that will extend to future, more capable systems. Measure what truly matters. We'll focus on understanding how our systems create genuine value in the real world. The most important breakthroughs often come from rethinking our objectives, not just optimizing existing metrics. |
Our Team

(CEO)

Co-Founder

(CTO)

(Head of Ops)

(Chief Scientist)

(Infrastructure)

(Multimodality)

(Technical Staff)

(Research Scientist)

(Technical Staff)

(Founding Team)

(SW Engr in Boulder)

(Tech Staff in UK)

(Technical Staff)

(Research Engr NYC)

(Technical Staff)

(Research Engineer)

(IT)

(Founding Team)

(Head of HR)

(Tech Staff NYC)

(Founding Team)

(Technical Staff)

(Research Engineer)

(Research Engineer)

(Research in NYC)

(Software Engineer)

(Research in NYC)

(Technical Staff)
Join UsWe're building AI systems that push technical boundaries while delivering real value to as many people as possible. Our team combines rigorous engineering with creative exploration, and we're looking for collaborators to help shape this vision. You can follow us on X at @thinkymachines or submit job applications here if you're interested in working with us. Product BuildersJoin us in the exciting early stages of building something transformative. We are looking for people with a strong track record of building successful AI-driven products from the ground up and enthusiasm about wearing multiple hats--building functional product prototypes, crafting smooth UI designs and directing product decisions--to bring cutting-edge AI to the real world. Machine Learning ExpertsWe're putting together a small, high-caliber team of machine learning scientists and engineers. The activities will range from building training infrastructure to carrying out exploratory research projects. Whether you hold a PhD or are self-taught, we're interested in candidates who can demonstrate concrete achievements in ML research and engineering through:
Research Program ManagerAn efficient process can transform an entire team's productivity. As our first Research Program Manager, you'll shape how our team operates, scale our human data efforts, and lead key projects like GPU compute planning. If you excel at problem solving, fast learning, and driving operational excellence, this is your chance to make a big impact. A strong technical foundation and the ability to learn things fast will be highly valued. |
Posts

(CEO)
Today, we are excited to announce Thinking Machines Lab
(ThinkingMachines.ai),
an artificial intelligence research and product company.
We are scientists, engineers, and builders behind some
of the most widely used AI products and libraries,
including ChatGPT, Character.ai, PyTorch, and Mistral.
Our mission is to make artificial intelligence work for you
by building a future where everyone has access to the knowledge
and tools to make AI serve their unique needs.
We are committed to open science through publications and code releases, while focusing on human-AI collaboration that serves diverse domains. Our approach embraces co-design of research and products to enable learning from real-world deployment and rapid iteration. This work requires three core foundations: state-of-the-art model intelligence, high-quality infrastructure, and advanced multimodal capabilities. We are committed to building models at the frontier of capabilities to deliver on this promise. If you’re interested in joining our team, consider applying here: click here to apply I started Thinking Machines Lab alongside a remarkable team of scientists, engineers, and builders. We're building three things:
|

Co-Founder
This is something we have been cooking together for a few months
and I'm very excited to announce it today.
Thinking Machines Lab is my next adventure and I'm feeling very proud and lucky to start it with a group of talented colleagues. Learn more about our vision at: ThinkingMachines.ai |
AI TermsArtificial intelligence (AI) is a technology that enables computers to perform tasks that typically require human intelligence. AI uses math, logic, and large amounts of data to learn and improve its performance over time. AI JobsAI Engineer is a professional who develops, programs, and maintains artificial intelligence (AI) systems. They use AI and machine learning (ML) to create applications that can perform human-like tasks. |
History
Date | Event |
---|---|
2025-07-04 | GPT-5 model will be released |
2025-02-18 | T-M-L launched new website |
2024-12-15 | T-M-L was founded |
2024-09-07 | T-M-L registered domain name |
2023-10-30 | Biden signs Safe AI Executive Order |
2023-08-01 | DALL-E v3 model released |
2023-05-14 | GPT-4 model released |
2022-11-01 | ChatGPT model released |
2022-04-01 | DALL-E v2 model released |
2021-01-01 | DALL-E v1 model released |
2020-06-01 | GPT-3 model released |
2019-02-14 | GPT-2 model released |
2018-06-11 | GPT-1 model released |
Papers
Date | Document |
---|---|
2024-12-05 | OpenAI o1 System Card |
2024-11-28 | Reward Hacking |
2024-01-01 | The Rise of Thinking Machines |
2023-04-31 | Let’s Verify Step by Step |
2023-01-31 | Scaling Laws for Learning |
2022-03-21 | Language & Coding Creativity |
2021-07-14 | Evaluating LLMs Trained on Code |
2020-01-01 | Deterrence of Thinking Machines |
1983-01-01 | Thoughts of Thinking Machines |
InterviewsPresentationsReports |
Pictures






































