
Lepton AI
Founded Year
2023Stage
Acquired | AcquiredTotal Raised
$11MAbout Lepton AI
Lepton AI focuses on AI model deployment and cloud-native infrastructure within the AI industry. The company provides services including reliability in deployment, development environments, and support for job processing and collaborative workflows. Lepton AI serves sectors that require AI model deployment and inference. It was founded in 2023 and is based in Cupertino, California. In April 2025, Lepton AI was acquired by NVIDIA.
Loading...
Loading...
Research containing Lepton AI
Get data-driven expert analysis from the CB Insights Intelligence Unit.
CB Insights Intelligence Analysts have mentioned Lepton AI in 1 CB Insights research brief, most recently on Apr 11, 2025.
Expert Collections containing Lepton AI
Expert Collections are analyst-curated lists that highlight the companies you need to know in the most important technology spaces.
Lepton AI is included in 2 Expert Collections, including Artificial Intelligence.
Artificial Intelligence
10,047 items
Generative AI
2,314 items
Companies working on generative AI applications and infrastructure.
Latest Lepton AI News
Jun 13, 2025
I cover emerging technologies with a focus on infrastructure and AI Follow Author Share Nvidia In April 2025, Nvidia quietly acquired Lepton AI, a Chinese startup specializing in GPU cloud services. Founded in 2023, Lepton AI focused on renting out GPU compute that’s aggregated from diverse infrastructure and cloud providers. While the deal value is unknown, the founders of Lepton AI, Yangqing Jia (former VP of Technology at Alibaba) and Junjie Bai, joined Nvidia to continue building the product. Lepton AI had previously raised $11 million in seed funding from investors such as CRV and Fusion Fund. Nvidia has rebranded Lepton AI as DGX Cloud Lepton and relaunched it in June 2025. According to Nvidia, the service delivers a unified AI platform and compute marketplace that connects developers to tens of thousands of GPUs from a global network of cloud providers. How Does DGX Cloud Lepton Work DGX Cloud Lepton serves as a unified AI platform and marketplace, bringing the global network of GPU resources closer to developers. It aggregates the GPU capacity offered by cloud providers, such as AWS, CoreWeave and Lambda, through a consistent software interface. This enables developers to access GPU compute through a centralized interface, regardless of the cluster’s location. Lepton Cloud Nvidia While leveraging the underlying GPU compute, Nvidia is exposing a consistent software platform powered by NIM, Nemo, Blueprints and Cloud Functions. Irrespective of the cloud infrastructure, developers can expect the same software stack to run their AI workflows. MORE FOR YOU Dev Pods: Interactive development environments (e.g., Jupyter notebooks, SSH, VS Code) for prototyping and experimentation. Batch Jobs: Large-scale, non-interactive workloads (e.g., model training, data preprocessing) that can be distributed across multiple nodes, with real-time monitoring and detailed metrics. Inference Endpoints: Deploy and manage models (base, fine-tuned, or custom) as scalable, high-availability endpoints, with support for both NVIDIA NIM and custom containers Apart from this, DGX Cloud Lepton delivers operational features such as real-time monitoring and observability, on-demand auto-scaling, custom workspaces, security and compliance. Developers can choose the region of their preference to maintain data locality and comply with data sovereignty requirements. DGX Lepton’s Growing Network At the recently held GTC event in Paris, Nvidia announced that it is working with some of the leading European cloud providers to enable local developers to meet the data sovereignty needs. It also announced a partnership with Hugging Face to deliver training clusters as a service. Nvidia collaborates with European venture capital firms, Accel, Elaia, Partech, and Sofinnova Partners, to provide up to $100,000 in GPU capacity credits and assistance from NVIDIA specialists for eligible portfolio firms via DGX Cloud Lepton. While the pricing varies based on the cloud provider, the service is currently in preview. Developers can sign up at https://developer.nvidia.com/dgx-cloud/get-lepton to apply for early access to Lepton. With DGX Cloud Lepton, Nvidia aims to make GPU computing accessible to global developers. Instead of launching its own cloud platform that competes with the hyperscalers, Nvidia has chosen to partner with them to deliver aggregated compute resources to developers.
Lepton AI Frequently Asked Questions (FAQ)
When was Lepton AI founded?
Lepton AI was founded in 2023.
Where is Lepton AI's headquarters?
Lepton AI's headquarters is located at 20863 Stevens Creek Boulevard, Cupertino.
What is Lepton AI's latest funding round?
Lepton AI's latest funding round is Acquired.
How much did Lepton AI raise?
Lepton AI raised a total of $11M.
Who are the investors of Lepton AI?
Investors of Lepton AI include NVIDIA, Charles River Ventures, HongShan and Fusion Fund.
Who are Lepton AI's competitors?
Competitors of Lepton AI include Baseten and 5 more.
Loading...
Compare Lepton AI to Competitors
Moreh focuses on hyperscale artificial intelligence (AI) infrastructure, providing a full-stack infrastructure software platform that integrates PyTorch with graphics processing unit (GPU) support for the development and deployment of large language models (LLMs). The company serves sectors that require advanced AI data center solutions and training and deployment of AI models. It was founded in 2020 and is based in Santa Clara, California.

Baseten deploys and serves machine learning models, concentrating on aspects like performance, scalability, and cost-efficiency. The company provides a platform for cloud-native infrastructure, embedded engineering, model management, and deployment, which are used in various artificial intelligence (AI) applications including transcription, large language models, image generation, and text-to-speech. Baseten serves the technology sector and offers solutions that support the transition of AI models from prototype to production. It was founded in 2019 and is based in San Francisco, California.
Xorbits provides Artificial Intelligence (AI) inference platforms within the technology sector. It offers products for resource allocation, model life-cycle management, and resource utilization strategies for AI models. It serves sectors that require inference services, including the technology industry and businesses using AI. It was founded in 2022 and is based in Hangzhou, China.

AI Dynamics operates within the artificial intelligence sector and offers a platform called NeoPulse that includes deep learning solutions and a low-code AutoML environment. The company serves sectors such as healthcare, life sciences, and manufacturing, focusing on applications for drug discovery, clinical trials, and industrial automation. AI Dynamics was formerly known as Dimensional Mechanics. It was founded in 2015 and is based in Bellevue, Washington.
Alpaca focuses on the intersection of artificial intelligence and art and operates within the technology and creative industries. The company offers a suite of AI tools designed to assist artists in their creative process, enabling them to generate images, refine concepts, and experiment with style and composition. Alpaca primarily serves the creative industry, particularly artists and designers. It is based in Montreal, Canada.

Clarifai focuses on artificial intelligence (AI), specializing in computer vision, natural language processing, and audio recognition across various sectors. The company provides an AI lifecycle platform for building, training, and deploying AI models, including data labeling and model management. Clarifai's solutions serve multiple industries, including content moderation and digital asset management. It was founded in 2013 and is based in Wilmington, Delaware.
Loading...