Founded Year

2013

Stage

Incubator/Accelerator - IV | Alive

Total Raised

$864.6M

Revenue

$0000 

Mosaic Score
The Mosaic Score is an algorithm that measures the overall financial health and market potential of private companies.

-12 points in the past 30 days

About Dataiku

Dataiku is an artificial-intelligence (AI) platform that integrates technology, teams, and operations to assist companies in incorporating intelligence into their daily operations across various industries. The platform provides tools for building, deploying, and managing data, analytics, and AI projects, including Generative AI, Machine Learning, data preparation, insights generation, and AI governance. Dataiku serves banking, life sciences, manufacturing, telecommunications, insurance, retail, public sector, utilities, energy, and healthcare sectors. It was founded in 2013 and is based in New York, New York.

Headquarters Location

125 West 25th Street 7th Floor

New York, New York, 10001,

United States

646-568-7477

Loading...

Dataiku's Product Videos

ESPs containing Dataiku

The ESP matrix leverages data and analyst insight to identify and rank leading companies in a given technology landscape.

EXECUTION STRENGTH ➡MARKET STRENGTH ➡LEADERHIGHFLIEROUTPERFORMERCHALLENGER
Enterprise Tech / Supply Chain & Procurement

The demand forecasting and inventory optimization market offers solutions that utilize sales data and other relevant metrics to drive inventory and supply chain planning improvements. These solutions use advanced technologies such as artificial intelligence and machine learning to forecast demand accurately across sales channels and help plan more efficient and cost-effective inventory ordering. T…

Dataiku named as Highflier among 15 other companies, including Microsoft, Oracle, and RELEX.

Dataiku's Products & Differentiators

    Dataiku

    Dataiku is the platform for Everyday AI. With the explosion of generative AI, everyone is using AI for everyday tasks. Companies want to channel that excitement to transform business outcomes. Dataiku’s single, coherent platform is the only product that welcomes users with a wide range of skills and expertise, covers the full lifecycle of an AI project, and provides value to individuals at every level. Dataiku accelerates AI projects from months to days with a rich visual interface, built-in solutions, and pre-built components that take full advantage of a wide variety of generative AI services and cloud platforms for maximum speed and scale. Dataiku also provides strong MLOps, responsible AI, and governance capabilities, allowing teams and executives to monitor and manage AI projects and build confidence in AI outcomes.

Loading...

Research containing Dataiku

Get data-driven expert analysis from the CB Insights Intelligence Unit.

CB Insights Intelligence Analysts have mentioned Dataiku in 5 CB Insights research briefs, most recently on Feb 13, 2025.

Expert Collections containing Dataiku

Expert Collections are analyst-curated lists that highlight the companies you need to know in the most important technology spaces.

Dataiku is included in 4 Expert Collections, including Unicorns- Billion Dollar Startups.

U

Unicorns- Billion Dollar Startups

1,278 items

A

AI 100 (All Winners 2018-2025)

200 items

T

Tech IPO Pipeline

257 items

The tech companies we think could hit the public markets next, according to CB Insights data.

A

Artificial Intelligence

9,906 items

Latest Dataiku News

The 10 Hottest Data Science And Machine Learning Tools Of 2025 (So Far)

Jun 20, 2025

The data science and machine learning technology space is undergoing rapid changes, fueled primarily by the wave of generative AI and—just in the last year—agentic AI systems and the large language models that power them. Where data science and machine learning tools were traditionally targeted toward developing and supporting data analytics and predictive analytics systems, these tools today are increasingly supporting the development of AI agentic systems, according to The 2025 Gartner Magic Quadrant for Data Science and Machine Learning Platforms. Another major trend, according to the report, is a shift away from point-specific tools towards “full-stack data and AI platforms” that encompass model development and lifecycle management, data science tasks, data pipelines and other chores. What's more, these platforms are themselves incorporating LLMs and GenAI assistants “to enhance the data science workflow.” As part of CRN's 2025 Year So Far series , here's a look at some of the hottest data science and machine learning tools in use today. Some of the following tools are relatively new to the market while others have been around for a while and recently updated. The list also includes both commercial products and open-source software. AI Agents With Dataiku Dataiku's flagship Universal AI Platform is one of the industry's leading AI and machine learning platforms for data scientists. In April Dataiku debuted AI Agents with Dataiku, a new set of capabilities within the platform for creating and controlling AI agents at scale. The platform supports the central creation of agents with Code Agent for data scientists and developers and the Visual Agent no-code option for non-technical business users. Capabilities include Managed Agent Tools for maintaining the quality and validation of tools used by agents and a GenAI Registry for strategic oversight of agentic use cases. A key technology within AI Agents with Dataiku is the Dataiku LLM Mesh architecture to manage model access across all proprietary, open-source and cloud service large language models, according to the company. Dataiku Safe Guard defines and applies guardrails while Agent Connect centralizes agent access across an organization from a single interface. For agent observability and performance monitoring, AI Agents with Dataiku provides Trace Explorer for visibility into agent decision making, Quality Guard to continuously evaluate and monitor agent performance, and Cost Guard for real-time usage tracking, budget enforcement and internal cost allocation. Anaconda AI Platform Anaconda is well known for its data science and AI platform for developers using the popular Python programming language. The new Anaconda AI Platform, unveiled in May, is a unified open-source platform that Anaconda says provides a comprehensive system for streamlining machine learning workflows and building, training and deploying machine learning models. The platform provides simplified development and governance controls to boost practitioner productivity and reduce organizational risks associated with open-source AI development, according to the company. Features and capabilities include Anaconda AI Navigator for AI application development and experimentation with large language model, and the AI-powered Anaconda Assistant chatbot that assists with coding, debugging and data visualization. It also includes Conda Package Manager for managing packages and dependencies, curated “essential” ML libraries, and MLOps for automating model deployment and management. DataRobot Syftr In May, agentic workforce platform developer DataRobot debuted Syfter, an open-source framework that's designed to help AI developers evaluate and identify performant agentic workflows for commercial use. Syfter, according to DataRobot, “empowers AI practitioners to programmatically discover and implement the best combinations of components, parameters, tools and strategies for agentic use cases” and optimize them for accuracy, processing speed and cost. Some of Syftr's capabilities include multi-objective search and Bayesian optimization early stopping mechanism. Syfter is currently available as a “permissively licensed” open-source project. An enterprise edition of Syfter will be available this fall. Domino Enterprise AI Platform Domino Data Lab's flagship Domino Enterprise AI Platform is a machine learning operations (MLOps) system that helps organizations build and operate AI at scale. The platform provides a central hub for data science teams, according to the company, offering tools and infrastructure for managing the entire data science lifecycle, from initial exploration to model deployment and monitoring. In June, Domino Data Lab launched a new release of the platform with new capabilities, including a unified system for productivity, governance and delivery, “turning fragmented initiatives into an AI factory” for trusted, repeatable outcomes. The release also included a new Zero-to-AI service to catalyze proven AI cultural change within an organization. Hex Technologies Hex provides a collaborative data science and analytics workspace where data teams and business users can share analytical results. The platform combines the capabilities of traditional data science notebooks with integrated AI assistance, data applications and reports, and advanced collaboration functionality, according to the company. In January, the company introduced Hex Embedded Analytics, which allows developers to build the Hex technology into data products such as applications that need customer-facing analytics. In May, Hex raised an impressive $70 million in Series C funding. MLflow 3.0 MLflow is an open-source MLOps platform for managing workflows and artifacts across the machine learning lifecycle, according to the mlflow.org website, assisting machine learning practitioners and development teams in handling the complexities of the machine learning process. MLflow 3.0, introduced on June 11, “isn't just another feature update,” according to the 3.0 release announcement, but “fundamentally expands what's possible” with ML tooling and addresses observability and quality challenges around GenAI deployment. The new edition provides the LoggedModel1 entity to enable better organization and comparison of generative AI agents, deep learning checkpoints, and model variants across experiments. It also offers a new GenAI evaluation suite and enhanced model tracking for lineage support. The MLflow project was originally created by data management platform giant Databricks, which contributed it to the Linux Foundation in 2020. Databricks offers a fully managed MLflow service on its own platform. Today MLflow has more than 30 million monthly downloads and contributions from more than 850 developers worldwide, according to Databricks. PyTorch 2.7.1 PyTorch is a widely used, open-source machine learning library and framework for developing and training deep neural networks. It is known for its flexibility and ease-of-use for more intuitive model building and debugging, along with its dynamic computation graphs capabilities, according to the PyTorch.org website. The most recent edition is PyTorch 2.7.1 released on June 4, according to GitHub . The new version includes support for Python 3.12 and optimizations for AOTInductor. PyTorch 2.7.1 is part of the PyTorch 2 series that focuses on enhancing PyTorch's performance and the user experience through compiler-level changes. Snowflake Data Science Agent At its Snowflake Summit 2025 in early June Snowflake unveiled Data Science Agent, an “agentic companion” that the company said boosts data scientists' productivity by automating routine machine learning model development tasks. Snowflake said Data Science Agent simplifies AI and ML workflows, democratizes users' access to data across their businesses, and eliminates technical overhead – all through a natural language interface within Snowflake, according to the company. Data Science Agent, soon to be in private preview, uses Anthropic's Claude large language models to break down problems associated with ML workflows into distinct steps, such as data analysis, data preparation, feature engineering and training, according to Snowflake. The product creates fully functional pipelines using such advanced techniques as multi-step reasoning, contextual understanding and action execution. Tecton 1.1 Tecton got its start developing a feature platform that streamlines the process of building, deploying and managing machine learning features. The company expanded beyond its machine learning roots in September 2024 with a new release of its platform that delivers contextual data to the large language models that power generative AI systems. In February, the company debuted Tecton 1.1, the latest update to the platform, with added capabilities the company says makes it simpler for AI teams to build more sophisticated features, optimize infrastructure and improve model performance. The release includes new API resources for accessing any third-party data source in real time, a new capability for more efficiently performing the calculations needed for real-time feature views to speed up transformations during online retrieval queries, and a number of performance enhancements in the core Tecton platform. TensorFlow 2.19 TensorFlow is a popular open-source machine learning platform and software library for developing and deploying machine learning models—especially sophisticated deep learning models and neural networks—for AI. TensorFlow 2.19 was released in March with a number of technical improvements including changes to the C++ API in LiteRT and bfloat16 support for tflite casting. While PyTorch is generally seen as an alternative platform for small-scale machine learning development projects where model experimentation and quick editing are priorities, TensorFlow is generally viewed as best for large projects and production environments that require performance and scalability, according to the TensorFlow.org website TensorFlow is available under the Apache License 2.0. Rick Whiting has been with CRN since 2006 and is currently a feature/special projects editor. Whiting manages a number of CRN's signature annual editorial projects including Channel Chiefs, Partner Program Guide, Big Data 100, Emerging Vendors, Tech Innovators and Products of the Year. He also covers the Big Data beat for CRN. He can be reached at rwhiting@thechannelcompany.com

Dataiku Frequently Asked Questions (FAQ)

  • When was Dataiku founded?

    Dataiku was founded in 2013.

  • Where is Dataiku's headquarters?

    Dataiku's headquarters is located at 125 West 25th Street, New York.

  • What is Dataiku's latest funding round?

    Dataiku's latest funding round is Incubator/Accelerator - IV.

  • How much did Dataiku raise?

    Dataiku raised a total of $864.6M.

  • Who are the investors of Dataiku?

    Investors of Dataiku include NVIDIA DGX-Ready Managed Services, FirstMark Capital, Battery Ventures, Dawn Capital, CapitalG and 19 more.

  • Who are Dataiku's competitors?

    Competitors of Dataiku include Chalk, 2021.AI, PAASUP, Databricks, Amber Road and 7 more.

  • What products does Dataiku offer?

    Dataiku's products include Dataiku.

  • Who are Dataiku's customers?

    Customers of Dataiku include Banker's Bank, Unilever, US Venture, Floa Bank and Thrive SPC.

Loading...

Compare Dataiku to Competitors

DataRobot Logo
DataRobot

DataRobot specializes in artificial intelligence and offers an open, end-to-end AI lifecycle platform within the technology sector. The company provides solutions for scaling AI applications, monitoring and governing AI models, and driving business value through predictive and generative AI. DataRobot serves various industries, including healthcare, manufacturing, retail, and financial services, with its AI platform. It was founded in 2012 and is based in Boston, Massachusetts.

H2O.ai Logo
H2O.ai

H2O.ai specializes in generative AI and machine learning. It provides a comprehensive AI cloud platform for various industries. The company offers a suite of AI cloud products, including automated machine learning, distributed machine learning, and tools for AI-driven data extraction and processing. H2O.ai caters to sectors such as financial services, healthcare, insurance, manufacturing, marketing, retail, and telecommunications. H2O.ai was formerly known as 0xdata. It was founded in 2012 and is based in Mountain View, California.

Domino Logo
Domino

Domino provides an enterprise artificial intelligence platform for AI model development and deployment across various industries. The company's offerings include a platform for building, deploying, and managing AI models, with features that support collaboration and integration into enterprise workflows. Its platform is intended to support AI operations and knowledge sharing within organizations. The company was founded in 2013 and is based in San Francisco, California.

Alteryx Logo
Alteryx

Alteryx is a company specializing in enterprise analytics, providing a platform that facilitates data preparation and analytics processes. The company's products allow users to conduct data analysis, develop predictive models, and visualize data insights. Alteryx serves sectors that require data analytics capabilities, including financial services, retail, healthcare, and manufacturing. Alteryx was formerly known as SRC. It was founded in 1997 and is based in Irvine, California.

Databricks Logo
Databricks

Databricks provides a platform for businesses to integrate data and artificial intelligence. The company's main offerings include a data intelligence platform that supports generative artificial intelligence (AI), analytics, data governance, and data warehousing. Databricks serves sectors requiring data management and analytics capabilities, including financial services, healthcare, and media and entertainment. It was founded in 2013 and is based in San Francisco, California.

I
InData Labs

InData Labs provides data science consulting and software development focused on data science and AI across various sectors. The company offers services including predictive analytics, natural language processing, computer vision, and big data analytics. InData Labs serves industries such as finance, e-commerce, marketing and advertising, manufacturing, and healthcare. It was founded in 2014 and is based in Miami, Florida.

Loading...

CBI websites generally use certain cookies to enable better interactions with our sites and services. Use of these cookies, which may be stored on your device, permits us to improve and customize your experience. You can read more about your cookie choices at our privacy policy here. By continuing to use this site you are consenting to these choices.