Oh no! Some styles failed to load. 😵 Please try reloading this page

Compare the Top Data Catalog Software of 2021

Data Catalog Software Guide

What is Data Catalog Software?

Data catalog software enables organizations to automatically identify and take inventory of data sources across their systems and uses metadata management technology to organize and catalog the data. Compare the best Data Catalog software currently available using the table below.

  • 1
    OvalEdge

    OvalEdge

    OvalEdge

    OvalEdge is a cost-effective data catalog designed for end-to-end data governance, privacy compliance, and fast, trustworthy analytics. OvalEdge crawls your organizations’ databases, BI platforms, ETL tools, and data lakes to create an easy-to-access, smart inventory of your data assets. Using OvalEdge, analysts can discover data and deliver powerful insights quickly. OvalEdge’s comprehensive functionality enables users to establish and improve data access, data literacy, and data quality.
    Starting Price: $1,300/month
  • 2
    K2View

    K2View

    K2View

    K2View provides an operational data fabric dedicated to making every customer experience personalized and profitable. The K2View platform continually ingests all customer data from all systems, enriches it with real-time insights, and transforms it into a patented Micro-Database™ - one for every customer. To maximize performance, scale, and security, every micro-DB is compressed and individually encrypted. It is then delivered in milliseconds to fuel quick, effective, and pleasing customer interactions. K2View products include: • Data Fabric Software • Test Management Tools • Data Preparation Software Global 2000 companies – including AT&T, Vodafone, Sky, and Hertz – deploy K2View in weeks to deliver outstanding multi-channel customer service, minimize churn, achieve hyper-segmentation, and assure data compliance.
  • 3
    Alation

    Alation

    Alation

    What if you had a recommendation engine for your data? Data inventory was automated. A searchable catalog revealed user behavior of data. And the system proactively made smart recommendations inline, as you wrote queries. Alation makes all of this possible with the world’s first collaborative data catalog for the enterprise. It's a powerful solution that dramatically improves the productivity of analysts, the accuracy of analytics, and empowers better business decisions for all. Alation surfaces proactive recommendations to data consumers through applications. We took inspiration from Google for a simple interface to connect the language of your business to the technical schema of your data. Finding the data you need is no longer stalled by tricky semantic translations. Unfamiliar with a data environment, and unsure of which data to use in your query? Alation helps you build the query and signals whether data is trustworthy with inline recommendations.
  • 4
    Lumada

    Lumada

    Hitachi

    Embed sensors for IoT use cases and enrich sensor data with control system and environment data. Integrate this in real time with enterprise data and deploy predictive algorithms to discover new insights and harvest your data for meaningful use. Use analytics to predict maintenance problems, understand asset utilization, reduce defects and optimize processes. Harness the power of connected devices to deliver remote monitoring and diagnostics services. Employ IoT Analytics to predict safety hazards and comply with regulations to reduce worksite accidents. Lumada Data Integration: Rapidly build and deploy data pipelines at scale. Integrate data from lakes, warehouses and devices, and orchestrate data flows across all environments.
  • 5
    Aginity

    Aginity

    Aginity

    On a quest to eradicate inconsistent analytics, Aginity SQL workspace empowers collaborative analytics within enterprise SQL communities. With Aginity's innovative active catalog, data engineers & analytic stewards can curate and promote the most useful & efficient SQL assets with role-based governance. Aginity provides one platform to manage data across on-premise and cloud databases, dramatically decreasing the time spent cleaning, organizing, and transforming data, while increasing productivity and data quality with intuitive workflows for searching, sharing, and modularizing code. Clients include Geico, Aetna, and Johnson & Johnson.​
  • 6
    Tree Schema Data Catalog
    The essential tool for metadata management. Automatically populate your entire catalog in under 5 minutes! Data Discovery. Find the data you need anywhere within your data ecosystem from the database all the way down to the specific values for each field. Automatically document your data from existing data stores. First-class support for tabular and unstructured data. Automated data governance actions. Data Lineage. Explore your data lineage and understand where your data comes from and where it is going. View impact analysis of changes Find all up and downstream impacts. Visualize relationships and connections. API AccessNew. Manage your data lineage as code and keep your catalog up to date with the Tree Schema API. Integrate Data Lineage into CICD pipelines Capture values & descriptions within your code Analyze impact for breaking changes. Data Dictionary. Know the key terms and lingo that drive your business. Define the context and scope for keywords
    Starting Price: $99 per month
  • 7
    Azure Data Catalog
    In the new world of data, you can spend more time looking for data than you do analyzing it. Azure Data Catalog is an enterprise-wide metadata catalog that makes data asset discovery straightforward. It’s a fully-managed service that lets you—from analyst to data scientist to data developer—register, enrich, discover, understand, and consume data sources. Work with data in the tool of your choice. Data Catalog lets you find the data you need and use it in the tools you choose. Your data stays where you want it, and Data Catalog helps you discover and work with it where you want, with an intuitive user experience. ncrease broad adoption and continuous value creation across your data ecosystem. Data Catalog helps you get tips, tricks, and unwritten rules into an experience where everyone can get value. With Data Catalog, everyone can contribute. Democratize data asset discovery.
    Starting Price: $1 per user per month
  • 8
    erwin Data Intelligence
    erwin Data Intelligence (erwin DI) combines data catalog and data literacy capabilities for greater awareness of and access to available data assets, guidance on their use, and guardrails to ensure data policies and best practices are followed. Automatically harvest, transform and feed metadata from a wide array of data sources, operational processes, business applications and data models into a central catalog. Then make it accessible and understandable via role-based, contextual views so stakeholders can make strategic decisions based on accurate insights. erwin DI supports enterprise data governance, digital transformation and any effort that relies on data for favorable outcomes. Schedule ongoing scans of metadata from the widest array of data sources. Easily map data elements from source to target, including data in motion, and harmonize data integration across platforms. Enable data consumers to define and discover data relevant to their roles.
    Starting Price: $299 per month
  • 9
    Talend Data Catalog
    Talend Data Catalog gives your organization a single, secure point of control for your data. With robust tools for search and discovery, and connectors to extract metadata from virtually any data source, Data Catalog makes it easy to protect your data, govern your analytics, manage data pipelines, and accelerate your ETL processes. Data Catalog automatically crawls, profiles, organizes, links, and enriches all your metadata. Up to 80% of the information associated with the data is documented automatically and kept up-to-date through smart relationships and machine learning, continually delivering the most current data to the user. Make data governance a team sport with a secure single point of control where you can collaborate to improve data accessibility, accuracy, and business relevance. Support data privacy and regulatory compliance with intelligent data lineage tracing and compliance tracking.
    Starting Price: $100 per month
  • 10
    Qlik Catalog
    When you empower your business with on-demand access to analytics-ready data, you accelerate discovery and people get answers faster. Qlik Catalog is an enterprise data catalog that simplifies and accelerates the profiling, organization, preparation, and delivery of trustworthy, actionable data in days, not months. Qlik Catalog builds a secure, enterprise-scale catalog of all the data your organization has available for analytics, no matter where it is. Powerful, automated data preparation and metadata tools streamline the transformation of raw data into analytics-ready information assets. Business users get a single, go-to data catalog to find, understand, and use any enterprise data source to gain insights. Automatically profile and document the exact content, structure, and quality of your data using built-in data loaders to simplify and accelerate the process. Build a Smart Data Catalog that documents every aspect of your data.
    Starting Price: $30 per user per month
  • 11
    Google Cloud Data Catalog
    A fully managed and highly scalable data discovery and metadata management service. New customers get $300 in free credits to spend on Google Cloud during the Free Trial. All customers get up to 1 MiB of business or ingested metadata storage and 1 million API calls, free of charge. Pinpoint your data with a simple but powerful faceted-search interface. Sync technical metadata automatically and create schematized tags for business metadata. Tag sensitive data automatically, through Cloud Data Loss Prevention (DLP) integration. Get access immediately then scale without infrastructure to set up or manage. Empower any user on the team to find or tag data with a powerful UI, built with the same search technology as Gmail, or via API access. Data Catalog is fully managed, so you can start and scale effortlessly. Enforce data security policies and maintain compliance through Cloud IAM and Cloud DLP integrations.
    Starting Price: $100 per GiB per month
  • 12
    IBM Watson Knowledge Catalog
    Activate business-ready data for AI and analytics with intelligent cataloging, backed by active metadata and policy management. IBM Watson® Knowledge Catalog is a data catalog tool that powers intelligent, self-service discovery of data, models and more. The cloud-based enterprise metadata repository activates information for AI, machine learning (ML) and deep learning. Access, curate, categorize and share data, knowledge assets and their relationships, wherever they reside. Organize, define and manage enterprise data to provide the right context and drive value across needs like regulatory compliance and data monetization. Protect data, manage compliance and audit-readiness, and maintain client trust with active policy management and dynamic masking of sensitive data. Consume and transform data at the speed of business with intuitive dashboards and flows that can be shared with peers or analytics tools.
    Starting Price: $300 per instance
  • 13
    SAP Data Intelligence
    Turn data chaos into data value with data intelligence. Connect, discover, enrich, and orchestrate disjointed data assets into actionable business insights at enterprise scale. SAP Data Intelligence is a comprehensive data management solution. As the data orchestration layer of SAP’s Business Technology Platform, it transforms distributed data sprawls into vital data insights, delivering innovation at scale. Provide your users with intelligent, relevant, and contextual insights with integration across the IT landscape. Integrate and orchestrate massive data volumes and streams at scale. Streamline, operationalize, and govern innovation driven by machine learning. Optimize governance and minimize compliance risk with comprehensive metadata management rules. Connect, discover, enrich, and orchestrate disjointed data assets into actionable business insights at enterprise scale.
    Starting Price: $1.22 per month
  • 14
    Tableau Catalog
    Everyone benefits with Tableau Catalog. By providing a complete picture of the data and how it is connected to the analytics in the Tableau environment, Tableau Catalog increases the trust and discoverability for both IT and business users. Whether you're communicating changes being made to the data, reviewing a dashboard or searching for the right data for analysis, Tableau Catalog lets you feel confident your organization is always using the right data. Tableau Catalog automatically ingests all of the data assets in your Tableau environment into one central list. No need to set up an index schedule or configure connectivity. Quickly see all your tables, files, and databases in one place. Migrating databases, deprecating a field or adding a new column to a table all have potential effects on the assets in your environment. With lineage and impact analysis, you can see not only what assets will have up and downstream implications but also who will be affected.
    Starting Price: $15 per month
  • 15
    DvSum

    DvSum

    DvSum

    DvSum is a AI-powered Data Intelligence platform that makes it remarkably easier for your data and analytics teams to discover, monitor, and govern data. With powerful AI-enabled algorithms, DvSum automatically catalogues, classifies, and curates your data and makes it available as an actionable Data Catalog. Propel your enterprise towards its digital and analytics enabled transformation goals with DvSum Data Intelligence.
    Starting Price: $1000/ per month
  • 16
    Alteryx

    Alteryx

    Alteryx

    Alteryx is the launchpad for automation breakthroughs. Be it your personal growth, achieving transformative digital outcomes, or rapid innovation, the results are unparalleled. The unique innovation that converges analytics, data science and process automation into one easy-to-use platform, empowers everyone and every organization ​to make business-altering breakthroughs the new status quo.​ Visit Alteryx.com for more information, and to start your free trial.
  • 17
    Informatica Enterprise Data Catalog
    Scan and index metadata, discover and profile data, and provide detailed lineage across tens of millions of data sets. Classify and organize data assets across any environment to maximize data value and reuse. Automatically scan across multi-cloud platforms, BI tools, ETL, and third-party metadata catalogs; and data types. Leverage AI-powered domain discovery, data similarity, business term associations, and recommendations. Track data movement, from high-level system views to granular column-level lineage, and get detailed impact analysis. Use the Data Asset Analytics dashboard to understand asset usage, enrichment, and collaboration. View data quality rules, scorecards, metric groups, and profiling stats in context. Tap into shared data knowledge with certifications, ratings and reviews, a Q&A platform, and change notifications. Our broad and deep lineup of enterprise-grade data management solutions sets Informatica apart from the crowd.
  • 18
    erwin Data Catalog
    erwin Data Catalog by Quest is metadata management software that helps organizations learn what data they have and where it’s located, including data at rest and in motion. It tells you the data and metadata available for a certain topic so those particular sources and assets can be found quickly for analysis and decision-making. erwin Data Catalog automates the processes involved in harvesting, integrating, activating and governing enterprise data according to business requirements. This automation results in greater accuracy and faster time to value for data governance and digital transformation efforts, including data warehouse, data lake, data vault and other Big Data deployments, cloud migrations, etc. Metadata management is key to sustainable data governance and any other organizational effort for which data is key to the outcome. erwin Data Catalog automates enterprise metadata management, data mapping, data cataloging, code generation, data profiling and data lineage.
  • 19
    Oracle Cloud Infrastructure Data Catalog
    Oracle Cloud Infrastructure (OCI) Data Catalog is a metadata management service that helps data professionals discover data and support data governance. Designed specifically to work well with the Oracle ecosystem, it provides an inventory of assets, a business glossary, and a common metastore for data lakes. OCI Data Catalog is fully managed by Oracle and runs with all the power and scale of Oracle Cloud Infrastructure. Benefit from all of the security, reliability, performance, and scale of Oracle Cloud while using OCI Data Catalog. Using REST APIs and SDKs, developers can integrate OCI Data Catalog’s capabilities in their custom applications. Using a trusted system for managing user identities and access privileges, administrators can control access to data catalog objects and capabilities to manage security requirements. Discover data assets across Oracle data stores on-premises and in the cloud to start gaining real value from data.
  • 20
    Infogix Data360
    Improve your decision making by increasing productivity, accuracy and understanding of all available data. This is the start of making your data work for you–not against you. Insights are only as valuable as the quality of the data used to construct them. Start by identifying all accessible data using the automated catalog, search, and discovery features in Data360. Translate highly technical metadata into meaningful business information that will benefit everyone – and can be utilized by anyone. Managing data responsibility is no easy task. Making the most of the data available to your organization requires a data governance solution that prioritizes data quality. Discover the quality, value, and trustworthiness of your data sets with Data360 Govern, an enterprise data governance, catalog, and metadata management solution.
  • 21
    Datameer

    Datameer

    Datameer

    Datameer is a SaaS Data Transformation solution for Snowflake Data Warehouses. Datameer accelerates analytical data engineering with a hybrid SQL and No-Code Interface that empowers engineers and analysts to discover, transform, catalog, and publish data assets utilizing the native compute of Snowflake for downstream reporting, analytics, and machine learning.
  • 22
    Anzo

    Anzo

    Cambridge Semantics

    Anzo is a modern data discovery and integration platform that lets anyone find, connect and blend any enterprise data into analytics-ready datasets. Anzo’s unique use of semantics and graph data models makes it practical for the first time for virtually anyone in your organization – from skilled data scientists to novice business users – to drive the data discovery and integration process and build their own analytics-ready datasets. Anzo’s graph data models provide business users with a visual map of enterprise data that is easy to understand and navigate, even when your data is vast, siloed and complex. Semantics add business content to data, allowing users to harmonize data based on shared definitions and build blended, business-ready data on demand.
  • 23
    AWS Glue

    AWS Glue

    Amazon

    AWS Glue is a fully managed extract, transform, and load (ETL) service that makes it easy for customers to prepare and load their data for analytics. You can create and run an ETL job with a few clicks in the AWS Management Console. You simply point AWS Glue to your data stored on AWS, and AWS Glue discovers your data and stores the associated metadata (e.g. table definition and schema) in the AWS Glue Data Catalog. Once cataloged, your data is immediately searchable, queryable, and available for ETL.
  • 24
    Apache Atlas

    Apache Atlas

    Apache Software Foundation

    Atlas is a scalable and extensible set of core foundational governance services – enabling enterprises to effectively and efficiently meet their compliance requirements within Hadoop and allows integration with the whole enterprise data ecosystem. Apache Atlas provides open metadata management and governance capabilities for organizations to build a catalog of their data assets, classify and govern these assets and provide collaboration capabilities around these data assets for data scientists, analysts and the data governance team. Pre-defined types for various Hadoop and non-Hadoop metadata. Ability to define new types for the metadata to be managed. Types can have primitive attributes, complex attributes, object references; can inherit from other types. Instances of types, called entities, capture metadata object details and their relationships. REST APIs to work with types and instances allow easier integration.
  • 25
    Zaloni Arena
    End-to-end DataOps built on an agile platform that improves and safeguards your data assets. Arena is the premier augmented data management platform. Our active data catalog enables self-service data enrichment and consumption to quickly control complex data environments. Customizable workflows that increase the accuracy and reliability of every data set. Use machine-learning to identify and align master data assets for better data decisioning. Complete lineage with detailed visualizations alongside masking and tokenization for superior security. We make data management easy. Arena catalogs your data, wherever it is and our extensible connections enable analytics to happen across your preferred tools. Conquer data sprawl challenges: Our software drives business and analytics success while providing the controls and extensibility needed across today’s decentralized, multi-cloud data complexity.
  • Previous
  • You're on page 1
  • 2
  • Next