AWS Glue vs SAS Data Management

Our analysts compared AWS Glue vs SAS Data Management based on data from our 400+ point analysis of ETL Tools, user reviews and our own crowdsourced data from our free software selection platform.

Product Basics

AWS Glue is a fully managed, serverless data integration service that extracts, cleanses and organizes data for insights. Automatic code generation lets citizen data scientists and power users create and schedule integration workflows. An event-driven architecture enables setting triggers to launch data integration processes.

A common data catalog with automatic schema generation ensures data is unique and easily accessible. With streaming data integration, it catalogs assets from datastores like Amazon S3, making them available for querying with Amazon Athena and Redshift Spectrum. Developers can access ready-made endpoints to edit and test code.
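
To give a feel for what automatic schema generation involves, here is a toy sketch of type inference over sampled records. It is not how Glue's crawlers are implemented; the function names, the sample rows and the type names (`bigint`, `double`, `timestamp`, `string`) are assumptions chosen for illustration.

```python
from datetime import datetime


def _all_parse(values, parser):
    """Return True if parser accepts every value without raising ValueError."""
    try:
        for value in values:
            parser(value)
    except ValueError:
        return False
    return True


def infer_type(values):
    """Pick the narrowest illustrative type that fits all non-empty values."""
    present = [v for v in values if v != ""]
    if not present:
        return "string"
    if _all_parse(present, int):
        return "bigint"
    if _all_parse(present, float):
        return "double"
    if _all_parse(present, datetime.fromisoformat):
        return "timestamp"
    return "string"


def infer_schema(rows):
    """Infer a column -> type mapping from a list of string-valued records."""
    return {col: infer_type([row[col] for row in rows]) for col in rows[0]}


if __name__ == "__main__":
    # Hypothetical sample of raw CSV-like records.
    sample = [
        {"order_id": "1001", "amount": "19.99", "created": "2024-01-05", "note": "gift"},
        {"order_id": "1002", "amount": "5", "created": "2024-01-06", "note": ""},
    ]
    print(infer_schema(sample))
```

A catalog built this way stores only the inferred metadata, not the data itself, which is why query engines can share one definition of each table.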

Pros
  • Serverless & Scalable
  • Easy Visual Workflow
  • Built-in Data Connectors
  • Pay-per-Use Pricing
  • AWS Ecosystem Integration
Cons
  • Complex Transformations
  • Limited On-Premise Data
  • Python & Scala Only
  • Potential Cost Overruns
  • AWS Lock-in Concerns
SAS Data Management empowers organizations to wrangle their data, from ingestion and cleansing to transformation and governance. It excels at handling mountains of complex data, making it ideal for industries like finance, healthcare, and government. Key benefits include streamlined data integration, enhanced data quality, and robust security controls. Popular features include its drag-and-drop interface, automation capabilities, and advanced analytics tools. User experiences suggest it can be powerful for data wranglers but might have a steeper learning curve compared to simpler options. Pricing is typically per seat or core, with annual subscriptions or perpetual licenses available.

Pros
  • Robust for large datasets
  • Scalable & high performance
  • Advanced data manipulation
  • Automate complex tasks
  • Strong statistical analysis
Cons
  • Steep learning curve
  • Costly licensing & maintenance
  • Limited visual analytics
  • Interface is not user-friendly
  • Difficult to debug code
AWS Glue pricing: $0.44 per DPU-Hour; a free trial is unavailable.
SAS Data Management pricing: $300 monthly.
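
To make the pay-per-use pricing concrete, the arithmetic can be sketched as follows. This is a minimal illustration that assumes the $0.44 per DPU-Hour rate listed above and a simple linear model (DPUs × hours × rate). The function names and the sample job sizes are hypothetical, and a real invoice may include charges this sketch ignores.

```python
RATE_PER_DPU_HOUR = 0.44  # rate quoted above; check current pricing before relying on it


def job_cost(dpus: int, runtime_minutes: float, rate: float = RATE_PER_DPU_HOUR) -> float:
    """Estimate the cost of one job run as DPUs x hours x rate."""
    dpu_hours = dpus * runtime_minutes / 60
    return round(dpu_hours * rate, 2)


def monthly_cost(dpus: int, runtime_minutes: float, runs_per_month: int) -> float:
    """Estimate a month of identical runs."""
    return round(job_cost(dpus, runtime_minutes) * runs_per_month, 2)


if __name__ == "__main__":
    # Hypothetical nightly job: 10 DPUs for 30 minutes, 30 runs a month.
    print(job_cost(10, 30))
    print(monthly_cost(10, 30, 30))
    # The same job running 4 hours a night.
    print(monthly_cost(10, 240, 30))
```

Under these assumptions the short nightly job costs about $66 a month, while the four-hour variant costs about $528, which illustrates why long-running jobs are the usual source of cost overruns.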
Company size: Small, Medium, Large
Platforms: Windows, Mac, Linux, Android, Chromebook
Deployment: Cloud, On-Premise, Mobile

Product Assistance

Training: Documentation, In Person, Live Online, Videos, Webinars
Support: Email, Phone, Chat, FAQ, Forum, Knowledge Base, 24/7 Live Support

Product Insights

AWS Glue benefits
  • Effortless Data Integration: Streamline data movement across diverse sources like databases, applications, and cloud storage with pre-built connectors and automated schema discovery.
  • Simplified Data Preparation: Clean, transform, and enrich data with a visual drag-and-drop interface and built-in transformations, eliminating the need for complex coding.
  • Serverless Scalability: Forget infrastructure management! Glue seamlessly scales to handle massive data volumes without upfront provisioning or ongoing maintenance.
  • Cost-Effective Flexibility: Pay-per-use pricing based on actual resource consumption makes Glue ideal for both small and large data pipelines, optimizing your costs.
  • Seamless AWS Integration: Leverage the power of the AWS ecosystem! Glue effortlessly integrates with S3, Redshift, and other AWS services, creating a unified data pipeline within your existing infrastructure.
  • Improved Data Accessibility: Deliver prepared data to data lakes, data warehouses, and analytics platforms, democratizing access for data scientists, analysts, and business users.
  • Enhanced Collaboration: Share data pipelines and workflows with other users and teams, fostering collaboration and streamlining data-driven workflows.
  • Centralized Data Catalog: Maintain a single source of truth for your data assets with Glue Data Catalog, ensuring data consistency and discoverability.
  • Continuous Monitoring and Optimization: Track job performance, identify bottlenecks, and optimize your pipelines for efficiency with built-in monitoring and logging tools.
  • Future-Proof Data Infrastructure: Stay ahead of the curve with Glue's serverless architecture and cloud-native approach, adapting to your evolving data needs with ease.
SAS Data Management benefits
  • Faster, Deeper Insights: SAS Data Management streamlines data preparation, reducing time spent wrangling data and freeing you to focus on analysis. Dive into complex datasets faster with automated tasks, data quality checks, and efficient transformation tools.
  • Unify Data from Anywhere: Break down data silos and gain a holistic view with seamless access across diverse sources. Connect to databases, cloud platforms, and data lakes with ease, regardless of format or location.
  • Empower Business Users: Equip non-technical users with self-service tools for data discovery and exploration. Drag-and-drop interfaces and intuitive wizards make data manipulation accessible, fostering data-driven decision-making across the organization.
  • Boost Efficiency and Productivity: Automate repetitive tasks and simplify complex workflows with SAS Data Management's powerful scripting language. Eliminate manual processes and free up time for higher-value analysis, boosting team productivity.
  • Build Trustworthy Data: Ensure data quality and compliance with comprehensive governance features. Track data lineage, maintain audit trails, and apply robust security measures to build trust in your data and its insights.
  • Unleash the Power of AI and Machine Learning: Integrate AI and machine learning capabilities directly into your data pipelines. Cleanse data with intelligent algorithms, identify hidden patterns, and generate predictive models, all within the SAS Data Management platform.
  • Scale with Confidence: SAS Data Management scales seamlessly to meet your growing data needs. Handle large and complex datasets efficiently, whether on-premises or in the cloud, with robust infrastructure and performance optimization tools.
  • Future-proof Your Data Strategy: Stay ahead of the curve with SAS Data Management's continuous innovation. Access cutting-edge technologies like in-memory analytics and cloud-native capabilities to adapt to the evolving data landscape.
AWS Glue key features
  • Console: Discover and transform data assets and make them available for querying and analysis. Build complex data integration pipelines that handle dependencies, filter bad data and retry jobs after failures. Monitor jobs and get task status alerts via Amazon CloudWatch.
  • Data Catalog: Gleans and stores metadata in the catalog for workflow authoring, with full version history. Search and discover desired datasets from the data catalog, irrespective of where they are located. Saves time and money – automatically computes statistics and registers partitions with a central metadata repository. 
  • Automatic Schema Discovery: Creates metadata automatically by gleaning schema, quality and data types through built-in datastore crawlers and stores it in the Data Catalog. Ensure up-to-date assets – run crawlers on a schedule, on-demand or based on event triggers. Manage streaming data schemas with the Schema Registry. 
  • Event-driven Architecture: Move data automatically into data lakes and warehouses by setting triggers based on a schedule or event. Start extract, transform and load jobs from a Lambda function as soon as new data becomes available.
  • Visual Data Prep: Prepare assets for analytics and machine learning through Glue DataBrew. Automate anomaly filtering, convert data to standard formats and rectify invalid values with more than 250 pre-designed transformations – no need to write code. 
  • Materialized Views: Create a virtual table from multiple different data sources by using SQL. Copies data from each source data store and creates a replica in the target datastore as a materialized view. Ensures data is always up-to-date by monitoring data in source stores continuously and updating target stores in real time. 
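
The event-driven pattern described in the list above, where a Lambda function starts an ETL job when new data arrives, can be sketched as follows. This is a minimal sketch under stated assumptions: the job name `example-etl-job` and the `--source_path` argument are hypothetical, the event is assumed to be a standard S3 "object created" notification, and the optional `glue_client` parameter exists only so the handler can be exercised without AWS access.

```python
import urllib.parse

JOB_NAME = "example-etl-job"  # hypothetical Glue job name


def build_job_arguments(event):
    """Turn an S3 'object created' event into arguments for the Glue job."""
    record = event["Records"][0]["s3"]
    bucket = record["bucket"]["name"]
    # Object keys arrive URL-encoded in S3 event notifications.
    key = urllib.parse.unquote_plus(record["object"]["key"])
    return {"--source_path": f"s3://{bucket}/{key}"}


def handler(event, context, glue_client=None):
    """Lambda entry point: start one Glue job run for the new object."""
    if glue_client is None:
        import boto3  # imported lazily; only needed when running in AWS

        glue_client = boto3.client("glue")
    response = glue_client.start_job_run(
        JobName=JOB_NAME,
        Arguments=build_job_arguments(event),
    )
    return response["JobRunId"]
```

Passing the object location as a job argument keeps the Glue script generic, since the same job can then process whichever file triggered it.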
SAS Data Management key features
  • Integrated Development Environment (IDE): Access source systems virtually and create target structures. Manage processes with an intuitive, point-and-click, role-based GUI – import and export metadata functions and run ETL and ELT process flows. Supports interactive debugging and testing of jobs with full log access. 
  • Unified Architecture: Leverage the complete data pipeline, from data quality to data federation, in one platform. Ensure data transparency and accountability with auditing tools and source data lineage.
  • Process Designer: Build and update data management processes with a visual, end-to-end event designer. Control and run data integration tasks and fork jobs to execute in parallel. Run shell scripts by calling REST and SOAP web services. 
  • Embeddable Data Quality: Access customizable business rules within batch, near-time and real-time processes and reuse as needed. Identify incomplete, ambiguous and inaccurate data with its interactive GUI. Get alerts for when data quality falls below acceptable standards. Supports data cleansing in native languages for more than 38 regions globally. 
  • Data Transformation: Build data warehouses, data marts, and BI and analytic data stores by pulling data from multiple sources. Extract required data with more than 300 out-of-the-box SQL-based transforms. Reuse transform functions in different projects and environments through custom exits, message queues and web services. 

Product Ranking

  • AWS Glue: #9 among all ETL Tools
  • SAS Data Management: #43 among all ETL Tools

Analyst Rating Summary

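  • Overall: AWS Glue 88, SAS Data Management 94
  • Data Delivery: AWS Glue 100, SAS Data Management 100
  • Data Quality: AWS Glue 92, SAS Data Management 94
  • Data Sources and Targets Connectivity: AWS Glue 62, SAS Data Management 84

Top-rated categories for AWS Glue: Data Delivery, Performance and Scalability, Platform Capabilities, Platform Security, Workflow Management
Top-rated categories for SAS Data Management: Data Delivery, Metadata Management, Performance and Scalability, Platform Capabilities, Platform Security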

Analyst Ratings for Functional Requirements

Each score is followed by its three-part percentage breakdown.

  • Data Delivery: AWS Glue 100 (100%, 0%, 0%); SAS Data Management 100 (100%, 0%, 0%)
  • Data Quality: AWS Glue 92 (85%, 8%, 7%); SAS Data Management 94 (85%, 15%, 0%)
  • Data Sources and Targets Connectivity: AWS Glue 62 (36%, 0%, 64%); SAS Data Management 84 (71%, 0%, 29%)
  • Data Transformation: AWS Glue 90 (88%, 0%, 12%); SAS Data Management 97 (96%, 0%, 4%)
  • Metadata Management: AWS Glue 96 (90%, 0%, 10%); SAS Data Management 100 (100%, 0%, 0%)
  • Platform Capabilities: AWS Glue 100 (100%, 0%, 0%); SAS Data Management 100 (100%, 0%, 0%)
  • Workflow Management: AWS Glue 100 (100%, 0%, 0%); SAS Data Management 92 (80%, 10%, 10%)

Analyst Ratings for Technical Requirements

  • Performance and Scalability: AWS Glue (100%, 0%, 0%); SAS Data Management (100%, 0%, 0%)
  • Platform Security: AWS Glue (100%, 0%, 0%); SAS Data Management (100%, 0%, 0%)

User Sentiment Summary

AWS Glue has a 'great' User Satisfaction Rating of 85% when considering 165 user reviews from 3 recognized software review sites. 85% of users recommend this product.

SAS Data Management has a 'great' User Satisfaction Rating of 86% when considering 99 user reviews from 4 recognized software review sites. 86% of users recommend this product.

Ratings by review site, with review counts in parentheses:
  • AWS Glue: 4.0 (46), n/a, 4.4 (109), 3.9 (10)
  • SAS Data Management: 4.2 (18), 4.6 (17), 4.6 (29), 4.0 (35)

Awards

SelectHub research analysts have evaluated AWS Glue and concluded it earns best-in-class honors for Workflow Management.

Award data for SAS Data Management is still being gathered.

Synopsis of User Ratings and Reviews

AWS Glue pros
  • Cost-Effective & Serverless: Pay only for resources used, eliminates server provisioning and maintenance
  • Simplified ETL workflows: Drag-and-drop UI & auto-generated code for easy job creation, even for non-programmers
  • Data Catalog: Unified metadata repository for seamless discovery & access across various data sources
  • Flexible Data Integration: Connects to diverse data sources & destinations (S3, Redshift, RDS, etc.)
  • Built-in Data Transformations: Apply pre-built & custom transformations within workflows for efficient data cleaning & shaping
  • Visual Data Cleaning (Glue DataBrew): Code-free data cleansing & normalization for analysts & data scientists
  • Scalability & Performance: Auto-scaling resources based on job needs, efficient Apache Spark engine for fast data processing
  • Community & Support: Active user community & helpful AWS support resources for problem-solving & best practices
SAS Data Management pros
  • Streamlined Workflow: Simplifies data management tasks with drag-and-drop interface and automated processes, saving time and improving efficiency.
  • Robust Data Quality: Ensures data accuracy and consistency through comprehensive cleaning, validation, and transformation tools, fostering trust in data-driven decisions.
  • Scalability and Performance: Handles large datasets efficiently with parallel processing and optimized algorithms, enabling complex analyses without performance bottlenecks.
  • Extensive Integrations: Connects seamlessly with various data sources and analytics platforms, facilitating a holistic view of data across the organization.
  • Regulatory Compliance: Supports secure data governance and auditability for meeting industry regulations, providing peace of mind and reducing compliance risks.
AWS Glue cons
  • Limited Customization & Control: Visual interface and pre-built transformations may not be flexible enough for complex ETL needs, requiring manual coding or custom Spark jobs.
  • Debugging Challenges: Troubleshooting Glue jobs can be complex due to limited visibility into underlying Spark code and distributed execution, making error resolution time-consuming.
  • Performance Limitations for Certain Workloads: Serverless architecture may not be optimal for latency-sensitive workloads or large-scale data processing, potentially leading to bottlenecks.
  • Vendor Lock-in & Portability: Migrating ETL workflows from Glue to other platforms can be challenging due to its proprietary nature and lack of open-source compatibility.
  • Pricing Concerns for Certain Use Cases: Pay-per-use model can be expensive for long-running ETL jobs or processing massive datasets, potentially exceeding budget constraints.
SAS Data Management cons
  • Cost and Licensing: Requires significant upfront investment and ongoing licensing fees, making it less accessible to smaller organizations or budget-constrained projects.
  • Steep Learning Curve: Complex interface and proprietary language can be challenging for users without prior SAS experience, requiring dedicated training and support.
  • Limited Open-Source Integration: Primarily focused on its own ecosystem, with limited compatibility and integration with open-source tools and platforms.
  • Black-Box Nature: Limited transparency into internal algorithms and processes can make troubleshooting and debugging complex issues challenging.
  • Vendor Lock-in: Switching to other data management solutions can be difficult and costly due to data dependencies and lack of standard export formats.

User reviews of AWS Glue paint a picture of a powerful and user-friendly ETL tool for the cloud, but one with limitations. Praise often centers around its intuitive visual interface, making complex data pipelines accessible even to non-programmers. Pre-built connectors and automated schema discovery further simplify setup, saving users time and effort. Glue's serverless nature and tight integration with the broader AWS ecosystem are also major draws, offering seamless scalability and data flow within a familiar environment.

However, some users find Glue's strength in simplicity a double-edged sword. For complex transformations beyond basic filtering and aggregation, custom scripting in Python or Scala is required, limiting flexibility for those unfamiliar with these languages. On-premise data integration is another pain point, with Glue primarily catering to cloud-based sources. This leaves users seeking hybrid deployments or integration with legacy systems feeling somewhat stranded.

Cost also arises as a concern. Glue's pay-per-use model can lead to unexpected bills for large data volumes or intricate pipelines, unlike some competitors offering fixed monthly subscriptions. Additionally, Glue's deep integration with AWS can create lock-in anxieties for users worried about switching cloud providers in the future.

Overall, user reviews suggest Glue shines in cloud-based ETL for users comfortable with its visual interface and scripting limitations. Its scalability, ease of use, and AWS integration are undeniable strengths. However, for complex transformations, on-premise data needs, or cost-conscious users, alternative tools may offer a better fit.

User reviews of SAS Data Management paint a nuanced picture. Fans praise its streamlined workflow, robust data quality tools, and scalability for handling massive datasets. They appreciate its seamless integration with various data sources and analytics platforms, enabling a holistic view and fostering trust in data-driven decisions. Regulatory compliance support is another major plus, offering peace of mind and reducing risks.

However, critics point to the hefty price tag and complex licensing structures as major barriers, especially for smaller companies or budget-constrained projects. The steep learning curve can be daunting for new users, requiring dedicated training and potentially slowing down productivity. Limited open-source integration and a closed-ecosystem nature restrict flexibility and collaboration with external tools. The black-box nature of its algorithms can also make troubleshooting and debugging difficult. Some users feel locked in due to data dependencies and non-standard export formats, making transitioning to other solutions costly and cumbersome.

Ultimately, SAS Data Management's strengths in robust data handling, scalability, and compliance shine for organizations with complex data needs and strict regulations. However, its high cost, limited open-source compatibility, and steep learning curve make it less ideal for smaller companies or those seeking greater flexibility and affordability. Users weighing options should carefully consider their specific needs and resources before making a decision.

Top Alternatives in ETL Tools


  • Azure Data Factory
  • Cloud Data Fusion
  • Dataflow
  • DataStage
  • Fivetran
  • Hevo
  • IDMC
  • Informatica PowerCenter
  • InfoSphere Information Server
  • Integrate.io
  • Oracle Data Integrator
  • Pentaho
  • Qlik Talend Data Integration
  • SAP Data Services
  • SAS Data Management
  • Skyvia
  • SQL Server
  • SQL Server Integration Services
  • Talend
  • TIBCO Cloud Integration
