SAS (previously Statistical Analysis System) is a data and artificial intelligence software developed by SAS Institute for data management, advanced analytics, multivariate analysis, business intelligence, and predictive analytics.
SAS was developed at North Carolina State University from 1966 until 1976, when SAS Institute was incorporated. SAS was further developed in the 1980s and 1990s with the addition of new statistical procedures, additional components and the introduction of JMP. A point-and-click interface was added in version 9 in 2004. A social media analytics product was added in 2010. SAS Viya, a suite of analytics and artificial intelligence software, was introduced in 2016.
Technical overview and terminology
SAS is a data analysis and artificial intelligence software suite that can mine, alter, manage and retrieve data from a variety of sources and perform statistical analysis on it.
SAS programs have DATA steps, which retrieve and manipulate data, PROC (procedures) which analyze the data, and may also have functions.
The DATA step has executable statements that result in the software taking an action, and declarative statements that provide instructions to read a data set or alter the data's appearance. Data sets are organized into tables with rows called "observations" and columns called "variables". Additionally, each piece of data has a descriptor and a value.
PROC statements call upon named procedures. Procedures perform analysis and reporting on data sets to produce statistics, analyses, and graphics. There are more than 300 named procedures and each one performs a substantial body of statistical work.
SAS data can be published in HTML, PDF, Excel, RTF and other formats using the Output Delivery System, which was first introduced in 2007. SAS Enterprise Guide is SAS's point-and-click interface. It generates code to manipulate data or perform analysis without the use of the SAS programming language.
The SAS software suite has more than 200 add-on packages, sometimes called components Some of these SAS components, i.e. add on packages to Base SAS include:
History
Origins
The development of SAS started in 1966 after North Carolina State University re-hired Anthony Barr to program his analysis of variance and regression software so that it would run on IBM System/360 computers. The project was funded by the National Institutes of Health. to improve crop yields. Barr was joined by student James Goodnight, who developed the software's statistical routines, and the two became project leaders. In 1972, after issuing the first release of SAS, the project lost its funding. According to Goodnight, this was because NIH only wanted to fund projects with medical applications. Goodnight continued teaching at the university for a salary of $1 and access to mainframe computers for use with the project, In 1971, SAS 71 was published as a limited release. It was used only on IBM mainframes and had the main elements of SAS programming, such as the DATA step and the most common procedures, i.e. PROCs. In 1976, Barr, Goodnight, Sall, and Helwig removed the project from North Carolina State and incorporated it as the SAS Institute, Inc.
Development
SAS was redesigned in SAS 76. The INPUT and INFILE statements were improved so they could read most data formats used by IBM mainframes. Generating reports was also added through the PUT and FILE statements. The ability to analyze general linear models was also added as was the FORMAT procedure, which allowed developers to customize the appearance of data. SAS version 4 had limited features, but made SAS more accessible. Version 5 introduced a complete macro language, array subscripts, and a full-screen interactive user interface called Display Manager. Vertical products like SAS Financial Management and SAS Human Capital Management (then called CFO Vision and HR Vision respectively) were also introduced.
JMP was developed by SAS co-founder John Sall and a team of developers, in order to take advantage of the graphical user interface introduced in the 1984 Apple Macintosh. JMP's name originally stood for "John's Macintosh Project". JMP was shipped for the first time in 1989. Updated versions of JMP were released continuously after 2002 with the most recent release in 2016. In January 2022, JMP became a wholly owned subsidiary of SAS Institute, having previously been a business unit of the company.
SAS 6 was used throughout the 1990s and was available on a wider range of operating systems, including Macintosh, OS/2, Silicon Graphics, and PRIMOS. SAS introduced new features through dot-releases. From 6.06 to 6.09, a user interface based on the Windows paradigm was introduced and support for SQL was added. Version 7 introduced the Output Delivery System (ODS) and an improved text editor. Subsequent releases improved upon the ODS. For example, more output options were added in version 8. The number of operating systems that were supported was reduced to UNIX, Windows and z/OS, and Linux was added. SAS 8 and SAS Enterprise Miner were released in 1999. In 2004, SAS Version 9.0 was released, referred to as "Project Mercury" internally, and was designed to make SAS accessible to a broader range of business users. SAS 9.0 added custom user interfaces based on the user's role and established the point-and-click user interface of SAS Enterprise Guide as the software's primary graphical user interface (GUI). In 2008, SAS announced Project Unity, designed to integrate data quality, data integration, and master data management.
SAS Institute Inc v World Programming Ltd was a lawsuit with developers of a competing implementation, World Programming System, alleging that they had infringed SAS's copyright in part by implementing the same functionality. The case was referred by the United Kingdom's High Court of Justice to the European Court of Justice on 11 August 2010. In May 2012, the European Court of Justice ruled in favor of World Programming, finding that "the functionality of a computer program and the programming language cannot be protected by copyright."
A free version of SAS was introduced for students in 2010. SAS Social Media Analytics, a tool for social media monitoring, engagement and sentiment analysis, was also released that year. SAS Rapid Predictive Modeler (RPM), which creates basic analytical models using Microsoft Excel, was introduced the same year. In 2010, JMP 9 included a new interface for using the R programming language and an add-in for MS Excel. The following year, a High Performance Computing platform was made available in a partnership with Teradata and EMC Greenplum. In 2011, the company released SAS Enterprise Miner 7.1. The company introduced 27 data management products from October 2013 to October 2014 and updates to 160 others. At the SAS Global Forum 2015, SAS announced several new products that were specialized for different industries, as well as new training software.
The company has invested in the development of artificial general intelligence, or "strong AI", with the goal of advancing deep learning and natural language processing to the point of achieving cognitive computing.
In 2019, SAS announced that it would be investing $1 billion into the development of advanced artificial intelligence, deep learning, natural language processing and machine learning. It announced an additional $1 billion investment into these areas in 2023, particularly for industries such as finance, insurance, government, health care and energy. In September 2023, the company announced its expansion of research into the applications of generative AI in analytics, data management and modeling.
Software products
As of 2011, SAS's largest set of products was its line for customer intelligence. SAS modules for web, social media and marketing analytics may be used to profile customers and prospects, attempt to predict their behaviors, and manage communications.
SAS also provides the SAS Fraud Framework, which is designed to monitor transactions across different networks and use analytics to identify anomalies that are indicative of fraud. This software uses artificial intelligence to monitor income and assets.
SAS has various analytical tools related to risk management.
SAS has products for specific industries, such as government, retail, telecommunications, aerospace, marketing optimization, and high-performance computing. The company also has a suite of analytical products related to health care and life sciences.
Comparison to other products
In a 2005 article for the Journal of Marriage and Family comparing statistical packages from SAS and its competitors Stata and SPSS, Alan C. Acock wrote that SAS programs provide "extraordinary range of data analysis and data management tasks," but were difficult to learn and use. A 2014 comparison by the University of California, Los Angeles, gave similar results.
Competitors such as Revolution Analytics and Alpine Data Labs advertise their products as considerably cheaper than SAS's. In a 2011 comparison, Doug Henschen of InformationWeek found that start-up fees for the three are similar, though he admitted that the starting fees were not necessarily the best basis for comparison. SAS's business model is not weighted as heavily on initial fees for its programs, instead focusing on revenue from annual subscription fees.
SAS Viya
In 2016, SAS Viya, an artificial intelligence, machine learning, analytics and data management platform, was introduced with a new architecture optimized for running SAS software in public clouds. Viya also increased interoperability with open source software, allowing models to be built in tools such as R, Python and Jupyter, and then executed on SAS's Cloud Analytics Services (CAS) engine. In 2020, a further architectural revamp in Viya 4 containerized the software. SAS sells Viya alongside SAS 9.4, and has not positioned it as a replacement for SAS 9.4.
In 2023, two new software as a service (SaaS) modules for SAS Viya were released as a private preview: Workbench, for use in creating AI models, and App Factory, for use in creating AI applications. Both modules support multiple programming languages and are expected to become generally available in 2024. SAS Viya also became available on Microsoft Azure Marketplace under a pay-as-you-use model in 2023.
In 2023, the company introduced SAS Health, a common health data model built on the SAS Viya platform.
Adoption
According to IDC, SAS is the largest market-share holder in "advanced analytics" with 35.4 percent of the market as of 2013. It is the fifth largest market-share holder for business intelligence (BI) software with a 6.9% share and the largest independent vendor. It competes in the BI market against SAP BusinessObjects, IBM Cognos, SPSS Modeler, Oracle Hyperion, and Microsoft Power BI. SAS has been named in the Gartner Leader's Quadrant for Data Integration Tools and for Business Intelligence and Analytical Platforms.
A study published in 2011 in BMC Health Services Research found that SAS was used in 42.6 percent of data analyses in health service research, based on a sample of 1,139 articles drawn from three journals.
Uses and applications
Education
SAS' analytical software is used in education to measure and visualize student outcomes and growth trends. Several states, including Virginia, North Carolina, Mississippi, Missouri, and North Dakota use its software to measure and analyze learning loss and learning recovery in students.
Environmental science
SAS and the International Institute for Applied Systems Analysis launched an app that crowdsources image data related to deforestation to train AI algorithms that can identify human impact on the environment. The University of Florida's Center for Coastal Solutions partners with SAS to develop research, training programs and analytical tools related to environmental issues affecting coastal communities.
The UNC Center for Galapagos Studies partnered with SAS in 2023 to create a model that can track the health and migratory patterns of species such as sea turtles and hammerhead sharks, as well as the health of the phytoplankton population.
Finance and insurance
SAS's fraud detection and prevention software is used by the tax agencies of various countries, such as the United States, and Malta.
See also
- Comparison of OLAP servers
- JMP (statistical software), a subsidiary of SAS Institute Inc.
- SAS language
- R (programming language)
References
Further reading
External links
- SAS OnDemand for Academics No-cost access for learners (free SAS Profile required)
- A Glossary of SAS terminology
- SAS for Developers
- SAS community forums
- Wikiversity:Data Analysis using the SAS Language
