Learn about business intelligence (BI) features in T-SQL and how they can help you with data science and analytics efforts without the need to bring in other languages such as R and Python. This book shows you how to compute statistical measures using your existing skills in T-SQL. You will learn how to calculate descriptive statistics, including centers, spreads, skewness, and kurtosis of distributions. You will also learn to find associations between pairs of variables, including calculating linear regression formulas and confidence levels with definite integration.
Every web/retail shop wants to know the products customers tend to buy together. Trying to predict the target discrete or continuous variable with few input variables is important for practically every type of business. This book helps you understand data science and the advanced algorithms use to analyze data, and terms such as data mining, machine learning, and text mining.
- Describe distribution of variables with statistical measures
- Find associations between pairs of variables
- Evaluate the quality of the data you are analyzing
- Perform time-series analysis on your data
- Forecast values of a continuous variable
- Perform market-basket analysis to predict customer purchasing patterns
- Predict target variable outcomes from one or more input variables
- Categorize passages of text by extracting and analyzing keywords