Q10: Please talk about how, when and why we use should descriptive analytics?

Going back to our previous definition of descriptive analytics, it is used to answer questions about what has happened in a business. It is primary use is to look at the current business situation with an eye towards looking for cause and effect. It helps one to understand how to manage in the present based on what happened in the past.

The vast majority that have attended my trainings on analytics, are looking for help with descriptive analytics challenges. Using unstructured big data for predictive analytics modeling is not really something they are concerned with.

I have found that people who are really engaged with analytics are very driven to self-educate. They are driven by curiosity to make use of cutting edge stuff to tackle bigger and bigger challenges. For data scientists and really good analysts, descriptive analytics is easy and kinda boring.

But that is a small percentage of people who use analytics every day.  To most of my attendees, its more about how to cut down on the time it takes for them to prepare the reports they have to make and how to make them more useful to their bosses. That’s where most of my descriptive analytics training has an impact.

How to make a better report? How to build and maintain a simple business dashboard? How to have more impactful power point slides. How to streamline the reporting process? This is one way to look at descriptive analytics… its not just taking historical data and using it for reports, but also how to make the reports better.

ANALYTICS CAN

So how can we use descriptive analytics? Well, we probably already are. Inventory control, payroll, performance management, quality assurance, sales reports, marketing results… all use forms of descriptive analytics. They take what happen, they look at it and then they make decisions.

For the most part this can and is done in Excel. If you want to supercharge what you do in Excel, then you can use a business intelligence tool to build dashboards and publish dynamic reports. This is where most people doing reports need help. How to better visualize the data so it has more power and how to use BI tools to do things faster than can be done in Excel.

In many, many companies a lot of time and energy has been devoted to building reporting tools in house. And this is generally the problem. The reports are static and hard to change. If you are in a company like this, then descriptive analytics can be a bear.

To make the most of it, I suggest using free tools like Tableau Public, which is free, to demonstrate new ways to analyze and report data, to get the boss interested in updating the way you company reports.

Another big challenge facing analysts doing mostly descriptive analytics in the form of reporting, is blending data. Taking data from different data sources and combining them. This can often be very manual and general done in excel if you company hasn’t invested in a way to centrally store enterprise wide data and make it easily accessible. There are some applications out there that can help you with this, Alteryx and Qlikview being ones I have used and they both have a free demo.

If you are already doing predictive analytics, then you probably have your descriptive analytics figured out.

So, if you need help super charging your reporting, are looking to get started using business intelligence and data blending tools, and/or need to build a business case to invest more into analytics, let me know. I’m happy to help you come up with a much better way to build reports that have real impact and don’t take up all your time.

 

Prelude to Q10: Understanding the 3 different types of analytics.

The analytics efforts in a business are generally divided into 3 types; descriptive, predictive and prescriptive analytics.

A simple definition of descriptive analytics is that it is used to answer questions about what has happened in a business. It is primary use is to look at the current business situation with an eye towards looking for cause and effect. It helps one to understand how to manage in the present based on what happened in the past.

Per the Commission on Higher Education (CHED), descriptive analytics make use of current transactions to enable managers to visualize how the company is performing. When teaching the concept, it is generally focused on analysis and reporting to guide decision-making.

Most businesses use mostly descriptive analytics in their analysis, reporting and decision-making.

Three_Phases_of_Analytics

Have to apologize to whoever made this image,  I dont know the source, but you have my thanks for making it. 

As you can see in the image, predictive analytics takes data and extrapolates patterns to predict likely outcomes. Past, Present, Past Present, Future… the goal being too provided educated guesses on what is most likely to happen next. The primary use of predictive analytics is to predict outcomes using models that will mitigate risk and eliminate choices based on unlikely outcomes.

Per CHED, Predictive analytics allows voluminous data to be used for prediction, classification and association making it very useful tool for projections, forecasts, and correlations. Most lessons around predictive analytics involve data modeling and require a much higher degree of skill then descriptive analytics.

In general, predictive analytics is used by large companies in data-rich industries. Up until recently there were very few tools available to smaller businesses to add this type of analytics to their decision-making.

Prescriptive analytics goes one step further and finds the best course of action for a given situation. Its primary goal is to enhance decision-making by giving multiple outcomes based on multiple variables.   The analogy of how doctors prescribe medicine to patients based on a wide range of variables in a patient’s health, using an equally wide range of treatment options.

Per CHED, Prescriptive Analytics help organizations develop insights to make decisions from the current data that maximizes the organization goals.  Prescriptive analytics not only anticipates what will happen and when it will happen, but also why it will happen. Largely, instruction take the model building found in predictive analytics and supercharges it with more data, more choices and more outcomes.

Prescriptive analytics is fairly new and just now gaining widespread use in the corporate world. There are not many tools available that are cheap or easy to use. Generally, you find data scientists assigned to prescriptive analytics projects. It also take us closer to some decision-making in a business being completely automated. With enough data on hand, using machine learning to analyze the data, we are starting to see artificial intelligence at play with prescriptive analytics. It is a pretty exciting time.

Its important to keep in mind that to really be good at predictive and prescriptive analytics you need both the high tech tools and the training/experience to use them effectively.

 

Q9: Can you please describe the concepts of storing data in a data ware house?

Twenty years ago data was mostly stored in databases. These databases housed all the data a business would need to do analytics. Transaction data, sales data, customer data, demographic data was all neatly collected, stored and analyzed in databases.

A surprising number of companies still store most of their data in databases. It works well for business that just need to look at historical data to conduct basic descriptive analytics.

About ten years ago the amount of data captured in a business and the growing diversity in date sources and data storage brought about the mainstream use of data warehouses in the business world.

Data warehouse are often a collection of databases interconnected so that data can be brought together into one place for reporting and analysis.

Whether you are working with a data base or a data warehouse, you should have a basic understanding of how data is stored. It should be in table format, with header columns and data rows.

A good way to quickly assess the analytics culture of a business is to look at how data is shared among management. Does it look table like? Or is it obvious that most of the time spent by the author was put into decorating? If you can’t easy sort something, then you are not dealing with a good data culture.

The best way to have a good data culture is to have well documented data structures. Any dB admin worth a grain of salt has the data hierarchy mapped out and has a knowledge base to help users know what data is in each field.

Like with finding data, being good at storing data starts with knowing the environment. Any good analyst should have a basic understanding of how to use SQL to pull a query for a data table. Even if you cant do hard core coding, know how data is generally stored in a structure is key.

1075177_10151826941667425_1417094118_n

Another important concept about data warehouses if you have to know how to join or blend data from different sources. When you have multiple data tables in a warehouse you often need to join the data on a common field. Data blending goes on step further as you are often trying to take data that doesn’t have a natural point on common that is easy to join on. Advanced data warehouses and data management tools can blend things easily, but its still important to understand the core concepts of how to join and blend data.

As I mentioned in earlier posts, there is now a new concept taking root that one up data warehouses. Data lakes are being used to address the fact that we have more unstructured data then we have structured data. Data bases and data warehouses were designed only to handle structured data the easily fits into a data able.

Now we have to collect data from images, videos, blogs, comments and other places that are not easily converted to a value. Data blending across both traditional structured data warehouses and new types of data is not easily done in most data warehouses so tools are being developed to bridge this gap.

The lake is no longer a place just to fish, but also to do all the other things a lake can be used for.

So, when it comes to understanding data warehouses, learn who built and/or maintains it and buy them a cup of coffee. Get your hands on the data dictionary, knowledge base, FAQ, metadata.. whatever you can to map out the data environment. If you do that then you can find use the big data stored in a data warehouse to find the right data at the right time.

Q8: Here’s something a lot of us are wondering, what exactly is big data?

Think about some of the things you do in your daily life. You get up, you eat, go to work/school, shop, do something for entertainment, bank, go online and do things on social media. Everything you do generates data. That data is captured in countless ways. And then its stored in countless places. And analyzed by countless numbers of people. And then used in countless ways by businesses to market, design, advertise, build, sell, and so on.

Every time you check your phone to see if there are any updates on Facebook you generate a lot of data for your phone manufacturer, your service provider and Facebook itself. Everything you like or comment on can be turned into a data point. The time, place and length of your connection all provide useful data. Get the point? Its endless.

That’s big data.

In general, big data is thought of as all the data businesses capture and store in a database that they can use for business decision-making.

When you think of data collections that have millions and millions of rows of data like big bank transaction data, or traffic data for major cities, or all the statistics captured everyday across professional sports. Way too much for man to analyze without help from technology. That’s all big data.

Every business defines its big data a little differently. There is no one way to look at how best to manage big data because big data is such a living, evolving, never ending flow of information. It’s like lakes of water that are too big to swim across and too deep to dive to the bottom of without help. And no two lakes are alike.

Data analysts and data s2.5.2cientists are the ones who know the lake and guide you across or build you a submarine to explore the bottom.

As I have mentioned in previous posts, knowing the data environment is key to your success. And big data just adds weight to that statement. If you don’t know where all the data is coming from, can’t be sure if its clean, then you will get lost in the deluge of big data.

The Fundamental of Business Analytics – Business Analytics is the application of talent, technology and technique on business data for the purpose of extracting insights and discovering opportunities.

DMAIPH specializes in empowering organizations, schools, and businesses with a mastery of the fundamentals of business analytics. Contact DMAIPH now at analytics@dmaiph.com or connect with me directly to find out how you can strengthen your business analytics fundamentals.

 

 

Q7: What exactly is data science and why the rapid rise of data scientists?

A year ago I might have found it challenging to really answer this question. The first time I had heard of the term data science and a data scientist wasn’t that long ago. And I have been doing some pretty advanced analytics for close to 20 years now.  I know the term has been around in academic and research circles awhile longer, but 2014 is the first time I ever saw a job posting for data scientist in big business.

So what is data science? Besides simply being the study of data, it generally refers to using complex models, machine learning, predictive and prescriptive analytics and powerful technology to analyze business data in much greater volume, velocity and variety then possible a few years ago.

And of course the ones charged with doing the data science are data scientists. They understand math, statistics, and theories that can be applied to business data using new technologies and methodologies.

The biggest challenge to being a true data scientist is that you have to be adapt at both technology and working with people. Being a business data expert, knowing how to code and doing higher math are only half the job. You have to also share your data, communicate it in ways that drive action, share and engage with non-data centric people. It’s hard to find people who are good at both.

ByugG_cIEAAL6wM

Image from Forbes Magazine. 

In addition, whole some data scientists are educated to be data scientists, very, very few actually have any kind of degree in data science. That kind of degree really didn’t exist until very recently. Instead most data scientists have advanced degrees is related subjects and have migrated into the business world do to market demand.

That demand has been growing at a staggering rate the past few years as every day we generate more and more data across the planet. President Obama first employed a data scientist for his campaign in 2012. The White House now has a chief data scientist position.

If you were to compare results from job board searches form 2012, you’d see maybe 100 data scientist job postings. Now its easily in the 1000’s.  So that’s why the job market for data scientist is one of the hottest around.  Lack of training programs, having both tech and people skills, and the booming demand due to unending new data to being analyzed.

Some people ask me if I’m a data scientist I am careful with my answer. True data science is not something I am academically prepared for nor I have never published anything in a scholarly journal. But my real world experience working with data has made me an expert on many aspects of data science.

I guess I feel more like an analyst, but a freakin awesome analyst who can do a lot of things using data that are super important to a business.

img_8168

Analytics Education – Facilitating a mastery of the fundamentals of analytics is what DMAIPH does best. As a key parnter of the Data Science Philippines Meetup Group, DMAIPH champions the use of using data. All across the world, companies are scrambling to hire analytics talent to optimize the big data they have in their businesses.

We can empower students and their instructors with the knowledge they need to prepare for careers in analytics. Contact DMAIPH now at analytics@dmaiph.com or connect with me directly so we can set a guest lecturer date, On-the-Job Training experience or other analytics education solution specifically tailored to your needs.

Q6: Can you provide some tips on how to manage data?

So you have the data lake, the messy version of the lake or data swamp and then the pristine, well managed version of the data lake called the data reservoir.

08-data-reservoir-walter-with-hard-hat

Imagine how a reservoir of fresh water is used for multiple purposes… fishing, drinking, watering crops, providing electricity. That’s how your data should be structured. Even if you are working with multiple data sources made up of a lot of unstructured data from social media, you need to be organized with your data.

I’m willing to bet that if you are reading this then you are by nature pretty organized. Analysts tend to be. If you are working in an data swamp and the company culture is not data-driven, the best advice I can give you, no joke, is to find another job.

What to look for in a data-driven company? Are the data warehouses easy to use? Is their documentation on the data architecture? Is there a knowledge base? Are there experts and are they open to helping you?

If you say yes to questions like that, then your data management tasks are generally about optimization, data blending, adding new sources and being a kick ass analyst.

If you say no to questions like that, then your data management tasks are generally about cleaning data, lots of data validation and having your analysis be filled with caveats that you might be missing something.

So a few tips I have for those in good data companies; get your documentation fresh, do a lot of bread crumb dropping, save your queries and models.

Keep the data architects,database admins and/or IT staff in your circle. Share with them how powerful your analysis is because of their help. And most importantly, show you masterly of the data lake.  Tell your story. And teach others how to fish in it.

For those of you not so blessed with good data cultures. You have to start on both ends. Map out the data flow. Try and assess where the data goes bad. Is it the input or capture of the data, is it a loading process, is it filers? Once you get a start on the front end, then go to the back end.

Who needs the data? How much of what data is being provided now is actually usable? Eliminate any unnecessary data. Basically start cleaning up the swamp at the same time you map it. And again tell this story. Don’t make excuses, but you do need to educate. Let people know there is a problem with the data and outline what you will do to correct for it.

In either case, before you go out and request or purchase new tools or start adding new data… make sure you have the architecture figured out. That’s the best tip I can give you about managing data.

jobspicture2

The Fundamental of Business Analytics – Business Analytics is the application of talent, technology and technique on business data for the purpose of extracting insights and discovering opportunities. DMAIPH specializes in empowering organizations, schools, and businesses with a mastery of the fundamentals of business analytics. Contact DMAIPH now at analytics@dmaiph.com or connect with me directly to find out how you can strengthen your business analytics fundamentals.