What is Search-Driven Analytics?

The dashboard is usually a core feature of a data analytics package, giving users access to insights and data from a variety of sources.

Unfortunately, one of the biggest challenges of using a dashboard is sifting through the available information. This can be especially problematic at businesses that generate hundreds of insights daily. So how exactly does someone get right to what they need without paging through numerous reports or scanning an alphabetized list?

What is Search-Driven Analytics Used For?

An answer that’s growing in popularity is search. This isn’t quite the web search you know from Google, although it operates in much the same way. Instead, we’re talking about a search-driven system that allows users to type everyday sentences and get results from your data.

For example, someone might go into the dashboard and type in “How many customer service calls did we take in 2019?” The backend for the dashboard will understand the query and return a list of results that are ranked from most relevant to least. Within seconds, the user can click on the appropriate analysis product and find the information they need.

How Does Search-Driven Analytics Work?

The engine that drives most search-driven technology is natural language processing (NLP). A technology that has been around since the 1950s, NLP has become vastly more powerful in the last decade due to increases in parallel processing in CPUs and GPUs.

An NLP system typically ingests a corpus of text so the machine can build up a representation of the ideas that underpin each dataset. When a user types in a search, the NLP algorithm compares the query against the scores it holds for each dataset, and the datasets that look most likely to match are returned as results.
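
To make the idea concrete, here’s a minimal sketch of the kind of matching such a backend might perform. It ranks invented dataset descriptions against a query using TF-IDF vectors and cosine similarity; real systems are far more sophisticated, and every name here is hypothetical:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Invented descriptions standing in for a dashboard's datasets.
datasets = {
    "call_volume": "customer service calls taken by month and year",
    "fuel_costs": "monthly fuel costs for the delivery fleet",
    "shipments": "current shipments in transit by origin region",
}

vectorizer = TfidfVectorizer()
matrix = vectorizer.fit_transform(datasets.values())

query = "How many customer service calls did we take in 2019?"
scores = cosine_similarity(vectorizer.transform([query]), matrix)[0]

# Rank datasets from most to least relevant to the query.
for name, score in sorted(zip(datasets, scores), key=lambda kv: -kv[1]):
    print(f"{score:.2f}  {name}")
```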

Especially in settings where organizations have narrow interests, NLP can be very powerful and precise. A logistics firm, for example, might have an NLP setup that can answer questions like:

  • “Where are all current shipments from Asia?”
  • “How long does the average delivery take?”
  • “What were the company’s fuel costs for January?”

These are extremely natural questions for a user to want answered, and the search-driven dashboard can address them with ease in the vast majority of cases.

The Role of Data Visualizations

Another benefit of this approach is that data is usually fed straight into the dashboard in the form of visualizations. If you enter a query to see how many widgets were sold each month, you can click on the right item and a graph of widget sales will appear on the right-hand side. There’s no need to run an analysis engine or load anything into Excel to get a viable work product. It simply appears in a matter of seconds.
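
As a toy illustration of that last step, the backend might hand the query result straight to a charting library. A minimal matplotlib sketch with invented monthly sales figures:

```python
import matplotlib.pyplot as plt

# Invented monthly widget sales, as a dashboard query might return them.
months = ["Jan", "Feb", "Mar", "Apr", "May", "Jun"]
units_sold = [130, 145, 120, 160, 175, 150]

plt.bar(months, units_sold)
plt.title("Widget sales by month")
plt.ylabel("Units sold")
plt.show()
```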

Why Use This Approach?

The beauty of a search-driven system is that it can help users browse through data, create ad-hoc reports and make decisions on the fly. If someone needs to pull up the total for all store inventories of a handful of items within a retail chain, for example, they can type that into the search bar and get an answer in seconds. They can then see the data, produce any necessary reports or paperwork and move forward with their task of refilling inventory levels.

Notably, this approach makes it much easier for less tech-savvy individuals to follow the data. In turn, that frees software engineers and data scientists within your organization to focus on building more robust systems, working with data and fixing advanced problems.

Conclusion

In the modern data-driven culture, a lot is made of onboarding people. Just 5 or 10 years ago, that often meant excluding those who lacked the technical expertise to function in a data-driven business setting. 

The increasing ease of access to data due to tools like search-driven analytics makes it possible to bring more people on board. Likewise, it allows users to get their answers quickly rather than trying to navigate through complex interfaces. Search-driven analytics allows organizations to be more efficient and effective in leveraging the data they have access to.


5 Strategies to Inspire Your Next Big Data Project

Having big data capacities at a business or an organization isn’t an end in itself. The goal is to produce projects that generate value for the organization, stakeholders, and ultimately the end customer.

At some point, however, everyone struggles to come up with their next move and with new projects to execute. Let’s explore 5 ways you can find inspiration for your next data project.

Look at What Competitors Are Doing

While you may want to run some things past legal before putting the results of your efforts into production, looking at the ideas coming out of your competitors’ Big Data divisions is worthwhile. In addition to considering their work product, think especially about how they’ve accomplished certain goals. Sorting through the possibilities of how they got a particular result may inspire you. Likewise, you might spot somewhere they went wrong or an opportunity to improve upon their analysis.

Cast a wide net when you’re looking for projects that competitors have done. Look for their:

  • Blog articles
  • Print publications
  • Research papers
  • Social media feeds
  • GitHub repositories
  • LinkedIn profiles
  • White papers
  • Industry reports

Revisit Existing Projects

There are many reasons to consider revisiting an existing project. NASA, for example, has been sorting through data that was gathered by the Voyager space probes in the 1980s to take advantage of technologies that didn’t exist at the time. You might find that advances in multicore processing power now make it possible to throw vastly more CPU and GPU cycles at a problem than you could have ever imagined five years ago.

Additionally, you may have access to updated data. Someone working in the financial sector, for example, would probably like to revisit some of their projects in light of the 2020 stock market crash. There are often interesting opportunities to compare and contrast projections you made in the past against real-world outcomes. Focus on what you can learn as opposed to getting upset about what you might have missed.

Progress in Equipment

New equipment can be a game-changer as well. Single-board computers, for example, are more readily available, powerful, and cost-effective than they were a few years ago. If a project could benefit from the deployment of IoT sensors, this might be the time to explore it.

Big Data work in agriculture is rapidly becoming dependent on IoT devices. There’s a lot to be said for dropping a few hundred sensors across several square miles to monitor soil chemistry, moisture, temperatures, and weather. What once would have been an unthinkably expensive operation that would require massive technical expertise can now be managed by a farmer with a laptop.

Sift Through Data Sources

You might not see an idea that deserves study until you swim right by it. Look around at the sites that cater to data enthusiasts, such as Kaggle and Data.gov. You might end up finding a dataset that sends your mind racing, and pretty soon you’ll be able to draw a line from the questions the data raises to how you can go about answering them.

Talk with Folks Who Know Nothing About Big Data

Living inside a data-centric bubble has its perks, but it can lead to tunnel vision. When you converse with people who aren’t immersed in the data world, listen to the problems they express interest in or frustration with. There aren’t many human endeavors that wouldn’t benefit from better quality data. Doctors, artists, athletes, engineers, and many others all have puzzles they wish someone would solve.

Keep a notepad and pen on you at all times so you can scribble down ideas when you encounter them in the wild. If you don’t have the person’s contact information, ask for it so you can do a follow-up, if necessary.

Conclusion

People often assume that inspiration just falls from the sky. It doesn’t; it demands ample thought and focus. Creators and innovators rely on processes like these, and putting these strategies to work can help you find that next big idea for your big data project.


How to Leverage Your CRM with Big Data & BI Tools

Customer relationship management (CRM) systems are widely used in many businesses and organizations today. While it’s great to have all your customer information compiled and accessible in one source, you may not be maximizing the value of your CRM. In particular, big data, business intelligence (BI) and analytics packages offer an opportunity to take your customer data to the next level. Let’s take a look at how you can achieve this with your customer data.

How to Leverage CRM as a Source for Analysis

Most CRM systems operate on top of databases that have the necessary capabilities to feed data into analytics and BI systems. While skilled database programmers can get a lot of mileage out of writing complex queries, most of the data can simply be pulled into the big data pipeline by way of database connectors. These are small application components used to talk with database systems like MySQL, MongoDB, FoxPro, and MSSQL.
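
As a rough sketch of that pattern, the snippet below pulls CRM records into a pandas DataFrame through a SQLAlchemy connector. The connection string, table, and column names are all hypothetical:

```python
import pandas as pd
from sqlalchemy import create_engine

# Hypothetical connection string for a MySQL-backed CRM.
engine = create_engine("mysql+pymysql://user:password@crm-db.example.com/crm")

# Pull active customers into a DataFrame ready for the analytics pipeline.
customers = pd.read_sql(
    """
    SELECT customer_id, signup_date, last_purchase_date, lifetime_value
    FROM customers
    WHERE status = 'active'
    """,
    engine,
)

print(customers.describe())
```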

Once you’ve pulled the data into your analysis engine, a host of functions can be used to study it. For example, a company might use the information from their CRM to:

  • Perform a time-series analysis of customer performance
  • Analyze which appeals from email marketing content seem to drive the greatest returns
  • Determine which customers are at risk of moving on to competitors
  • Find appeals that reinforce customer loyalty
  • Spot customer service failures
  • Analyze social media postings by customers to assess their experience with the company

What Methods Are Used?

Suppose your business wants to determine which email campaign appeals are worth reusing. Working from copies of email content, you can conduct a word cloud analysis that shows which concepts were strongly featured. Response data from the CRM can then be used to identify which words and phrases have performed best. 

These items can then be organized into a BI dashboard widget that tells those writing the emails how to structure their content. For example, a market research firm might find that case studies drive more users into the marketing funnel than news about the practice. Marketers can then write new campaign emails based on that data. Almost as important, they can also assess email performance and refine their approach until the material is exemplary.
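
Here’s a minimal sketch of how word frequencies and CRM response data might be tied together, with invented subject lines and rates; a production word cloud analysis would be considerably richer:

```python
from collections import Counter
import re

# Invented email subject lines paired with response rates from the CRM.
emails = [
    ("New case study: how Acme cut costs 20%", 0.12),
    ("Practice news and updates for March", 0.03),
    ("Case study: faster onboarding in 30 days", 0.10),
]

# Tally total response and appearance counts for each word.
totals, counts = Counter(), Counter()
for text, rate in emails:
    for word in set(re.findall(r"[a-z]+", text.lower())):
        totals[word] += rate
        counts[word] += 1

# Average response rate for words appearing in more than one email.
avg = {w: totals[w] / counts[w] for w in counts if counts[w] > 1}
for word, rate in sorted(avg.items(), key=lambda kv: -kv[1]):
    print(f"{word:12s} {rate:.2%}")
```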

Techniques that are ideal for projects like this include:

  • Sentiment analysis
  • Marketing data analysis
  • Pattern recognition
  • Word cloud analysis

Such analysis will also require a constant stream of data going from the CRM into the analytics engine and onward to the BI dashboards. Done right, this sort of big data program can convert something you’re already accumulating, such as customer relationship data, into insights that drive decision-making at all levels.

How Much Data is There?

Data for analysis can come from a slew of sources, and it’s important to have a CRM that allows you to access as many potential data sources as possible. For example, a company shouldn’t draw the line at collecting email addresses. You can also ask customers to include their social media accounts, such as handles from Twitter, LinkedIn, and Instagram.

Server logs should also be mined for interesting data points. You can, for example, study IP addresses and user logins to determine where a prospective customer might be in the marketing funnel. If you see that a lot of leads are dropping out at a certain stage, such as after signing up to receive your email newsletter, you can then start to analyze what’s misfiring at this stage in the process. 
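
A minimal sketch of that kind of funnel analysis, using pandas and an invented log extract:

```python
import pandas as pd

# Invented extract of server-log events: one row per lead per stage reached.
events = pd.DataFrame({
    "lead_id": [1, 1, 1, 2, 2, 3, 3, 3, 4],
    "stage": ["visit", "signup", "demo",
              "visit", "signup",
              "visit", "signup", "demo",
              "visit"],
})

order = ["visit", "signup", "demo"]
reached = events.groupby("stage")["lead_id"].nunique().reindex(order)

# Conversion from each stage to the next; a sharp drop flags where leads stall.
conversion = reached / reached.shift(1)
print(pd.DataFrame({"leads": reached, "conversion": conversion}))
```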

Once the problem is corrected, you can even use the CRM data to identify which customers you should reconnect with or retarget. You might, for example, send an offer for discounted goods or services to increase your customer lifetime value.

Conclusion

At many businesses, the CRM system is a highly underutilized resource. By coupling it with big data and an effective BI package, you can quickly turn it into a sales-driving machine. Team members will be excited to see the new marketing and sales tools at their disposal, and customers will value the increased engagement.


Shedding Light on the Value in Dark Data

Hearing that your organization has dark data can make the data sound ominous and menacing. Saying that data is dark, however, is closer to what people mean when they say a room is dark: there’s the potential for someone to switch the light on and make what was unseen visible.

What is Dark Data?

Every operation in the world produces data, and most of those entities record at least some of it regardless of whether they make further use of it. For example, many businesses collect information about sales, inventories, losses, and profits just to satisfy the basic reporting requirements for taxes and how their companies are set up. You might also have a complete customer service department that’s producing data all the time through daily chats, emails, and many other forms of communication. Even maintaining a social media presence means creating data.

Such data is considered dark if it isn’t put to other uses. Shining a light on dark data can allow a company to:

  • Conduct analysis
  • Create sellable data products
  • Learn about relationships
  • Supply insights to decision-makers

By definition, dark data is an unutilized resource. Owning dark data is like keeping things in storage that never or rarely get used. In other words, if you tolerate the existence of dark data within your organization, you’re at risk of leaving money on the table. In fact, you may be taking a loss on dark data because you’re paying to store it without monetizing it.

How to Bring Dark Data into the Light

The first order of business is figuring out exactly what your organization has in the way of data sources. Some things will be fairly obvious, such as sales data from a point-of-sale (POS) system and inventory numbers from an inventory control system (ICS). Other data sources may be trickier to find, but they can be discovered by:

  • Surveying your team members to learn what data different departments collect
  • Conducting audits of computing systems to identify databases, log files, and spreadsheets (see the sketch after this list)
  • Scanning through social media feeds, including direct messages from customers
  • Collecting corporate data, such as financial statements and email correspondence
  • Studying call records
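
For the audit step, even a simple file-system scan can surface forgotten spreadsheets, logs, and databases. A minimal sketch, with a hypothetical shared-drive path:

```python
from pathlib import Path

# File types that commonly hold overlooked data.
DATA_SUFFIXES = {".csv", ".xlsx", ".log", ".json", ".sqlite", ".db"}

def inventory(root):
    """Walk a directory tree and group data-like files by suffix."""
    found = {}
    for path in Path(root).rglob("*"):
        if path.is_file() and path.suffix.lower() in DATA_SUFFIXES:
            found.setdefault(path.suffix.lower(), []).append(path)
    return found

# "/mnt/shared" is a placeholder; point this at a real shared drive.
for suffix, files in inventory("/mnt/shared").items():
    print(f"{suffix}: {len(files)} files, e.g. {files[0]}")
```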

It’s also wise to think about places where you could be collecting more data. For example, a customer service system that isn’t sending out surveys is letting a perfectly good opportunity go to waste.

What to Do Now That You See the Data

The second order of business is figuring out how to draw more insights from your data. Companies accomplish this by:

  • Creating data lakes and providing access to them
  • Auditing databases for potentially useful information
  • Implementing or expanding data science projects
  • Developing data-centric corporate cultural practices
  • Adding resources to do machine learning and stats work
  • Hiring new professionals who can dig into the data

Much of this hinges on moving forward with a data-centric culture. Even if you already feel you have one, there’s a lot to be said for looking at who your team members are and how experienced data users among them can put dark data to work.

The third order of business is establishing goals for your projects. If you run a company that has potential legal risk exposure due to compliance problems involving laws like HIPAA and the GDPR, for example, you might analyze the very way your organization stores information. A company that collects huge amounts of anonymous data from millions of users might figure out how to package that data into sellable information products, such as reports or datasets. You may even cut costs by determining what data is useless, potentially removing terabytes of information from storage.

Conclusion

Modern organizations collect so much data that it’s hard for them to clearly imagine what they have at their disposal. It’s important to take an extensive look at all the ways dark data may be residing in your systems. By being a bit more aggressive and imaginative, you can find ways to improve processes, cut costs, and even drive profits.


New Questions? How to Find Answers in the Face of Uncertainty

Recent world events have exposed a wide range of issues in terms of how companies implement processes and use data to make decisions. It’s abundantly clear that many enterprises came into the new year simply unprepared to make the sorts of decisions needed to persevere through difficult times, especially when the situation involves highly unexpected events.

If you’ve found yourself trying to figure out answers to questions you’ve never even pondered, you may be wondering what tools could help you get ahead of these circumstances. There’s a strong argument that having a data warehouse in place can make a major difference as organizations struggle to make these sorts of novel decisions. Whether you already have a big data warehousing system in place or are just now realizing the importance of one, it’s wise to think about precisely how prepared or unprepared you might be. Let’s look at how a well-implemented data warehouse operation can help you get answers quickly in a rapidly evolving situation.

What is a Data Warehouse? 

The difference between a massive collection of data and a data warehouse is that warehousing is designed to enable long-term use of data. A warehouse aggregates massive amounts of structured data from many sources, and it’s designed to enable analysis and reporting. While a database can answer queries, the information in a data warehouse can be used to find relationships between different data points.

How a Solid Data Warehouse Can Help Decision-Making

Suppose you’re a producer of paper products based in California at the beginning of 2020. By the end of January, sources of wood pulp from China have dried up and competition for American sources has become fierce. You want to start looking for suppliers in Latin America, but you don’t know where to begin.

A good database might hold pieces of that information, but it can’t tie them together the way a good data warehouse can. For a data warehouse to contribute in a situation like this one, it needs to be able to tie together data about suppliers, specific pricing, and the shipping costs of getting materials to California.

You don’t want to have to research all of this due to the time and money you will be sacrificing in the process. Instead, a well-prepared data warehouse should be constantly digesting the necessary data to give you actionable information at the touch of your company’s analytics dashboard. In fact, a top-quality system with predictive and prescriptive analytics should help you compare and contrast the available options in an instant. You might never have talked with many suppliers in new territories before, but your data warehouse is going to give you a diverse range of commodity pricing and shipping rates from the region so you can start having that conversation today rather than next week.
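
As a toy version of that comparison, the sketch below ranks hypothetical Latin American suppliers by landed cost, the way a warehouse-fed dashboard might; every name and figure is invented:

```python
import pandas as pd

# Invented supplier data, as a warehouse query might return it:
# commodity pricing joined with shipping rates to California.
suppliers = pd.DataFrame({
    "supplier": ["BrasilPulp", "ChileFibra", "MexPaper"],
    "price_per_ton": [580.0, 610.0, 595.0],
    "ship_per_ton": [95.0, 70.0, 40.0],
    "lead_days": [24, 19, 9],
})

# Landed cost is the number decision-makers actually compare.
suppliers["landed_cost"] = suppliers["price_per_ton"] + suppliers["ship_per_ton"]
print(suppliers.sort_values("landed_cost"))
```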

Structured Data and Being Prepared

You’ll note that the example depends heavily on having large amounts of structured data available right away. A big part of being prepared for unforeseen events is investing early in the necessary infrastructure and data sources. You’re going to want to make serious investments in:

  • Servers
  • Cloud computing
  • Databases and redundant storage media
  • High-speed networking, including fiber optic cabling
  • Computational resources, such as GPUs and machine learning programs
  • Appropriate sources of structured data, including subscriptions to service providers that cater to your industry

It also means having highly failure-tolerant code constantly churning the inputs to ensure analysis can be run at a moment’s notice. Likewise, actionable data is likely to require security, as it has the potential to become a trade secret you’ll want to defend. Finally, you’ll need people who can understand the intricacies of data science and decision-makers who’ve been fully onboarded with using analytics insights on a daily basis.

Conclusion

Becoming a highly prepared, data-centric business is a bit like quitting smoking: there’s no better time to start than today. Regardless of where you are in the process, it’s a good idea to think about how prepared your operation is on the data side to respond to unexpected crises. Even if you feel thoroughly prepared at this very moment, a culture of constant improvement and preparedness will have your data warehouse ready to answer the next big round of questions you’ve never asked before.


Top 5 Big Data Myths Debunked

Thanks to all the buzz around the term Big Data, numerous outlooks, and subsequent myths, have grown up around the topic. While Big Data is an amazing tool with numerous applications, it’s far from a magical fix for every analytics need. Let’s explore five of the top Big Data myths and the truth behind them.

Myth #1: Machines Are Always Better Than People

To be clear, computers have two core advantages compared to individuals. First, a computer has the advantage of raw speed, being able to sort through more numbers in a few minutes than a person could in their entire lifetime. Second, a computer can quickly detect patterns and execute formulas across seas of data.

People tend to adapt to situations with limited information much better than machines do. For example, all self-driving car systems on the market still require significant human intervention in a wide range of situations. This is partly due to the myriad situations where unusual or downright novel things occur during any vehicle trip. Put simply, Big Data has to be paired with human decision-making in order to ultimately be effective.

Myth #2: Big Data is the Solution to Everything

It’s common for trends to spread far beyond their proven applications. After all, everybody wants to be the Uber of something.

The same issue applies in the world of Big Data. Some problems, however, just don’t lend themselves to mass computation efforts. This can arise due to things that machines struggle to identify, such as:

  • Limited available data
  • Biases built into a dataset
  • Inapplicable information
  • Flawed data

When a Big Data system is asked to analyze a problem, it doesn’t stop to ask questions about it. If the job is computable, the machine will accept it.

Myth #3: Big Data Is Too Expensive

The word “big” creates an unfair perception that nothing short of a cluster of supercomputers, each loaded with 8 high-end GPUs, is worth the bother. Nothing could be further from the truth.

Analytics packages have become very accessible, and many can be run right on a typical multicore desktop, laptop or even phone. In fact, many companies have been working hard to deploy cost-effective data processing software which can be used for a variety of data projects. This means that Big Data systems can be deployed in small settings, providing access to IoT devices, machine learning, AI, and dashboards nearly anywhere and at a fair per-unit price.

Myth #4: There’s No Use for Big Data Outside of IT

The notion of Big Data being about computers is a bit like thinking that carpentry is about nails and hammers. With Big Data, the goal is to create insights that can be applied in a variety of fields. Much as you would use a hammer and nail to build a house, you can use Big Data to build insights that will drive actions and informed business decisions.

The retail sector was an early adopter of Big Data, having recognized that it missed the train on the dot-com boom. Companies in retail use big data to collect and process information like:

  • Social media sentiments
  • Inventory numbers & forecasting
  • Trends in customer tastes
  • Buying processes
  • Global supply chains

If Big Data can revolutionize the way people buy apparel and shoes, it can do a lot of good in many other sectors as well.

Myth #5: Big Data isn’t for the Little Guy

Small operations have some major advantages when it comes to Big Data. Major players often face the challenge of turning an ocean liner: igniting change within any massive corporation is difficult. Conversely, the agility afforded to many small companies allows them to gather insights and react to them in a matter of weeks. Paired with the cost-effectiveness of modern Big Data systems, this presents an advantage that small businesses can rapidly leverage.


Big Data vs. Small Data: Does Size Matter?

After years of talk about big data, hearing about small data can feel like a major pivot by the manufacturers of buzzwords. Small data, however, represents its own revolution in how information is collected, analyzed and used. It can be helpful, though, to get a handle on the similarities and differences between big and small data. Likewise, you should consider how the size of the data in a project impacts the project as a whole and what other aspects are worth looking at.

How Small and Big Data Are Similar and Different

Both are typically the products of systems that extract information from available sources to conduct analysis and derive insights. At the big end of the scale, the goal is to filter through massive amounts of information to identify things like trends, undiscovered patterns and other bits of knowledge that may occur at scales that are hard for an individual analyst to easily identify. Moving to the small end of the scale, you tend to get into more granular data that will often be more digestible for one person.

Macro trends tend to be big data. If you’re trying to figure out how the bond spread relates to shifts in banking stocks, for example, you’re probably working on the big end of the scale.

Small data is granular, and it may or may not be the product of drilling down into big data. For example, a company trying to target social media influencers likely isn’t looking to just turn up numbers. Instead, they want to have a list of names that they can connect with to put a marketing campaign into action.

Another feature of small data is that it’s often most prominent at either end of the analysis cycle. When individual user information goes into a database, for example, that’s all small data. Similarly, targeted insights, such as the previously mentioned social media marketing plan, represent potential applications.

Small data is also frequently more accessible to individual customers. It’s hard to tell an e-commerce customer why they should care about macro trends, even if you’re looking at what’s going to be cool next season. Conversely, if you can identify an interest and send a coupon code, they can put small data to use right away.

Does Size Matter?

The best way to think about this question is to consider the importance of using the right tool for the job. When sending coupon codes, for example, small data is a great tool because you can tailor each offer to an individual, a peer group or an identifiable demographic. As noted, this can get very granular, such as hyper-localizing a push notification so a targeted offer only goes out when the customer is near a physical store location.
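
A minimal sketch of that kind of geofenced trigger, using the haversine great-circle distance and invented coordinates:

```python
import math

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two points, in kilometers."""
    r = 6371.0  # mean Earth radius, km
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dp = math.radians(lat2 - lat1)
    dl = math.radians(lon2 - lon1)
    a = math.sin(dp / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dl / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))

store = (27.95, -82.46)     # invented store location
customer = (27.96, -82.47)  # customer's last reported position

# Fire the offer only inside a 1 km radius of the store.
if haversine_km(*store, *customer) < 1.0:
    print("Send the targeted push notification")
```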

Small data can be the wrong tool for many jobs, too. An NHL goaltender may need to see 2,000 shots before they can be fully assessed, for example. Thinking too much about a single good or bad season can skew the assessment significantly. A seemingly small data issue, player evaluation, calls for a big data mentality.

Other Factors to Look At

A good way to think about the other factors in assessing big versus small is to use the three V’s. These are:

  • Volume
  • Variety
  • Velocity

Volume speaks to the question of how much data there is. While there’s a temptation to always feed a model more data, there’s an argument on the small data end that consumable metrics are better. In other words, if you feel like a problem demands volume, it’s likely a big data task. Otherwise, it’s probably a small data issue.

Variety also indicates whether big or small data is the right way to go. If you need to drill down to a handful of metrics, small data is invaluable. If you need to look at many different data points, it may be a job for big data.

Velocity matters because data tends to come in waves. This gets a little trickier because both small and big data needs can require constant refreshes. Generally, if you’re looking to accumulate, it’s big data. If you’re trying to stay up to date, it’s small data.


The 7 Most Common Data Analysis Mistakes to Avoid

When performing data analysis, it’s easy to slide into a few traps. Diligence is essential, and it’s wise to keep an eye out for the following 7 potential mistakes:

  • Sampling bias
  • Cherry-picking
  • Disclosing metrics
  • Overfitting
  • Focusing only on the numbers
  • Solution bias
  • Communicating poorly

Let’s take a look at why each one can be problematic and how you might be able to avoid these issues.

The Why

Sampling bias occurs when a non-representative sample is used. For example, a political campaign might sample 1,300 voters only to find out that one political party’s members are dramatically overrepresented in the pool. Sampling bias should be avoided because it can weigh the analysis too far in one particular direction.
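
One way to catch this early is a goodness-of-fit check of the sample’s composition against known population shares. A minimal sketch with invented numbers, using a chi-square test:

```python
from scipy.stats import chisquare

# Invented poll of 1,300 voters versus known registration shares.
observed = [740, 460, 100]             # sampled: party A, party B, independent
population_share = [0.46, 0.42, 0.12]  # shares from registration rolls
expected = [1300 * s for s in population_share]

stat, p = chisquare(observed, f_exp=expected)
print(f"chi-square = {stat:.1f}, p = {p:.4g}")
# A tiny p-value means the sample's composition differs markedly from the
# population, flagging likely sampling bias before any analysis runs.
```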

Cherry-picking happens when data is stacked to support a particular hypothesis. It’s one of the more intentional problems that appear on this list because there’s always a temptation to give the analysis a nudge in the “right” direction. Not only is cherry-picking unethical, but it may have more serious consequences in fields like public policy, engineering, and health.

Disclosing metrics is a problem because a metric becomes useless once subjects know its value. This ends up creating problems like the habit in the education field of teaching to what’s on standardized tests. A similar problem occurred in the early days of internet search when websites started flooding their content with keywords to game the way pages were ranked.

Overfitting tends to happen during the analysis process. Someone might have a model, for example, whose fitted curve seems predictive. Unfortunately, the curve only looks predictive because the model was tuned to the data it was trained on. The failure of the model becomes apparent when it’s compared against future observations that aren’t so well-fitted.
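
A quick demonstration of the effect: fit a modest and an overly flexible polynomial to the same noisy linear data, then score both on held-out points. The data here is synthetic:

```python
import numpy as np

rng = np.random.default_rng(0)

# Noisy observations generated from a simple linear trend.
x = np.linspace(0, 1, 20)
y = 2 * x + rng.normal(scale=0.2, size=x.size)
x_train, y_train = x[:15], y[:15]  # fit on the early points
x_test, y_test = x[15:], y[15:]    # hold out the later points

for degree in (1, 9):
    coeffs = np.polyfit(x_train, y_train, degree)
    train_mse = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    test_mse = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)
    print(f"degree {degree}: train MSE {train_mse:.4f}, test MSE {test_mse:.4f}")
# The degree-9 fit hugs the training points but misses the held-out
# data badly -- the signature of overfitting.
```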

Focusing only on the numbers is worrisome because it can have adverse real-world consequences. For example, existing social biases can be fed into models. A company handling lending might produce a model that induces geographic bias by using data derived from biased sources. The numbers may look clean and neat, but the underlying biases can be socially and economically turbulent.

Solution bias can be thought of as the gentler cousin of cherry-picking. With solution bias, a solution might be so cool, interesting or elegant that it’s hard not to fall in love with. Unfortunately, the solution might be wrong, and appropriate levels of scientific and mathematical rigor might not be applied because refuting the solution would just seem disheartening.

Communicating poorly is more problematic than you might expect. Producing analysis is one thing, but conveying findings in an accessible manner to people who didn’t participate in the project is critical. Data scientists need to be comfortable with producing elegant and engaging dashboards, charts and other work products to ensure their findings are well-communicated.

How to Avoid These Problems

Process and diligence are your primary weapons in combating mistakes in data analysis. First, you must have a process in place that emphasizes the importance of getting things right. When you’re creating a data science experiment, there need to be checks in place that will force you to stop and consider things like:

  • Where is the data coming from?
  • Are there known biases in the data?
  • Can you screen the data for problems?
  • Who is checking everybody’s work?
  • When will results be re-analyzed to verify integrity?
  • Are there ethical, social, economic or moral implications that need to be examined more closely before starting?

Diligence is also essential. You should be asking whether:

  • You have a large and representative enough sample to work with
  • There are more rigorous ways to conduct the analysis
  • Analysts are following properly outlined procedures

Tackling a data science project requires ample planning. You also have to consider ways to refine your work and keep improving your processes over time. It takes commitment, but a group with the right culture can do a better job of steering clear of avoidable mistakes.


Costs of the Data Literacy Divide

Data literacy is quickly becoming one of the most crucial skills in a world that’s increasingly dominated by stats, business processes, machine learning, computing, and AI. Unfortunately, a massive skills gap has developed, producing adverse effects within every company that uses data in its operations. 

Issues of data literacy reach all the way up to the C-suites of multinational corporations, with one study showing that just 24% of business decision-makers have the necessary skills. Worse, there appears to be no generational benefit for folks who grew up in the data age, with digital natives posting a 22% rate of acceptable data literacy. The U.S. lags Europe, and worldwide the issue is problematic in even the best of environments.

Only 17% of businesses report openly encouraging data literacy training, even though employees have almost uniformly expressed a desire for it. Likewise, only 36% of the same businesses report providing any incentives, such as higher salaries, for employees to upgrade their skills.

Defining Data Literacy

Among the trickier issues is defining what data literacy means. It’s important to distinguish data literacy from digital literacy. Many people handle basic digital skills well, but they commonly lack grounding in reading and comprehending data. There’s also a heavy dependency on machines to get the answers right, often without an appreciation of the implications of that attitude.

Data literacy in business predominantly consists of three main areas. These are:

  • Data science
  • Programming
  • Analytics

Each of these fields is underpinned by the discipline of statistics. That means people reading data, for example, need to be able to understand and apply concepts like:

  • Sample size
  • Regression
  • Correlation
  • Bias

They also need to be familiar with issues such as data preparation, analysis, archiving and presentation. Even professionals who are highly skilled at preparing and analyzing data, for example, may lack skills on the visualization side of the equation. It’s important to understand how particular presentations of data may aid or hurt comprehension, especially when the audience is being introduced to the information for the first time. As nit-picky as it sounds, the precise choice of one style of chart versus another imposes biases and creates possible comprehension issues.

What’s the Cost?

Unsurprisingly, all this comes at a cost. An estimate from the Data Literacy Project indicates that for a company valued at over $10 billion, the data literacy gap may cause economic damage on the order of 5% of the firm’s market cap. That represents more than $500 million worth of value. Despite these severe numbers, only 8% of businesses reported making significant changes to how they approach their data issues.

In addition to the immediate costs, there are also opportunity costs, which are much harder to estimate. The absence of data-driven decision-making at many companies means those organizations are falling behind more progressive competitors in fostering data cultures. It’s not uncommon for businesses to lack data management titles such as Chief Data Officer, which signifies an absence of governance over many organizations’ data strategies. It also highlights that these organizations aren’t preparing employees to assume data executive roles 10 and 20 years from now.

How to Close the Gap

There are two areas where change can immediately occur. These are:

  • Training
  • Providing incentives

Solving these two issues is well within the means of modern companies. As previously stated, the majority of employees at companies are eager to join a data literacy effort through training. Likewise, providing incentives is not only logical in terms of encouraging training, but it becomes a retention issue. After all, you don’t want to spend money training people who end up at other organizations solely based on better pay.

At a larger scale, the adoption of a data-centric culture is critical. Data can’t just be a work product that’s presented to the higher-ups. From the C suites on down, every company needs to understand how to read data, interpret it and talk about its implications. With a focused effort, your organization can become one that starts taking advantage of new opportunities.


5 Top Strategies for Building a Data-Driven Culture

For many businesses and organizations, moving toward a data-driven culture is essential to their survival. These sorts of vague exhortations, though, don’t do a great job of setting an organization on the path to becoming a data-driven culture. If you want to make data a focus of your operation, follow these 5 strategies.

1. Understand Why You Want to be Data-Centric

Before you can execute other strategies, it’s critical to make sense of where data fits into your organization’s goals and why you’re heading in this direction. For example, there’s a huge difference in mentality between trying to catch up with competitors and helping your business take advantage of opportunities. You can’t develop a data-driven culture just because it’s the most recent trend.

Nail down the basic opportunities in your industry and at your organization. Think about how customers might benefit from dealing with a more data-centric business. A clothing retailer, for example, might determine that it wants to be more data-centric because it needs to:

  • More accurately track and predict trends
  • Streamline its inventory and purchasing processes
  • Identify non-recurring customers and how to increase retention rates

Use this list of whys to guide later strategic efforts.

2. Determine Who Must Be Onboarded

As much as companies talk about bringing everyone onboard with a data-centric mentality, the reality is that the cleaning staff probably doesn’t need training sessions to get them on board. Think about who would be the target of a serious quip like, “Don’t you know we’re a data-centric business?” If a person is on that list, they need to be brought on board.

Be aware that executives are especially important in forming the new culture of data in a company. If the people at the top don’t understand why they’re suddenly being overwhelmed with dashboards, charts, and analysis, you’re going to have a hard time getting others to participate.

Also, be prepared to sever ties with people who won’t or can’t get on board. It needs to be clear to employees that the organization’s future success depends on continuously strengthening the data culture. That applies even if it means accepting the short-term costs of lengthy hiring processes; losing people who aren’t on board and retaining those who are is critical to cultivating a data culture.

3. Form a Democratic Attitude Toward Data Access

Departments often hold onto data pools for a variety of reasons, including:

  • Unawareness of the value of their data to others
  • Interdepartmental rivalries
  • Poor organizational practices
  • Lack of social and computer networking to other departments

A data lake that every authorized party in the company has access to can foster innovation. Someone in marketing, for example, might be able to discover trends by looking at data from the inventory side of the operation.

To be clear, there’s a difference between being democratic and anarchistic. Access control is essential, especially for data that is sensitive for compliance, trade secrecy and privacy reasons. Good admins will help you ensure that all parties have appropriate levels of access.

4. Know What Infrastructure Must Be Built Out

A data-driven culture marches on a road paved with cabling, servers and analytics software. If your company hasn’t upgraded networking in over a decade, for example, you may want to look into having the work done to speed up access. Similarly, you’ll have to make decisions about building servers onsite versus using cloud-based solutions, adopting specific software stacks and choosing particular team processes.

5. Learn How to Measure Performance

Lots of great insights come from projects that don’t necessarily put money in the company’s bank accounts on day one. On the other hand, it’s easy to let employees foster pet projects in their own fiefdoms without much supervision if you turn them loose with resources.

The solution is to implement meaningful measures of performance. Promotions and raises need to be tied to turning projects into successes for the whole company. While people need room to learn, they also need encouragement to work efficiently and move on to exploring additional ideas.

Establish the metrics that matter for your data-driven cultural revolution. As the effort moves forward, look at the data to see how well the push is succeeding. Be prepared to revise metrics as conditions change, too. By following the data to its logical conclusions, you’ll find a host of new opportunities waiting to be capitalized on.
