Rethink Time and Data in Your Organization - InformationWeek

InformationWeek is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them.Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.

IoT
IoT
Data Management // Big Data Analytics
Commentary
9/24/2019
08:00 AM
Pierre DeBois
Pierre DeBois
Commentary
100%
0%

Rethink Time and Data in Your Organization

Time series is a standard analysis, but advanced machine learning tools introduce statistical techniques for more accurate forecast models.

Image: Icedmocha - stock.adobe.com
Image: Icedmocha - stock.adobe.com

Unlike the wishes Cher sings in her famous song, you cannot turn back time. But with the tools that have become available, you have a better chance of predicting time or, more accurately, predicting if occurrences in a time series sample will continue a decision-influencing trend.

Facebook Prophet and TensorFlow, issued by Google, are two machine learning protocols aimed at enticing developers to create exciting data science applications. Technology and analytics managers should view these tools as ways to expand their DataOps capabilities and expand their initial steps into machine learning.

Created by the Facebook core data science team, Facebook Prophet provides a reliable time series forecast where processing capacity is an issue. Prophet is based on an additive model to address how non-linear trends fit with yearly, weekly, and daily seasonality. The framework aids businesses when data contains periodic trends, such as retail holidays or the discovery that a sudden event impacted a trend. R programming and Python versions were launched a year ago, so businesses can leverage open source resources to create models. The source code and examples are available on GitHub.

I have previously reported on TensorFlow -- you can read about it here. The neural network framework also offers an additional suite of probability models; in R the models are called as a separate library. This allows for more advanced statistical models to be built into the model easily. In the case of time series, users can apply a Bayesian structural time series. A Bayesian structural time series is a set of probability models that includes and generalizes many standard time-series modeling concepts. Its purpose is to highlight statistical details for more accurate comparisons between time series data of current and previous periods. The TensorFlow probability library allows a model to incorporate the Bayesian Structural Time Series.

Why so much interest in time series reporting? If you stop and think about it, time series reports are as common as an Excel spreadsheet, since many tools display time series data. Just take one look at a web analytics solutions or social media analytics report.

But the visualizations of time series data in those solutions are not really designed with statistical analysis in mind.

For example, a web analytics solution like Google Analytics can provide time series results for referral traffic, and the results can permit decisions on which sources are consistently sending traffic to a website. But suppose you needed to predict how sustainable a trend for a given referral source can be? The slope of a trendline may not be immediately discernable from a flatline if the length of time is long enough. I speak from experience: Years ago I needed two and half days to determine the top conversion sources of my first client’s search traffic, because the visitor volume grew with a slow-developing growth pattern.

With today’s data sources, it is also likely that the frequency pattern of a given time series is not linear. This means observations would display successive increases and decreases in a logarithmic or curvilinear pattern. A tool with statistical capability would detect these nuanced trends far better than a standard solution would. Finance professionals who conduct stock market predictions know the value of better statistical capability well. They use advanced tools to create accurate time series predictions because noise and volatility in the data obscures the trend.

The latest tools make that needed statistical capability possible, speeding up analysis that creates meaningful decisions. Random noise in the data can be filtered out as well. The ability to separate data into components is why tools like Prophet and TensorFlow are so valuable. But advanced analysis can also be done in other dashboards such as Tableau, or they can be created as a visualization model in Python or R programming, as Prophet has provided.

Time series is a simple analysis that sometimes contains complex statistical nuances. Examining those nuances can reveal the right details quickly, helping a team make data-influenced decisions faster and better.

Pierre DeBois is the founder of Zimana, a small business analytics consultancy that reviews data from Web analytics and social media dashboard solutions, then provides recommendations and Web development action that improves marketing strategy and business profitability. He ... View Full Bio
We welcome your comments on this topic on our social media channels, or [contact us directly] with questions about the site.
Comment  | 
Print  | 
More Insights
Slideshows
What Digital Transformation Is (And Isn't)
Cynthia Harvey, Freelance Journalist, InformationWeek,  12/4/2019
Commentary
Watch Out for New Barriers to Faster Software Development
Lisa Morgan, Freelance Writer,  12/3/2019
Commentary
If DevOps Is So Awesome, Why Is Your Initiative Failing?
Guest Commentary, Guest Commentary,  12/2/2019
White Papers
Register for InformationWeek Newsletters
Video
Current Issue
Getting Started With Emerging Technologies
Looking to help your enterprise IT team ease the stress of putting new/emerging technologies such as AI, machine learning and IoT to work for their organizations? There are a few ways to get off on the right foot. In this report we share some expert advice on how to approach some of these seemingly daunting tech challenges.
Slideshows
Flash Poll