Almost everyday of the past 12 years I’ve looked at web
analytic data. It’s never been a completely satisfying experience. Log files
mess. At the end of the day, it is all “just numbers.” It doesn’t tell me what
to do. It doesn’t do it for me. It doesn’t let me shape it as I want to.
Reports are the output of modern web analytics. Sites are
layered with tags on top of the page and tables are created for the data to sit
and be compiled. There have been recent advances in reporting itself, namely
solutions like Chartbeat (one of the great startup names) that provide realtime
data streams. This leapfrogs data delivery over the hours of latency that
exists in the site analytics space. But even realtime analytic data is still just
a report, only a little more interesting by being timely. The Achilles heel of
all site analytics continues to be the ability to mashup datasets from the report
tables, an essential first step for segmentation and thus actionable analytics.
Not only is this useful for look backs but it also delivers the flexibility needed to
create new measurement tables.
Whether it was intentional or not Site Optimization tools
emerged a few years ago and began to fill this gap. While they were used to
optimize website performance they were marketed as Testing tools. Who could
argue for the need to test? Their liftoff was an exciting time in the market. Initially
these tools paid great attention to the content targeting aspect of the product,
the object. This was done not to the neglect of the product but to the
simplicity of early versioning.
More and more though it became clear that segmentation was what would
really exponentially improve the application of the technology.
The past few years Web analytic tools have evolved to some
degree and added more robust segmentation ability (AdWords being the ultimate segmentation creation and reporting tool). As
everyone involved in optimizing the web in one way or another knows, it is the
keystone to success. I do believe however that success for these solutions are
still limited by the underlying data collection and warehousing methods that
ultimately affect their ability to make the analytics actionable.
To be more specific, as in other applications, there is
limited flexibility in data collection approaches and tables requiring a
top-down approach to creating data sets that form the building blocks of
intelligent segment creation. Top-down is the classic breaking down of systems
to look at the sub systems compositional structure. It informed software engineering
until the rise of object-oriented programming in the mid 90’s. For the most
part it is the way web analytics works today.
Like the exciting rise of the OOP paradigms that now power
the web, at Yieldbot we are really exciting about new bottom-up approaches to data
set and segmentation creation possible using these same paradigms. We believe
data needs to be as flexible and realtime as the web itself. There is great power in the ability to
take data, make it a sub-system of the platform and enable it to be shaped
based on the goals of campaign. Just as the semantic web has opened up metadata
with bottom-up RDF and microformats we will see a new wave of analytics will
bring the same mashup ability to data. We’re already doing it.
As if that’s not enough incentive for bottom-up data approaches,
as more and more data inputs/outputs are API generated (another area current
web analytic solutions are lagging) the ability to take bottom-up approaches become
even more mission critical. The best example of this is Display advertising.
The speed to power new markets using APIs driven data are beyond showing the
potential, they are generating returns for the DSPs, Exchanges and RTB using it.
Interestingly, unlike the data sharing advances in media
buying, the analytics of advertising is spreadsheet wasteland with scarce little BI. One wonders how
this situation exists when it is marketers and advertisers that need
segmentation most of all. Segmentation breeds efficiency. Segmentation delivers
relevance. Segmentation elicits emotion. Segmentation and twin sister matching sit in the middle of all that’s relevant in digital. Over the next few years it
will increasingly become the difference of success or failure. Who’s matching you?
And now the new question, how are they doing it? Bottoms up!