Rich data is what fuels online catalogs and the product search experience. Our core services include data sourcing, data normalization and data maintenance but we also handle more comprehensive data projects and work with clients to develop catalog taxonomy, catalog schema and data cleansing.
Data sourcing is the practice of crawling websites, PDFs, MS Office documents, databases, ERP systems and other product content repositories to support the underlying content in online catalogs with faceted search or guided navigation.
Distribution companies, in particular, have challenges securing digital content because suppliers have varying abilities to provide digital assets, product attribute data and images. Our ability to source data to enrich a distributor’s existing data set improves the consumer ecommerce experience and increases online conversions.
Data normalization is the practice of identifying variabilities in sourced data and normalizing it to fit established catalog attributes and values. Normalizing data is important to support a consistent experience for digital consumers by presenting uniform values, attributes and product categories within the merchandiser’s online catalog or ecommerce platform.
During the data normalization process, we research how customers search and make recommendations to present data in a customer-centric fashion as opposed to following industry convention. Customer search queries provide clues on how to restructure product data, attribute labels and product classifications to support the buying journey through SEO and site search.
Data cleansing is the practice of taking sourced, normalized data and identifying discrepancies or outliers and reconciling it against a rule, specification and standard. Typically, data cleansing is driven by a specification or standard to create clean, accurate values.
Online catalogs are overflowing with opportunity for data cleansing for a number of reasons:
New product additions
Erroneous product data provided and populated with no quality assurance process
Source data lacking a single source of truth because product data resides in many different places and mastered nowhere. For example, product data could reside in an ERP System, PIM System, PDF Catalog, Online Catalog, Internal Spreadsheets or other Product Information Database.
No organizational commitment to manage and maintain content over time
Because data is ever changing, data maintenance agreements are available to support on-going client eBusiness initiatives.
We have experience sourcing and managing data to support Magento, Hybris, Epicor, Infor, Profit 21 and many other ecommerce platforms.