Does Your Data Need to Be Perfect Before Using AI?
While good data is the foundation of effective AI implementation, it does not need perfect data to get started.
While good data is the foundation of effective AI implementation, it does not need perfect data to get started.
Part 3 of 6 · AI for Humans: Transforming Ideas into Action
AI projects rely on data. But the assumption that you need perfect data to get started often delays or derails useful progress. Most businesses begin with fragmented, inconsistent and underutilised datasets. That is normal. The question is not whether your data is perfect. It is whether you understand what you have, how it is structured and what it can support.
This guide is about strengthening your data foundations without major investment or disruption. The changes are small but they make the difference between an AI pilot that has something solid to work with and one that stalls in the first week.
Before improving your data, take time to understand its current state. A simple, focused inventory will reveal more value than you might expect.
For the business problem you prioritised in Part 1, document the following:
What relevant data do you currently collect?Examples: customer details, sales transactions, website interactions, operational metrics, service records.
Where does this data live?Examples: CRM systems, spreadsheets, accounting software, email, surveys, paper records.
How good is the data?Ask yourself:
These questions give you a clear view of what can be used now, what needs improvement and what should be archived or retired.
As you map out your current data, take note of systems or tools that are rarely used but still consuming resources. Retiring unused platforms or consolidating overlapping systems can reduce digital waste and energy usage, particularly in cloud environments.
Rather than trying to fix all your data issues, focus on a few changes that reduce friction and improve usability.
Challenge: Inconsistent data entry (e.g. customer names entered in different formats) makes reporting unreliable and messy.
Solution: Pick a few fields that are most important to your business (like customer name, job title, or product category). Create a short guide for how these should be entered. Then ask your team to follow the same rules.
Example: One of our clients standardised industry types across their CRM and billing system in a week, giving them clearer insight into customer trends with no new tools.
Challenge: Incorrect or missing information in forms or systems causes follow-up issues like failed deliveries or broken reports.
Solution: Use the built-in settings in tools like your CRM or website form to add checks (e.g., an email contains “@” or a postcode matches the UK format).
Example: A retailer we worked with added postcode validation to their checkout form and reduced failed deliveries immediately.
Challenge: Teams often calculate the same thing in different ways, which leads to confusion, rework, or bad decisions.
Solution: Pick a few key metrics (like “monthly revenue” or “on-time delivery”) and agree on how they should be defined and calculated. Write them down and make sure everyone uses the same version.
Example: We worked with a wholesaler who standardised the “on-time delivery” metric across sales, ops, and customer service. This instantly cleared up misaligned reporting.
Challenge: Related data often sits in separate systems (e.g. sales data in one place, feedback in another), making it hard to spot patterns.
Solution: Start simple. Export two related data sets and combine them in a spreadsheet. Even just looking at them side-by-side can reveal valuable insights.
Example: A healthcare provider we supported combined appointment scheduling and satisfaction scores into one spreadsheet. This helped them identify drop-off patterns that no single system showed on its own.
Manual exports might feel low-tech, but they can also highlight where you are duplicating data processing across multiple systems. Consolidating that reduces energy usage as well as complexity.
Governance does not need to be complex. These light-touch practices will help maintain your progress:
| Action | Who's Responsible? | Notes |
|---|---|---|
| Assign ownership for each dataset | Not a full-time role | |
| Create a basic data dictionary | Start with key fields | |
| Monthly data review (30 mins) | Use a simple scorecard | |
| Keep a "data debt" log | Prioritise based on business value |
We have put together a downloadable worksheet that covers the full data foundations process, from inventory through to a two-week sprint plan.
This does not guarantee clean data but it does create conditions where progress can be tracked and improvements can be sustained. Which is often enough to support your first few AI initiatives.
Better governance also has a sustainability upside. Improving clarity and reducing rework means your team uses less time, energy, and computational power. Small steps, but they add up.
Here is a suggested action plan to help you make solid progress quickly.
Week 1: Assessment and Planning
Week 2: Quick Wins Implementation
Improving your data does not require an overhaul. It starts with clarity: knowing what you have, how it is being used and what you want it to support. Small, deliberate actions can remove obstacles to AI adoption and increase the return on future investment.
Get each new part direct to your inbox.
Related service
Product strategy, service design, and practical AI integration.