Weekly Briefing

article sponsor image
Partner Content

When “Good Enough” Data Isn’t: Why AI Success Demands a Clean Data Diet

5 min read
listen to article Listen to this article

The real estate industry has always been a quick mover when it comes to dealmaking, despite its bearish attitudes towards tech. In a business driven by relationships, reputation, and responsiveness, there’s some tolerance for making big decisions with less-than-perfect information, but that’s rapidly changing.

As artificial intelligence (AI) adoption grows, the impact of incomplete or imperfect data becomes wider-reaching and more dangerous. Humans in decision making roles have enough context and experience to make judgment calls with limited information, but even the smartest AI can deliver bad decisions – or hallucinate– when missing the data it needs.


AI Use Cases for Real Estate

The simplest way to consider AI as a tool without getting overwhelmed is to isolate the tasks that need doing that require reviewing information to make a decision.


Here are some examples of real estate work easily accomplished with AI, along with data types you would need to provide:

  • Property valuation
    • Construction costs
    • Recent property transaction data
    • Sales data for comparable properties by asset class, geographic location, etc.
  • Risk assessment
    • Environmental assessments and survey data
    • Title and ownership history
    • Indicators of market volatility.
  • Lease management
    • Existing lease agreements and terms
    • Tenant payment history
    • Maintenance and repair records
    • Relevant market data.
  • Tenant screening
    • Tenant credit scores or financial statements
    • Rental payment history
    • Previous landlord references
    • Employment history.
  • Portfolio management
    • Property financial statements
    • Age and condition of all properties
    • Lease expiration dates and renewal options
    • Market data and growth forecasts.
  • Operational efficiency
    • Maintenance and repair schedules
    • IoT sensor data for connected systems and devices
    • Utility consumption data
    • Vendor and contractor records and performance data
    • Invoice and payments history.
  • Compliance
    • Environmental compliance reports
    • Tax assessment and reporting data
    • Fair housing and anti-discrimination records.
  • Environmental Sustainability
    • Energy consumption data
    • Utility rates over time
    • Sustainability certification records or certifications
    • Environmental impact assessments.
Work that requires frequent analysis but fairly standardized decision making across repetitive data sets is ideal for automating with AI. But the outcomes will only be as good as the data available for processing by the AI, or training any advanced learning models.

This is where most organizations run into trouble. Too much of the data they rely on exists in disjointed pockets of institutional knowledge spread across digital files, cabinets full of paper, or individual employee memory.


Humans learn to compensate for missing info with collaboration and intuition. Your tech will need a leg up to prevent creating inefficiencies, inaccuracies, or introducing new potential risks.


How to Clean Up Your Data for AI Projects

For the purpose of operationalizing by way of AI, the bar for clean, usable data is availability, readability, completeness, reliability, and consistency. Timeliness increases in importance when quick decisions are required.


Here’s a standard approach to getting your data AI-ready:

  1.       Audit:
    1. Identify all available data sources, their reliability and usefulness
    2. Identify all business processes around data use and collection
    3. Identify data gaps that prevent automation
    4. Identify inconsistencies, inefficiencies, and workarounds due to system or data limitations

  2.       Focus:
    1. Define the questions you want to be able to answer or decisions you’d like to make automatically and map out paths or flows from data point to decision
    2. Decide what to solve for and don’t look for a one-size-fits-all solution, focus on one business problem at a time

  3.       Centralize:
    1. Select a single source of truth like a CRM or ERP or other database where your data can be maintained and updated
    2. Define a process for importing data from other sources when necessary

  4.       Standardize:
    1. Prune what’s not necessary, standardize naming and numbering conventions
    2. Remove duplicate data – this is one of the biggest problems for accuracy and consistency down the road
    3. Define a schema for your data management – this could be as simple as a spreadsheet to indicate the name, type, and purpose of each field in a database record, and can help solidify decision making and prevent unnecessary complexity as your needs evolve
    4. Restrict access to key data

  5.       Educate:
    1. Before you train your AI, train your employees, your vendors, your partners, anyone who interacts with your data
    2. Everyone should have a shared understanding of how things work and why, how to update information, and whose responsibility it is to keep it up to date
    3. Schedule regular reviews and re-training sessions with your team to keep everyone on the same page

  6.       Select:
    1. Now you’re ready to consider the right AI solution for your needs—if it seems to be late in the process, you’re underestimating the size of the gap between what AI can do for your business and what’s required to succeed
    2. The better defined your problem, parameters, and available data, the easier it will be to choose the right level of intelligence and automation for your project; you might find an AI solution already exists specifically for lease abstraction or invoice processing
    3. Make the most of available models and resources by balancing what will be optimal for how you intend to use your data and will integrate into your existing tech stack and processes without needless disruption – doing the cleanup work up front will make this easier

  7.       Train & Review:
    1. Once you start training AI to operationalize your data, you’ll find ample opportunities to refine and perfect what it can do, but this part will remain ongoing
    2. The best AI, or the best AI for each job, will continue to show improvement with more cycles and as more data becomes available – that’s what separates it from simple workflows and flowcharts or templates
    3. Make regular review of results and expectations a part of your AI management strategy and enjoy the fruits of improved speed and efficiency across your business operations

As our industry evolves and adopts new technologies, the commitment to clean data becomes increasingly vital. Shortcuts taken in the past to expedite processes can transform into significant impediments when AI adds unprecedented speed and scalability. Every organization goes through these cycles, but those that invest in clean data will reap lasting dividends in terms of efficiency, accuracy, and innovation.

David Stifter, CEO & Co-Founder, PredictAP
David Stifter has spent over two decades at the nexus of real estate, technology, and finance. In 2020, he founded PredictAP to address a gap in the real estate AP automation market with an AI-powered invoice capture solution. He previously served as Managing Director at Digital Bridge (formerly Colony Capital), overseeing data architecture, process improvement, and accounting and finance transition projects for major acquisitions.

This Week’s Sponsor

PredictAP automates invoice capture for Yardi Payscan using proprietary AI. It eliminates manual data entry and improves AP outcomes without disrupting existing workflows. PredictAP is a Yardi Standard Interface Partner with US and EU API access, delivering seamless integration, and rapid time to value. Learn more at www.predictap.com.