FORMATION-GRADE LEAD INTELLIGENCE · 4 STATES LIVE · UPDATED MAY 2026
DATA DICTIONARY · v2.0

EVERY FIELD.
EVERY SOURCE.
EVERY TIMESTAMP.

The complete field reference for every data point we deliver. Every field ships with full provenance, license classification, and a confidence indicator. No black boxes. No handwaving.

100+
Customer-deliverable fields
265K
Leads in dataset (across 4 states)
4 + 46
States live · on roadmap
3
Orthogonal scores · Reachability · Relevance · Confidence
How to read this

Seven pieces of metadata. Per field.

We model this on Acxiom InfoBase and HL7 FHIR — the difference is every field carries provenance and license at the field level, not the record level. Most vendors ship a record with one timestamp. We ship a record with one timestamp per field.

Public Record
Free to use. Sourced directly from Secretary of State filings and federal data.
Enriched
Vendor-licensed. Skip-trace and validation data — terms apply.
Derived
Our IP. Computed by GoodLeads, licensed under your contract.

Confidence on every derivation.

No value ships without a signal telling you how much to trust it. Three patterns appear throughout the dictionary.

Pattern 01 Numeric scores 0–100, reported with score plus named tier — Reachability 73 → "Very Hot."
Pattern 02 Confidence tiers confirmed · likely · possible · unknown — used for classifications.
Pattern 03 Match status MATCHED · NEAREST · NO_MATCH — used for spatial and identity joins.
No matches

Nothing here. Try a different search.

Section 01

Identifiers

The persistent keys you use to track a lead through your CRM, your campaigns, and your conversations.

FieldLabelTypeSourceLicenseCoverageUse case
lead_refLead ReferenceText · GL-{STATE}-{NNNNN}derivedDerivedAlwaysSpeak about a lead in plain language: "GL-CO-12345 just hit Very Hot." Stable across enrichment cycles.
contact_refContact ReferenceText · GLC-{NNNNN}derivedDerivedAlwaysTrack a person across multiple businesses. One contact may link to many entities.
state_entity_idState Entity IDTextco_sos · fl_sunbiz · va_sccPublicAlwaysThe state's own ID. Use to look up the original filing.
source_systemSource SystemCategorical · CO_SOS, FL_Sunbiz, VA_SCCco_sos · fl_sunbiz · va_sccPublicAlwaysWhich state the record came from.
Section 02

Business Profile

The core firmographic record — what every B2B data product ships, normalized to a single canonical schema across 50 state formats.

FieldLabelTypeSourceLicenseCoverageUse case
entity_nameBusiness NameTextSOS pullPublicAlwaysThe legal name as filed.
entity_typeEntity TypeCategorical · LLC, Corporation, LP, LLP, PC, PLLC, Partnership, OtherSOS pullPublicAlwaysLegal structure — drives qualification rules and tax/banking pitches.
formation_dateFormation DateDate · ISO 8601SOS pullPublicAlwaysThe day the business was created. Days-since-formation is the single strongest "moment of formation" signal.
statusFiling StatusCategorical · Active, Inactive, Dissolved, WithdrawnSOS pullPublicAlwaysCurrent standing with the state.
jurisdictionJurisdiction of FormationCategorical · DOMESTIC_{ST}, FOREIGN_{ST}SOS pullPublicAlwaysWhere the entity was formed (domestic vs. foreign authority).
officer_countOfficer CountIntegerSOS pullPublicFL/VA: yes; CO: ra_onlyNumber of officers on the filing. Drives our "is this a solopreneur" inference.
public_available_atPublic Available DateDateSOS pullPublicAlwaysWhen the record became publicly searchable — the start of the speed-to-contact race.
standing_status_as_ofStanding Status As OfDateco_sosPublicCO onlyDate status was last refreshed with the state.
Section 03

Location & Geography

The SOS gives you a string. We give you a parsed, geocoded, census-attributed location.

FieldLabelTypeSourceLicenseCoverageUse case
principal_address_rawPrincipal Address (Raw)TextSOS pullPublicAlwaysThe address string exactly as filed. Kept for audit.
principal_address_normalizedPrincipal Address (Normalized)Object · {street, city, state, zip, country}derivedDerivedAlwaysParsed and standardized. USPS-style normalization across 50 state formats.
principal_cityPrincipal CityTextSOS pullPublicAlwaysGeographic targeting at city granularity.
principal_statePrincipal StateText · 2-letterSOS pullPublicAlwaysState of the principal address (may differ from formation state for foreign filings).
principal_zipPrincipal ZIPTextSOS pullPublicAlwaysZIP-level targeting and DMA rollup.
mailing_address_rawMailing Address (Raw)TextSOS pullPublicState-dependentWhere the business receives mail (often a personal address — strong contact signal).
mailing_address_normalizedMailing Address (Normalized)ObjectderivedDerivedState-dependentParsed mailing address.
mailing_cityMailing CityTextSOS pullPublicState-dependent
mailing_stateMailing StateText · 2-letterSOS pullPublicState-dependent
mailing_zipMailing ZIPTextSOS pullPublicState-dependent
latitudeLatitudeNumeric · decimal degreesus_census_geocoderPublic~92%Map placement, radius search, drive-time targeting.
longitudeLongitudeNumeric · decimal degreesus_census_geocoderPublic~92%Same as above.
geocode_statusGeocode Match StatusCategorical · Match, No_Match, Tieus_census_geocoderDerivedAlwaysWhether the address geocoded cleanly. Tie flags ambiguous addresses.
county_fipsCounty FIPS CodeText · 5 digitsus_census_geocoderPublic~92%Federal county code. Joins to BLS, IRS, and FEMA data.
census_tractCensus TractText · 11 digitsus_census_geocoderPublic~92%Demographic and income-band rollups via ACS data.
Section 04

Industry Classification

We don't buy NAICS/SIC from a third party. We classify every entity ourselves at load time — rule-based fusion of suffix patterns, keyword matching, and entity-type rules. Confidence ships with every value.

FieldLabelTypeSourceLicenseCoverageUse case
industry_sectorIndustry SectorCategorical · 14 values incl. Healthcare, Construction, Technology, Real Estate, Finance, Retail, Manufacturing, Hospitality, Transportationclassification_svcDerived~96%High-level segmentation for campaign routing.
industry_nameIndustryCategorical · 113 values (Psychology, Landscaping, Software Development, Property Management, Accounting, Dental Practice…)classification_svcDerived~96%Granular industry — the right altitude for vertical-specific outreach.
industry_formation_codeIndustry CodeText · hierarchical (HC-PSY, CON-LAND…)classification_svcDerived~96%Stable code for joins and CRM mappings.
industry_confidence_scoreIndustry ConfidenceNumeric · 0.0–1.0classification_svcDerived~96%Numeric confidence from the fusion classifier.
industry_confidence_tierIndustry Confidence TierCategorical · confirmed (≥0.85), likely (≥0.60), possible (≥0.40), unknown (<0.40)classification_svcDerivedAlwaysDecision rule for downstream filtering — most teams target confirmed + likely only.
industry_signals_countClassifier SignalsIntegerclassification_svcDerivedAlwaysHow many independent signals agreed on the classification. Higher = more robust.
Section 05

Property Intelligence

Where most B2B data products end, we begin. Every principal address joined against ~16M parcel boundaries. A residential principal address is a different sales motion than a commercial one — we surface that distinction at load time.

FieldLabelTypeSourceLicenseCoverageUse case
property_classificationProperty TypeCategorical · RESIDENTIAL, COMMERCIAL, INDUSTRIAL, AGRICULTURAL, MULTI_FAMILY, MIXED_USE, VACANT, EXEMPT, UNKNOWNproperty_classification_svcDerived~83%Distinguish home-based businesses from offices. Filter out shell companies (vacant).
property_classification_rawProperty Type (Raw Code)Textproperty_classification_svcPublic~83%The unmodified county property-use code.
property_type_detailProperty SubtypeText · Single Family, Condo, Office Building, Warehouse, Apartment…property_classification_svcPublic~83%Sub-type granularity from county records.
property_is_vacantIs VacantBooleanproperty_classification_svcDerived~83%Strong shell-company signal — vacant principal address = exclude.
property_assessed_valueAssessed ValueInteger · USDproperty_classification_svcPublic~83%County-assessed total value. Proxy for owner net worth and account size.
property_lookup_confidenceProperty Match ConfidenceCategorical · MATCHED, NEAREST, NO_MATCHproperty_classification_svcDerivedAlwaysHow confident the spatial join is. MATCHED = exact parcel; NEAREST = best guess within 50m.
parcel_idParcel IDTextproperty_classification_svcPublic~83%County parcel identifier for deeper records lookup.
data_vintageProperty Data VintageDateproperty_classification_svcPublic~83%When the parcel data was last published — tells you how fresh the assessed value is.
Section 06

Registered Agent Intelligence

Proprietary IP. The SOS gives you an RA name and address. We give you cross-state volume, market tier, formation-service brand attribution, and a GTM segment — all derived from a curated registry built from SEC filings, BBB records, and acquisition tracking.

FieldLabelTypeSourceLicenseCoverageUse case
registered_agent_nameRegistered Agent NameTextSOS pullPublicAlwaysThe named RA on the filing.
registered_agent_address_rawRA Address (Raw)TextSOS pullPublicAlwaysRaw RA address.
registered_agent_address_normalizedRA Address (Normalized)ObjectderivedDerivedAlwaysParsed RA address.
ra_typeRA TypeCategorical · P (Person), C (Corporation)fl_sunbizPublicFL onlyPerson or commercial RA service.
ra_self_representedSelf-RepresentedBooleanderivedDerivedAlwaysTrue if the owner filed as their own RA — strong "founder-led, no formation service" signal.
ra_market_tierRA Market TierCategorical · premium ($300+/yr), midmarket ($100–200), budget ($0–50), local, individual, suspicious, unknownra_intelligence_svcDerivedAlwaysIndicates the price tier the founder paid for formation help. Premium = mature / cost-insensitive; Budget = price-shopping DIY founder.
ra_entity_volumeRA Entity Volume (Cross-State)Integerra_intelligence_svcDerivedAlwaysHow many entities this RA serves across our entire dataset.
ra_address_cluster_countRA Address Cluster SizeIntegerderivedDerivedAlwaysNumber of entities sharing the exact RA address — flags shared commercial offices and mail-drops.
ra_is_attorneyRA Is AttorneyBooleanra_intelligence_svcDerivedAlwaysAttorney-as-RA pattern. Strong signal of premium legal-services formation path — different motion than DIY founders.
contact_name_in_entity_nameContact Name In Entity NameBooleanra_intelligence_svcDerivedAlwaysOwner's name appears in the business name (e.g. "Smith Consulting LLC"). Lifts contact-relevance and confidence.
gtm_segmentGo-to-Market SegmentCategorical · owner_operator, budget_formation, midmarket_formation, premium_established, suspicious_exclude, unclassifiedra_intelligence_svcDerivedAlwaysThe single most useful field for campaign segmentation. Combines RA tier + self-rep + jurisdiction + officer count.
RA Portfolio Shape

What this RA's book of business actually looks like

Beyond volume and price tier — three percentages that describe the kind of clients an RA serves. A budget RA with 95% LLC + 90% single-officer clients is selling to one persona; a premium RA with 30% foreign-filed corporates is selling to a completely different one.

FieldLabelTypeSourceLicenseCoverageUse case
domestic_pctDomestic Filing %Numeric · 0–100ra_intelligence_svcDerivedAlways% of this RA's clients formed in the same state where the entity is registered. High = local RA serving local businesses; low = cross-border filing operation.
llc_pctLLC Mix %Numeric · 0–100ra_intelligence_svcDerivedAlways% of clients that are LLCs vs. corporations / LPs. Discount RAs skew heavily LLC; corporate-focused RAs less so.
single_officer_pctSingle-Officer %Numeric · 0–100ra_intelligence_svcDerivedFL / NY (officer data); CO/VA may report 0 — sparse officer data, not a bug% of clients filing with exactly one officer/member. Proxy for "kitchen-table" small businesses vs. real operating entities.
Section 07

Formation Service Attribution

Who did this founder buy their LLC from? LegalZoom? Bizee? TailorBrands? We answer that with a curated registry of 20+ formation services and their RA subsidiaries — evidenced from SEC 10-Ks and acquisition disclosures.

FieldLabelTypeSourceLicenseCoverageUse case
formation_serviceFormation ServiceCategorical · LegalZoom, ZenBusiness, Bizee, TailorBrands, Inc Authority, Northwest, IncFile, Stripe Atlas, Doola, None…ra_intelligence_svcDerived~30% (when identifiable)Identify which formation product the founder already trusts — informs partnership conversations and competitive displacement plays.
ra_providerRA ProviderCategorical · same enum as formation_servicera_intelligence_svcDerivedAlways (when RA is a known provider)Who operates the RA subsidiary. May differ from formation_service when ownership is layered.
formation_service_confidenceAttribution ConfidenceCategorical · certain, very_high, high, medium, lowra_intelligence_svcDerivedWhen formation_service is setHigher = brand name appears directly in RA. Lower = inferred from address co-location.
formation_service_attributionAttribution MethodCategorical · direct, ra_onlyra_intelligence_svcDerivedWhen formation_service is setHow we made the call. direct is bulletproof; ra_only requires confidence-tier filtering.
Section 08

Contact

The person you actually call. We never ship "info@" addresses or main switchboard numbers — every contact is name-matched to an officer, owner, or self-representing RA.

FieldLabelTypeSourceLicenseCoverageUse case
contact_nameContact NameTextSOS pull · tracerfy · derived
PublicEnriched
AlwaysFull name. From the filing when officers are listed; from skip trace otherwise.
role_titleRole / TitleText · CEO, President, Owner, Manager, Member, MGR, P/S…SOS pullPublicState-dependentJob title from the filing. Drives decision-maker scoring.
relationship_typeRelationship to BusinessCategorical · OWNER, OFFICER, REGISTERED_AGENTderivedDerivedAlwaysHow this person relates to the business.
phone_primaryPrimary PhoneText · E.164tracerfy · enformionEnriched~60% Tracerfy + gap-fillBest phone. Mobile preferred over landline; landline preferred over VoIP.
phone_secondarySecondary PhoneText · E.164tracerfyEnrichedEnrichedSecond-best phone.
phone_tertiaryTertiary PhoneText · E.164tracerfyEnrichedEnrichedThird-best phone.
email_primaryPrimary EmailTexttracerfy · enformionEnriched~48% Tracerfy + gap-fillBest email.
email_secondarySecondary EmailTexttracerfyEnrichedEnrichedSecond-best email.
Section 09

Email Validation

Every email runs through ZeroBounce. We report the result, the inbox-activity recency, and whether the name on the inbox matches the person on the filing. Most providers ship "valid" or "invalid." We ship the full diagnostic.

FieldLabelTypeSourceLicenseCoverageUse case
zb_email_statusDeliverability StatusCategorical · valid, catch-all, abuse, invalid, unknownzerobounceEnrichedEnrichedWhether the email will deliver.
zb_email_sub_statusDeliverability Sub-StatusCategorical · alternate, mailbox_not_found, greylisted, role_based, disposable…zerobounceEnrichedEnrichedDetail behind the status. role_based = info@-style. alternate = better address available.
email_is_free_providerIs Free ProviderBooleanzerobounceEnrichedEnrichedGmail/Yahoo/Outlook/Apple flag. Personal vs corporate signal.
email_smtp_providerEmail InfrastructureCategorical · google, microsoft, yahoo, apple, comcast, rackspace…zerobounceEnrichedEnrichedWho runs the inbox. Tier-1 providers (Google, Microsoft) deliver more reliably.
email_mx_foundMX Records PresentBooleanzerobounceEnrichedEnrichedPrerequisite for delivery.
email_mx_recordMX RecordTextzerobounceEnrichedEnrichedThe actual MX string.
email_domain_age_daysDomain Age (Days)IntegerzerobounceEnrichedEnrichedDomain age. New domains = higher spam risk.
email_activity_foundInbox Activity DetectedBooleanzerobounceEnrichedEnrichedThe strongest "is this person reading mail" signal we have.
email_active_in_daysDays Since Last ActivityInteger / categorical · 60, 90, 180, 365, 365+zerobounceEnrichedEnrichedRecency of inbox activity. Drives the Reachability score.
zb_name_firstInbox Account First NameTextzerobounceEnrichedEnrichedThe first name registered on the inbox account.
zb_name_lastInbox Account Last NameTextzerobounceEnrichedEnrichedLast name registered on the inbox.
email_name_verificationName vs. Filing MatchCategorical · FULL_MATCH, LAST_MATCH, FIRST_MATCH, NO_MATCHderivedDerivedEnrichedDoes the name on the inbox match the name on the filing? Catches forwarded inboxes, family-shared addresses, and impersonation.
Section 10

Phone Validation

FieldLabelTypeSourceLicenseCoverageUse case
phone_validityPhone ValidityCategorical · MOBILE_VALID, LANDLINE_VALID, VOIP_VALID, INVALID, UNKNOWNtracerfyEnrichedEnrichedActive mobile, landline, or VoIP. Mobile has the highest connect rate.
enf_phone_primaryGap-Fill PhoneText · E.164enformionEnrichedGap-fill onlyBest phone from EnformionGo when Tracerfy missed.
enf_phone_connectedGap-Fill Phone ActiveBooleanenformionEnrichedEnrichedWhether the gap-fill phone is currently connected.
enf_phone_last_seenGap-Fill Phone Last SeenDateenformionEnrichedEnrichedMost recent date the number was reported active.
Section 11

Reachability Score

0–100 composite answering one question: can we reach this person? Every component ships alongside the score. See exactly how it was built. Tune your own thresholds.

FieldLabelTypeSourceLicenseCoverageUse case
reachability_scoreReachability ScoreInteger · 0–100scoring_svcDerivedAlwaysThe headline score. Tunable threshold for cohort selection.
reachability_tierReachability TierCategorical · On Fire (80–100), Very Hot (60–79), Hot (40–59), Warm (20–39), Cold (0–19)scoring_svcDerivedAlwaysNamed tier — drop into a CRM lifecycle stage without further work.
Score Components

Shipped in the parameters payload

ComponentMax pointsWhat it measures
email_validity_pts25ZeroBounce status — valid and catch-all score highest.
email_activity_pts20Inbox activity recency. 0–60 days = full points.
phone_reach_pts20Phone availability and type. Mobile > landline > VoIP.
geo_intel_pts10Geocode precision — lat/lon + county + census tract all present.
name_verify_pts10First/last name match between ZeroBounce inbox and SOS filing.
identity_conf_pts10Skip-trace identity confidence (Tracerfy name match).
email_infra_pts5Top-tier email provider + MX records present.
industry_bonus_pts5Industry classification confidence bonus.
Never-Downgrade Protection

Once a record reaches a score, a subsequent reload cannot lower it. Paid enrichment is irreversible — the data is too.

Section 12

Contact Relevance Score

0–100 composite answering an orthogonal question: is this the RIGHT person to sell to? A contact can be highly reachable but the wrong person — a commercial RA employee instead of the founder. This score separates them.

FieldLabelTypeSourceLicenseCoverageUse case
contact_relevance_scoreContact Relevance ScoreInteger · 0–100scoring_svcDerivedAlwaysHeadline relevance score — decision-maker proxy.
contact_relevance_tierContact Relevance TierCategorical · Decision Maker (85–100), Likely Decision Maker (65–84), Probable Contact (45–64), Uncertain Contact (25–44), Unlikely Decision Maker (0–24)scoring_svcDerivedAlwaysNamed tier — most teams target Decision Maker + Likely.
Score Components

Shipped in the parameters payload

ComponentMax pointsWhat it measures
self_representation_pts25Self-rep RA + individual RA = founder filing alone.
contact_name_source_pts20Officer (best) > Owner > RA person > Commercial RA.
location_uniqueness_pts20How many entities share this lat/lon. Unique = real address.
contact_exclusivity_pts15How many entities this contact links to. 1 = exclusive; 51+ = shared mail-drop.
name_in_entity_pts10Person's name appears in the business name (e.g. "Smith Consulting LLC").
ra_tier_pts5Individual RA scores higher than commercial.
entity_type_pts5Domestic LLC + 1 officer = highest.
interaction_penalty−10Applied when contact is on 4+ entities AND 6+ entities share location. Catches commercial-RA office pattern.
Section 13

Contact Confidence Score

0–100 composite answering a third orthogonal question: are we confident this email or phone really belongs to this person? A reachable, relevant contact can still be the wrong identity — a forwarded inbox, a family-shared phone, a name collision. This score isolates that risk.

FieldLabelTypeSourceLicenseCoverageUse case
contact_confidence_scoreContact Confidence ScoreInteger · 5–100 (floor of 5; never zero)scoring_svc · contact_confidence_v1.1DerivedAlwaysHeadline identity-confidence score. Pair with Reachability + Relevance for a three-axis lead-quality picture.
contact_confidence_tierContact Confidence TierCategorical · Verified Contact (75–100), Likely Contact (50–74), Possible Contact (25–49), Uncertain Contact (5–24)scoring_svc · contact_confidence_v1.1DerivedAlwaysNamed tier — most teams gate outbound at Verified + Likely.
Score Components

Five weighted signals · sum to 100

ComponentMax pointsWhat it measures
email_name_match30Does the email local-part contain name tokens that match the filing? Strongest single identity signal.
zb_quality20ZeroBounce validity + recent inbox activity (someone is actually reading this mailbox).
phone_ra_alignment20Mobile phone + self-represented RA = the founder's personal line. Strongest paired identity signal.
property_context15Residential principal address raises confidence (vs. shared commercial / virtual-office address).
ra_self_rep15Self-rep RA = the contact almost certainly IS the founder, not an employee or RA service rep.
Three Orthogonal Scores

Reachability answers "can we reach them?" · Relevance answers "are they the right person?" · Confidence answers "is this contact really them?" A lead can be high on any two and low on the third. Most providers ship one score (or none). We ship all three, with components.

Section 14

Pre-Enrichment Strategy

Before we spend a single skip-trace credit, our Contact Intelligence service decides how to enrich each entity. This is the field that explains why some leads got 2-credit advanced traces and others got 1-credit basic.

FieldLabelTypeSourceLicenseCoverageUse case
enrichment_strategyEnrichment StrategyCategorical · name_and_address, address_only, mailing_fallback, holdcontact_intelligence_svcDerivedAlwaysThe strategy we used (or would use) for skip-tracing this entity.
name_confidenceName ConfidenceCategorical · high, medium, lowcontact_intelligence_svcDerivedAlwaysHow confident we are that the contact name is a real, unique person.
address_confidenceAddress ConfidenceCategorical · high, medium, lowcontact_intelligence_svcDerivedAlwaysHow confident we are that the address is a real, residential, non-shared location.
name_densityName DensityIntegercontact_intelligence_svcDerivedAlwaysNumber of entities sharing the same contact name. High = generic/common name (lower identity confidence); 1 = unique person.
address_densityAddress DensityIntegercontact_intelligence_svcDerivedAlwaysNumber of entities at the same principal address. 1 = real address; 50+ = mail-drop / virtual office / commercial RA building.
cross_state_contact_countCross-State Contact CountInteger · 1–4contact_intelligence_svcDerivedAlwaysHow many of the live states this contact name appears in. Multi-state presence is a strong serial-founder / professional-filer signal.
Section 15

Gap-Fill Vendor Data

When Tracerfy can't find a contact, we fall back to EnformionGo. These fields show up only on records where the gap-fill ran.

FieldLabelTypeSourceLicenseCoverageUse case
enf_identity_scoreGap-Fill Identity ScoreInteger · 0–100enformionEnrichedGap-fill onlyEnformionGo's own match-confidence score. Average ≥96 on matches.
enf_email_primaryGap-Fill EmailTextenformionEnrichedGap-fill onlyBest email from gap-fill when Tracerfy missed.
enf_ageEstimated AgeIntegerenformionEnrichedGap-fill onlyReported age — useful for life-stage targeting.
Section 16

Provenance & Audit

Every value above ships with three pieces of audit metadata. Available on request via the provenance sidecar export.

FieldLabelTypeSourceLicenseCoverageUse case
source_nameSource (per attribute)TextderivedDerivedAlwaysWhere this specific value came from. Per-field, not per-record.
license_flagsLicense (per attribute)Categorical · PUBLIC_RECORD, ENRICHED, DERIVEDderivedDerivedAlwaysLicensing tier — drives compliance and data-sharing rules.
observed_atObserved Timestamp (per attribute)DateTime · ISO 8601derivedDerivedAlwaysWhen this specific value was captured or last refreshed.
score_versionScore VersionText · semverscoring_svcDerivedAlwaysThe scoring algorithm version that produced this record's scores. Lets you reason about model drift.
Appendix A

Coverage by State

Observed fill rates from production runs (refreshed monthly), plus forward-looking forecasts for newly-onboarded states. Forecasts are dashed and prefixed with "~" so they're not mistaken for measurement — they reflect the deal-requirement floors (45% phone / 35% email) and the average of established states.

Solid cell = data-status="observed" · production fill rate Dashed cell = data-status="forecast" · projection · not yet measured
Stage / Source COFLVANYGA next
SOS filing fields100%100%100%100%100%
Geocoding90%92%93%92%
Industry classification95%96%96%96%
Property classification76%88%84%80%
RA intelligence100%100%100%100%
Skip-trace phone56%64%87%45%
Skip-trace email41%53%59%35%
Email validation48%48%48%48%
Reachability score100%100%100%100%
Contact Relevance score100%100%100%100%
Contact Confidence score100%100%100%100%
Appendix B

What's Different

A scraper ships you SOS data with a timestamp. We ship you the same SOS data — plus six things nobody else delivers.

— 01

Three Orthogonal Scores

Reachability ("can we reach them?"), Contact Relevance ("are they the right person?"), Contact Confidence ("is this contact really them?"). Plus 15+ derived classifications — Industry, Property Type, RA Market Tier, GTM Segment, Formation Service, RA Portfolio Shape, Density signals, Enrichment Strategy. All computed by us.

— 02

Cross-State Entity Resolution

Same RA, same contact — identified across CO, FL, VA at load time. Without you doing any joins.

— 03

Per-Field Provenance

Every value carries source, license, and timestamp. Audit any cell. Most vendors ship one timestamp per record. We ship one per field.

— 04

Per-State Semantic Mapping

"What does mailing address mean in NY vs FL vs CO." We've solved that. Your CRM sees one canonical schema across 50 state formats.

— 05

Confidence on Every Derivation

No value ships without a tier or score telling you how much to trust it. No black boxes.

— 06

Field-Level License Classification

PUBLIC_RECORD, ENRICHED, and DERIVED tracked at the field level. Your compliance team can reason about residency, sharing, and resale.