entity dossier schema documentation

published: October 16, 2025

overview

This document describes the entity dossier JSON schema used to profile companies and organizations in the data center industry. Entity dossiers provide detailed profiles of hyperscalers, operators, financial sponsors, utilities, construction firms, and technology vendors.

schema versions

  • current version: 1.0 (October 2025)
  • file locations: /support/datacenters/entities/{category}/{entity_name}.json
  • total entities: 37 profiled entities
  • schema definition: /support/datacenters/entities/ENTITY_DOSSIER_SCHEMA.json
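
As a minimal sketch of the file location pattern above, the helper below builds a dossier path from an entity type and a file stem. It assumes the {category} directory name equals the entityType value, which the validation checklist implies but does not state outright. The rule for turning an entity name into a file name is not documented here, so `entity_stem` is a hypothetical parameter supplied by the caller.

```python
from pathlib import PurePosixPath

# Base directory taken from the file location pattern in this document.
ENTITIES_ROOT = PurePosixPath("/support/datacenters/entities")

# Entity types listed in this document; assumed to double as directory names.
ENTITY_TYPES = {
    "hyperscaler", "operator", "financial-sponsor",
    "utility", "construction", "technology-vendor",
}


def dossier_path(entity_type: str, entity_stem: str) -> PurePosixPath:
    """Return the expected dossier path for an entity.

    `entity_stem` is the file name without ".json". The naming rule for
    stems is not documented, so the caller supplies it (hypothetical).
    """
    if entity_type not in ENTITY_TYPES:
        raise ValueError(f"unknown entity type: {entity_type!r}")
    return ENTITIES_ROOT / entity_type / f"{entity_stem}.json"
```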

entity types

Entity Type | Description | Count
hyperscaler | Large cloud providers operating own infrastructure (AWS, Microsoft, Google, Meta, Oracle, OpenAI, xAI) | 7
operator | Data center operators providing colocation, wholesale, or managed services | 15
financial-sponsor | Private equity, sovereign wealth, infrastructure funds investing in data centers | 10
utility | Electric utilities serving data center power demands | 3
construction | General contractors specializing in data center construction | 1
technology-vendor | Technology providers (networking, servers, cooling, power) | 1

top-level structure

required fields

Field | Type | Description
entityType | enum | Primary category: “hyperscaler”, “operator”, “financial-sponsor”, “utility”, “construction”, “technology-vendor”
entityName | string | Official company/entity name
overview | object | Basic company information
sources | array | Array of source objects documenting dossier information
lastUpdated | string | Date of last dossier update (YYYY-MM-DD format)
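
The required fields can be checked with a few lines of Python. This is a hand-written sketch, not a substitute for validating against ENTITY_DOSSIER_SCHEMA.json; the function name `check_required_fields` and the wording of its messages are illustrative assumptions.

```python
import re
from datetime import date

# Required top-level fields and their JSON types, as listed in this document.
REQUIRED_FIELDS = {
    "entityType": str,
    "entityName": str,
    "overview": dict,
    "sources": list,
    "lastUpdated": str,
}

ENTITY_TYPES = {
    "hyperscaler", "operator", "financial-sponsor",
    "utility", "construction", "technology-vendor",
}


def check_required_fields(dossier: dict) -> list[str]:
    """Return a list of problems; an empty list means the check passed."""
    problems = []
    for name, expected_type in REQUIRED_FIELDS.items():
        if name not in dossier:
            problems.append(f"{name}: missing")
        elif not isinstance(dossier[name], expected_type):
            problems.append(f"{name}: expected {expected_type.__name__}")

    entity_type = dossier.get("entityType")
    if isinstance(entity_type, str) and entity_type not in ENTITY_TYPES:
        problems.append(f"entityType: unknown value {entity_type!r}")

    last_updated = dossier.get("lastUpdated")
    if isinstance(last_updated, str):
        try:
            # Enforce the exact YYYY-MM-DD shape, then check the calendar date.
            if not re.fullmatch(r"\d{4}-\d{2}-\d{2}", last_updated):
                raise ValueError(last_updated)
            date.fromisoformat(last_updated)
        except ValueError:
            problems.append("lastUpdated: not a valid YYYY-MM-DD date")
    return problems
```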

optional fields

Field | Type | Description
aliases | array | Alternative names, abbreviations, former names
dataCenterProfile | object | Data center-specific operations and portfolio information
strategy | object | Corporate strategy, growth plans, power strategy
financials | object | Financial metrics and performance
ownership | object | Ownership structure and acquisition history
partnerships | object | Key partnerships across power, construction, technology, finance
keyExecutives | array | Executive leadership profiles
competitiveAnalysis | object | Market position, SWOT analysis, competitive differentiation
timeline | array | Chronological history of major events
projectReferences | array | References to projects in main database
mediaPresence | object | Recent news and social media links
notes | string | Additional context, analysis, or observations

overview object

Field | Type | Description
overview.founded | string | Founding year or date (YYYY or YYYY-MM-DD)
overview.headquarters | object | HQ location with city, state, country
overview.publicCompany | boolean | Whether company is publicly traded
overview.ticker | string | Stock ticker symbol
overview.exchange | string | Stock exchange (e.g., “NASDAQ”, “NYSE”)
overview.marketCapUSD | number | Market capitalization in USD
overview.employeesTotal | number | Total employee count
overview.website | string | Official company website URL
overview.description | string | 2-3 paragraph company description
overview.businessModel | string | Detailed business model explanation

example:

{
  "overview": {
    "founded": "1975",
    "headquarters": {
      "city": "Redmond",
      "state": "Washington",
      "country": "United States"
    },
    "publicCompany": true,
    "ticker": "MSFT",
    "exchange": "NASDAQ",
    "marketCapUSD": 3800000000000,
    "employeesTotal": 228000,
    "website": "https://www.microsoft.com",
    "description": "Microsoft Corporation is a multinational technology company...",
    "businessModel": "Microsoft operates a diversified business model..."
  }
}
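
The two formats accepted for overview.founded (YYYY or YYYY-MM-DD) can be checked as follows. This is a small sketch; the function name `is_valid_founded` is an assumption, not part of the schema.

```python
import re
from datetime import date


def is_valid_founded(value: str) -> bool:
    """Accept "YYYY" or a real calendar date in "YYYY-MM-DD" form."""
    if re.fullmatch(r"\d{4}", value):
        return True
    if re.fullmatch(r"\d{4}-\d{2}-\d{2}", value):
        try:
            date.fromisoformat(value)  # rejects e.g. month 13 or day 32
        except ValueError:
            return False
        return True
    return False
```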

data center profile object

global footprint

Field | Type | Description
dataCenterProfile.globalFootprint.totalDataCenters | number | Total number of data centers globally
dataCenterProfile.globalFootprint.totalCapacityMW | number | Total global power capacity in MW
dataCenterProfile.globalFootprint.totalSquareFeet | number | Total global square footage
dataCenterProfile.globalFootprint.countries | number | Number of countries with presence
dataCenterProfile.globalFootprint.regions | array | Array of geographic regions

US footprint

Field | Type | Description
dataCenterProfile.usFootprint.projectsInDatabase | number | Number of projects in this database
dataCenterProfile.usFootprint.states | array | Array of US states with presence
dataCenterProfile.usFootprint.totalInvestmentUSD | number | Total US investment disclosed
dataCenterProfile.usFootprint.totalCapacityMW | number | Total US power capacity in MW
dataCenterProfile.usFootprint.majorLocations | array | Array of major location objects

major location object:

{
  "location": "Northern Virginia",
  "capacityMW": 150,
  "status": "Operational",
  "significance": "Microsoft's largest US data center region"
}
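
One possible consistency check on the US footprint is to compare the summed capacityMW of the major locations with usFootprint.totalCapacityMW. The sketch below assumes major locations are a subset of the US total, so their sum should not exceed it; the schema does not state that rule, and both function names are hypothetical.

```python
def major_locations_capacity_mw(us_footprint: dict) -> float:
    """Sum capacityMW over majorLocations, skipping entries without it."""
    return sum(
        location["capacityMW"]
        for location in us_footprint.get("majorLocations", [])
        if "capacityMW" in location
    )


def exceeds_reported_total(us_footprint: dict) -> bool:
    """True when the listed major locations add up to more than totalCapacityMW."""
    total = us_footprint.get("totalCapacityMW")
    if total is None:
        return False  # nothing to compare against
    return major_locations_capacity_mw(us_footprint) > total
```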

specialization

Field | Type | Description
dataCenterProfile.specialization.primaryFocus | array | Array of focus areas: “hyperscale”, “colocation”, “enterprise”, “edge”, “ai-ml”, “cloud”, “wholesale”, “retail”
dataCenterProfile.specialization.targetCustomers | array | Array of target customer types/segments
dataCenterProfile.specialization.differentiators | array | Array of competitive differentiators
dataCenterProfile.specialization.technologyFocus | array | Array of technology focus areas
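
Because primaryFocus draws from a fixed list of values, a membership check catches typos such as "AI/ML" in place of "ai-ml". A minimal sketch, with the hypothetical helper name `unknown_focus_values`:

```python
# Focus areas listed for dataCenterProfile.specialization.primaryFocus.
PRIMARY_FOCUS_VALUES = frozenset({
    "hyperscale", "colocation", "enterprise", "edge",
    "ai-ml", "cloud", "wholesale", "retail",
})


def unknown_focus_values(primary_focus: list[str]) -> list[str]:
    """Return entries not in the documented list, preserving input order."""
    return [value for value in primary_focus if value not in PRIMARY_FOCUS_VALUES]
```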

strategy object

Field | Type | Description
strategy.corporateStrategy | string | Overall corporate strategy narrative
strategy.growthStrategy | string | Growth and expansion strategy
strategy.powerStrategy | object | Power sourcing and sustainability strategy
strategy.geographicStrategy | string | Geographic expansion priorities
strategy.mAndAStrategy | string | Mergers and acquisitions approach
strategy.publicCommitments | array | Array of public commitment objects

power strategy object

Field | Type | Description
strategy.powerStrategy.approach | string | Overall power sourcing approach
strategy.powerStrategy.renewableCommitment | string | Renewable energy commitments and targets
strategy.powerStrategy.nuclearPartnerships | array | Array of nuclear partnership descriptions
strategy.powerStrategy.gridStrategy | string | Grid partnership and utility strategy

public commitment object

{
  "announcement": "Fiscal Year 2025 AI Data Center Investment",
  "date": "2025-01-03",
  "valueUSD": 80000000000,
  "scope": "AI-enabled data center construction globally",
  "timeline": "Through June 30, 2025"
}

financials object

Field | Type | Description
financials.fiscalYear | number | Fiscal year for reported metrics
financials.revenueUSD | number | Annual revenue in USD
financials.ebitdaUSD | number | EBITDA in USD
financials.netIncomeUSD | number | Net income in USD
financials.totalAssetsUSD | number | Total assets in USD
financials.totalDebtUSD | number | Total debt in USD
financials.capitalExpenditureUSD | number | Annual capex in USD
financials.dataCenterSpecific | object | Data center-specific financial metrics
financials.growthMetrics | object | Year-over-year growth rates
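
The validation checklist requires all financial figures in full USD rather than abbreviated form, while source material often quotes amounts like "$15B". The sketch below expands such strings into full integers. The suffix set (K, M, B, T) and the function name `expand_usd` are illustrative assumptions; amounts with thousands separators or currency words are rejected rather than guessed at.

```python
import re
from decimal import Decimal

# Multipliers for common suffixes; this mapping is an assumption for illustration.
_SUFFIX_MULTIPLIERS = {"K": 10**3, "M": 10**6, "B": 10**9, "T": 10**12}
_AMOUNT_RE = re.compile(r"\$?\s*(\d+(?:\.\d+)?)\s*([KMBT])?", re.IGNORECASE)


def expand_usd(text: str) -> int:
    """Convert strings such as "$15B" or "3.8T" to a full USD integer.

    Decimal avoids binary floating-point rounding; any fraction of a
    dollar left after scaling is truncated.
    """
    match = _AMOUNT_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unsupported amount: {text!r}")
    number, suffix = match.groups()
    multiplier = _SUFFIX_MULTIPLIERS[suffix.upper()] if suffix else 1
    return int(Decimal(number) * multiplier)
```

For example, `expand_usd("$15B")` yields 15000000000, the full-figure form that the USD fields expect.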

partnerships object

power providers

Field | Type | Description
partnerships.powerProviders | array | Array of power partnership objects

power partnership object:

{
  "partner": "Constellation Energy",
  "type": "nuclear",
  "capacityMW": 835,
  "details": "20-year power purchase agreement for restart of Three Mile Island Unit 1",
  "announcementDate": "2024-09-20"
}

type enum: “nuclear”, “renewable”, “utility”, “microgrid”
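
Power partnership objects lend themselves to simple aggregation, for example total contracted capacity per partnership type. A minimal sketch, assuming each object carries a type and an optional capacityMW as in the example above; the function name is hypothetical.

```python
from collections import defaultdict


def capacity_by_type(power_providers: list[dict]) -> dict[str, float]:
    """Sum capacityMW per partnership type; entries without capacityMW count as 0."""
    totals: dict[str, float] = defaultdict(float)
    for partnership in power_providers:
        totals[partnership["type"]] += partnership.get("capacityMW", 0)
    return dict(totals)
```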

construction partners

{
  "contractor": "DPR Construction",
  "relationship": "Preferred contractor for major U.S. projects",
  "projects": ["Equinix DA11 Dallas", "Equinix Ashburn North Campus"]
}

technology vendors

{
  "vendor": "NVIDIA",
  "category": "AI Hardware / GPUs",
  "details": "Primary GPU supplier for Azure AI infrastructure"
}

financial partnerships

{
  "partner": "Brookfield Asset Management",
  "type": "Renewable Energy Investment Partnership",
  "valueUSD": 10000000000,
  "details": "Joint investment in 10.5 GW renewable energy capacity"
}

key executives array

Field | Type | Description
keyExecutives[].name | string | Executive full name
keyExecutives[].title | string | Current title
keyExecutives[].role | string | Role description (CEO, CFO, CTO, etc.)
keyExecutives[].startDate | string | Date started in role (YYYY-MM-DD)
keyExecutives[].background | string | Professional background narrative
keyExecutives[].linkedin | string | LinkedIn profile URL
keyExecutives[].previousRoles | array | Array of previous position descriptions
keyExecutives[].education | string | Educational background
keyExecutives[].significance | string | Why this executive is significant

competitive analysis object

Field | Type | Description
competitiveAnalysis.marketPosition | string | Overall market position narrative
competitiveAnalysis.marketSharePercent | number | Market share percentage
competitiveAnalysis.ranking | object | Rankings by different metrics
competitiveAnalysis.strengths | array | Array of competitive strengths
competitiveAnalysis.weaknesses | array | Array of competitive weaknesses
competitiveAnalysis.opportunities | array | Array of market opportunities
competitiveAnalysis.threats | array | Array of competitive threats
competitiveAnalysis.competitiveDifferentiators | array | Array of key differentiators
competitiveAnalysis.directCompetitors | array | Array of direct competitor names
competitiveAnalysis.competitiveAdvantages | array | Array of sustainable advantages
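
The validation checklist asks for a balanced SWOT. One simple reading, assumed here, is that all four SWOT arrays are present and non-empty. The helper name is hypothetical.

```python
SWOT_FIELDS = ("strengths", "weaknesses", "opportunities", "threats")


def empty_swot_quadrants(competitive_analysis: dict) -> list[str]:
    """Return the SWOT fields that are missing or empty, in SWOT order."""
    return [name for name in SWOT_FIELDS if not competitive_analysis.get(name)]
```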

timeline array

timeline event object:

Field | Type | Description
timeline[].date | string | Event date (YYYY-MM-DD or YYYY-MM or YYYY)
timeline[].event | string | Event description
timeline[].category | enum | “founding”, “acquisition”, “expansion”, “partnership”, “financing”, “milestone”, “leadership-change”, “strategic-shift”
timeline[].significance | string | Why event is significant
timeline[].impactUSD | number | Financial impact in USD (optional)

example:

{
  "date": "2024-10",
  "event": "$15B xScale Joint Venture with GIC and CPP Investments",
  "category": "partnership",
  "significance": "Nearly triples xScale program investment capital",
  "impactUSD": 15000000000
}
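
Because timeline dates may be given at year, month, or day precision, chronological ordering needs a key that handles all three forms. The sketch below treats a missing month or day as 0, so a year-only event sorts before any dated event in the same year. That tie-breaking rule is an assumption, and the function names are hypothetical.

```python
import re

# Matches YYYY, YYYY-MM, or YYYY-MM-DD.
_DATE_RE = re.compile(r"(\d{4})(?:-(\d{2}))?(?:-(\d{2}))?")


def date_sort_key(value: str) -> tuple[int, int, int]:
    """Map a timeline date to (year, month, day), using 0 for missing parts."""
    match = _DATE_RE.fullmatch(value)
    if match is None:
        raise ValueError(f"unsupported date format: {value!r}")
    year, month, day = match.groups()
    return int(year), int(month or 0), int(day or 0)


def sort_timeline(timeline: list[dict]) -> list[dict]:
    """Return the events in chronological order without modifying the input."""
    return sorted(timeline, key=lambda event: date_sort_key(event["date"]))
```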

project references array

Links the entity to projects in the main database:

{
  "projectName": "Equinix DC12 Ashburn",
  "state": "Virginia",
  "role": "operator",
  "capacityMW": 4
}

role enum: “sponsor”, “operator”, “tenant”, “investor”, “contractor”, “customer”
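
To summarize how an entity participates across projects, the references can be tallied by role. A minimal sketch that also rejects roles outside the list above; the function name `count_roles` is an assumption.

```python
from collections import Counter

# Roles listed for projectReferences[].role.
PROJECT_ROLES = {"sponsor", "operator", "tenant", "investor", "contractor", "customer"}


def count_roles(project_references: list[dict]) -> Counter:
    """Count project references per role, raising on an undocumented role."""
    counts: Counter = Counter()
    for reference in project_references:
        role = reference["role"]
        if role not in PROJECT_ROLES:
            raise ValueError(f"unknown role: {role!r}")
        counts[role] += 1
    return counts
```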

sources array

Uses the same structure as the sources array in the project schema:

{
  "url": "https://www.example.com/article",
  "title": "Article Title",
  "date": "2024-01-15",
  "publisher": "Publisher Name",
  "type": "sec-filing"
}

type enum: “company-website”, “sec-filing”, “press-release”, “news”, “industry-publication”, “analyst-report”, “conference-presentation”, “linkedin”, “wikipedia”
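
The validation checklist asks for a minimum of 5 sources with diverse types. The sketch below checks the count, flags types outside the list above, and interprets "diverse" as at least three distinct types. That threshold is an assumption, since this document does not define it, and the function name is hypothetical.

```python
# Source types listed for sources[].type.
SOURCE_TYPES = {
    "company-website", "sec-filing", "press-release", "news",
    "industry-publication", "analyst-report", "conference-presentation",
    "linkedin", "wikipedia",
}

MIN_SOURCES = 5         # from the validation checklist
MIN_DISTINCT_TYPES = 3  # assumption: "diverse" is not defined in this document


def check_sources(sources: list[dict]) -> list[str]:
    """Return a list of problems with the sources array; empty means it passed."""
    problems = []
    if len(sources) < MIN_SOURCES:
        problems.append(f"only {len(sources)} sources; at least {MIN_SOURCES} expected")
    types = {source["type"] for source in sources if "type" in source}
    unknown = sorted(types - SOURCE_TYPES)
    if unknown:
        problems.append(f"unknown source types: {unknown}")
    if len(types & SOURCE_TYPES) < MIN_DISTINCT_TYPES:
        problems.append(f"fewer than {MIN_DISTINCT_TYPES} distinct source types")
    return problems
```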

entity type specific patterns

hyperscaler dossier

emphasizes:

  • global data center footprint
  • cloud infrastructure strategy
  • AI/ML investments
  • power partnerships (especially nuclear)
  • capex commitments

operator dossier

emphasizes:

  • portfolio composition (retail vs wholesale)
  • customer mix and enterprise relationships
  • geographic footprint
  • xScale/hyperscale capabilities
  • interconnection ecosystem

financial sponsor dossier

emphasizes:

  • portfolio companies
  • total AUM in data center sector
  • recent deals and valuations
  • exit strategies
  • investment thesis

validation checklist

When creating an entity dossier, verify that:

  • entityType matches directory location
  • entityName is official company name
  • overview.description provides comprehensive context
  • dataCenterProfile populated for relevant entity types
  • powerStrategy documented for major operators/hyperscalers
  • minimum 5 sources with diverse types
  • keyExecutives includes C-level leadership
  • timeline captures major milestones
  • competitiveAnalysis provides balanced SWOT
  • lastUpdated reflects most recent information
  • all financial figures in full USD (not abbreviated)
  • all dates in ISO 8601 format
  • projectReferences links to actual database projects
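
Two of the checklist items can be automated at the file level, sketched below. The first compares entityType with the name of the directory holding the file. The second interprets "lastUpdated reflects most recent information" as lastUpdated being no earlier than the newest source date, which is an assumption rather than a documented rule. The function name `check_dossier_file` is hypothetical.

```python
import json
from pathlib import Path


def check_dossier_file(path: Path) -> list[str]:
    """Run two file-level checklist items against one dossier JSON file."""
    dossier = json.loads(path.read_text(encoding="utf-8"))
    problems = []

    # entityType should match the {category} directory containing the file.
    if dossier.get("entityType") != path.parent.name:
        problems.append(
            f"entityType {dossier.get('entityType')!r} does not match "
            f"directory {path.parent.name!r}"
        )

    # Full YYYY-MM-DD strings compare chronologically as plain strings.
    source_dates = [
        source["date"]
        for source in dossier.get("sources", [])
        if isinstance(source.get("date"), str) and len(source["date"]) == 10
    ]
    if source_dates and dossier.get("lastUpdated", "") < max(source_dates):
        problems.append("lastUpdated is earlier than the newest source date")
    return problems
```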

For questions or schema change proposals, refer to the main data center database documentation.