Data Catalog

Everything on Officium comes from public government records. This is the complete list of datasets currently available in the app.

Free users see recent data only in the app UI. Create a free account to unlock the full historical versions of these files, including older trades, patterns, and records on former members.

Transparency: every file below is also available as bulk JSON at /data/*.json with no login — UI gates are for navigation convenience, not access control.

Built for accountability research — not just trade copying. Every dataset is public record and directly crawlable.

Now fully integrated: verified Senate LDA deep samples (revolving-door pipeline), official House & Senate PFD indices (wealth, assets, positions), smoking-gun precomputed signatures/scores, PAC leaderboard context, and historical members flags — all wired into dossiers, officials roster, watchlists and risk views. 28MB patterns + smoking + PFD cross signals form the core accountability layer.

patterns.json
Automated flags when a trade date is close to a bill’s introduction on Congress.gov. Severity is a computer score — not a legal judgment. Always verify with official filings.
26,465 timing flags records
28 MB
Automated
congress-trades.json
Official STOCK Act periodic transaction reports only (QuiverQuant House + Senate PTR ingest). Pattern-inferred rows are not included — use patterns.json for timing flags.
4,980 rows • 81 members records
~1.5 MB
Partial sample
fara-representations.json
Foreign Agents Registration Act filings (foreign principals & agents)
17k+ records
8.3 MB
Official
lda-perbioguide.json
Revolving-door lobbying — real LD-2 filings ingested from the Senate LDA public API where a former member of Congress is a named lobbyist (e.g. Coats→PhRMA, Royce→credit unions, Begich→Royal Caribbean, Bridenstine→space, Katko→SecurityScorecard). Precision-filtered: filings must fall outside the member's years in office and identity is confirmed by covered-position or filing volume. Replaces the prior synthetic per-member base.
23 former-member lobbyists • 432 real LD-2 registrations records
222 KB
Official
lda-historical-perbioguide.json
Senate LDA historical byBioguide merged from lda-perbioguide (23) + senateLdaDeepByBioguide (13 verified deep). Synthetic base removed 2026-05.
34 members • 393 regs records
~45 KB
Official
house-lda-perbioguide.json
House LDA: under the Lobbying Disclosure Act, filings are submitted to both chambers, so House lobbying is fully covered by the unified real Senate-LDA-API dataset (lda-perbioguide). This separate file is retained empty to avoid duplicate/synthetic data.
Superseded by unified LDA records
~1 KB
Partial samplenot wired
senate-lda-deep-api-samples.json
Senate LDA Deep API pulls (real public lda.senate.gov REST API): LD-203 lobbyist contributions (dates, amounts, payees) + LD-2 quarterly filings (clients, issues, income), each verified by filing UUID. Powers the revolving-door deep signals.
16 real samples (LD-203 + LD-2) records
~8 KB
Partial sample
house-pfds-index.json
House Personal Financial Disclosures (official PFDs) — assets, liabilities, gifts, positions, multi-year wealth trajectory. Covers 97% of sitting House members (75% with full asset detail). Asset data is PDF-locked; the residual 'error' filings are predominantly non-asset types (90-day extensions, candidate $5k threshold notices, withdrawals, PTRs).
1,027 members • 97% of current House records
3.2 MB
Official
senate-pfds-index.json
Senate Personal Financial Disclosures (official PFDs) — assets, liabilities, gifts, positions. Now indexed byBioguide (92% of filings resolved to a member), consumed directly by the signals hook.
3,766 filings • 132 senators (byBioguide) records
3.9 MB
Official
senate-giftrule-travel.json
Senate Gift Rule Disclosures — reimbursed travel (staff + Member) as new conflict signals (byBioguide + byOffice)
1,880 (122 Senate offices) records
134 KB
Official
committee-assignments.json
House/Senate/Joint Committee + Subcommittee assignments (current Congress) with Bioguide IDs, ranks, roles. Official Clerk + Senate LIS XMLs (via canonical processing). Foundational for committee overlap patterns (sector_capture, positionConflict enrichment).
528 members • 3,879 assignments records
1.0 MB
Official
ethics-cases.json
House & Senate ethics cases with byBioguide index (fuzzy name match from byMember keys + memberName).
324 cases • 261 byBioguide records
252 KB
Official
fec-ie-perbioguide.json
FEC Schedule E independent expenditures (committees spending for/against candidates) — real, from FEC bulk data across the 2024 and 2026 cycles: support/oppose splits, top spending committees, and per-cycle breakdown. Captures cross-office runs (e.g. a House member's Senate campaign) via the FEC candidate master.
499 members • $1.73B IE (2024 + 2026) records
413 KB
Official
fec-itemized-perbioguide.json
FEC candidate financial summaries — real, from official FEC bulk data (All Candidates / weball, 2026 cycle): receipts, disbursements, individual + PAC/committee contributions per member. Now enriched with real itemized individual contributions (FEC bulk indiv → 4.1M donations across 541 members): top donors, top employers, and donor-state breakdowns. Members without 2026 filings keep their real 2024 summary; PAC/committee donor detail lives in fec-detailed.
765 members • 541 with itemized individual donors (4.1M contributions, $557M) records
2.2 MB
Official
fec-detailed-perbioguide.json
FEC committee-to-candidate contributions — real, ingested from official FEC bulk data (Contributions to Candidates / itpas2 joined to the committee master, 2026 cycle). Per-member PAC/committee donors with exact dates, amounts, and committee IDs/names. Replaces the prior synthetic sample set entirely. Independent expenditures (24A/24E) are tracked separately in fec-ie.
538 members • 112,167 real contributions ($250.6M, 2026 cycle) records
2.9 MB
Official
govtrack-members.json
Current member roster enriched with authoritative profiles (unitedstates/congress-legislators), plus recent PTR filers patched from historical-members when missing from the base export.
542 current + PTR filers (full profiles) records
856 KB
Official
patterns-member-index.json
Compact severity counts and top pattern types per bioguide — used on home, officials, compare, and watchlist instead of loading the 28 MB patterns.json.
Per-member pattern counts records
421 KB
Automated
patterns-public.json
Freemium slice of patterns.json (evidence on/after 2023-01-01). Loaded on /patterns for logged-out users.
8,007 post-2023 flags records
~10 MB
Automated
patterns-recent-high.json
Pre-filtered HIGH patterns for the /today pulse page (avoids loading full patterns.json).
HIGH severity findings records
~3 MB
Automated
patterns-by-member/
One file per bioguideId — dossier loads only the active member (~KB–MB each) instead of the 28 MB monolith.
2,232 JSON shards records
~30 MB
Automated
historical-members.json
Former member records enriched with authoritative profiles (unitedstates/congress-legislators): term/service history, FEC candidate IDs, ID crosswalk, and bio — for revolving-door and historical research.
2,692 former officials (full profiles) records
3.2 MB
Official
govtrack-votes.json
Real roll-call metadata from GovTrack (119th Congress): question, result, margin, related bill. Browsable at /votes.
1,342 roll-call votes (119th Congress) records
854 KB
Official
govtrack-vote-positions.json
Per-member Yea/Nay (and related) on the 100 most recent roll calls ingested via GovTrack vote_voter (exact roll-call timestamp filter). Shown on /votes and dossiers — not joined to P1 conflict patterns (those use bill introduction dates).
100 roll calls • 536 members records
~9.7 MB
Partial sample
member-travel.json
Privately sponsored travel with byBioguide index (conservative name match). Dossier prefers index over fuzzy scan.
8,233 trips • 567 byBioguide records
2.3 MB
Official
pac-leaderboard.json
Top PACs by contributions to members
PACs records
46 KB
Official
smoking-gun-report.json
Precomputed high-signal research overlays: 9 signatures (insider_trader, pfd_opacity, revolving_door, sector_capture, family_business, money_pipeline, captured_chair, foreign_influence, apex_target) + smokingGunScore + detector hits (top 100 detailed)
581 members • 9 signatures records
626 KB
Automated
data-gaps-audit.json
Transparency & completeness audit across all sources
Coverage matrix records
1.8 MB
Automated
departed-officials.json
Former members (118th→119th transitions) + lifetime FEC receipts/disbursements — wired into dossier + officials roster (logged-in historical view)
Recently departed records
~1.4 MB
Official
Transparency Coverage Snapshot (from data-gaps-audit.json)
Active traders (congress-trades.json)
81
3.0% of historical roster
Members in conflict patterns
82.9%
Members with FEC IDs
85.2%
Donor profile coverage
24.1%
Travel (name-matched)
%
Ethics (name-matched)
%
Roll-call positions (sample)
%
Current / committee roster
%
not full vote history
These gaps explain why some dossiers are richer than others. The raw audit JSON contains the full per-member matrix.
Want the complete historical archive? The original production project offers bulk downloads and an OpenAPI surface. This Next.js port focuses on the highest-signal current data for fast exploration.Learn more →
For Researchers, Journalists & Machines
This site is intentionally crawlable and data-friendly.
  • Dynamic sitemap includes every member dossier URL (/dossier?bid=...) for complete discoverability.
  • • All raw JSON datasets are statically served from /data/*.json — directly fetchable by scrapers, scripts, or LLMs.
  • • Structured data (JSON-LD Person) on every dossier page for rich search engine and agent understanding.
  • • Full methodology transparency via the live data-gaps-audit.json and this catalog.
  • • Personal research tools like Watchlists with per-member conflict insights and one-click exports.
  • • No paywall on current public data. Free accounts unlock everything historical for deeper longitudinal research.
  • Precomputed high-risk overlays now fully wired: smoking-gun-report.json delivers 9 signatures (insider_trader, pfd_opacity, revolving_door, sector_capture, family_business, money_pipeline, captured_chair, foreign_influence, apex_target) + smokingGunScore. Available in Dossier (Research Flags + risk integration), Officials roster (sortable column + export), and Watchlist.
  • Official vs. automated: Stock trades are official PTR filings only. Conflict Patterns are computer timing flags. Dossier badges are research hints — always verify at FEC.gov, Congress.gov, and disclosure offices.
  • Coverage: Many members lack trades or ethics rows in our files. See data-gaps-audit.json for an honest per-member matrix.
NOTICE: All data is derived from official U.S. public government records (including Personal Financial Disclosure reports under the Ethics in Government Act). Use is subject to the restrictions of 5 U.S.C. app. § 105 and related statutes. Officium surfaces this information for research, journalistic, educational, and public accountability purposes only. Commercial solicitation, credit screening, or other prohibited uses may violate applicable law.
Bulk downloads and client-side exports (CSV/JSON) carry the same restriction notice in metadata/comments.
Quick Bulk Downloads
All files are public domain / CC0 where applicable. Use freely for research, scraping, or analysis. Subject to 5 U.S.C. app. § 105 restrictions (see notice above).