Kozo-KI

Posted on May 18

Building a Technical Literature Dashboard with Power Automate Power Apps LDX hub StructFlow

#powerautomate #powerapps #ai #structflow

Verification Case Report #006 — 2026.05.18 | Time spent: ~3 days

Overview

Metric	Value
Documents processed	18
Power Apps screens	4
Power Automate flows	2
StructFlow extracted fields	8

We built TechLit Viewer — an end-to-end technical literature management system that uses LDX hub StructFlow to extract structured data from documents, Power Automate to keep everything updated, and both Power Apps and a standalone HTML dashboard for visualization.

Background

Many customers express a need to continuously collect and evaluate technical literature across diverse fields. The old approach — storing PDFs and patent documents in folders — made it nearly impossible to answer questions like "which technologies are at which maturity level?" or "which documents are most relevant to our business?"

We set out to verify whether a practical technical literature management system could be built within a no-code/low-code stack by combining LDX hub StructFlow for AI extraction with Microsoft 365 tools for storage, automation, and display.

What we're testing: Can StructFlow turn unstructured technical documents (PDFs, patents, reports) into structured data that integrates cleanly with SharePoint, Power Automate, and Power Apps?

System Architecture: 5 Layers

// TechLit Viewer — System Overview

[1] INPUT         Source files (PDF / patents / reports)
                  └─ Stored in SharePoint document library. 18 documents total.

[2] AI EXTRACTION LDX hub StructFlow
                  └─ Auto-extracts 8 fields:
                     Title / DocType / Authors / Year / FieldMajor /
                     TRL / RelevanceScore / Summary

[3] DATA LAYER    SharePoint Lists (TechLit_Master / TechLit_Metrics)
                  └─ Power Automate writes; Power Apps + HTML dashboard reads.

[4] AUTOMATION    Power Automate (2 flows)
                  └─ ① Auto-extract on item update  (TechLit_Pipeline_UPDATE)
                     ② Manual bulk processing       (TechLit_BulkUpdate)

[5] DISPLAY       Power Apps (TechLit_Viewer) + HTML dashboard
                  └─ 4-screen Power Apps + standalone HTML dashboard

Flow Design: Two Flows for Two Purposes

① TechLit_Pipeline_UPDATE (always-on)

Fires automatically when a SharePoint list item is updated. Detects new document registrations or changes to existing records, sends the file to StructFlow, and immediately writes the extracted results back. Day-to-day document additions are fully automated via this flow.

② TechLit_BulkUpdate (manual trigger)

A manual-trigger flow for bulk processing all 18 documents. Used after schema changes, prompt revisions, or for initial data loading. Iterates through all items with a foreach loop, waits for StructFlow polling to complete, and writes results back to the list.

Key design insight: The update trigger excels at immediacy but isn't suited for full reprocessing. The manual trigger handles bulk operations well but requires operator action. Using both flows together covers both daily operations and administrative tasks.

StructFlow Extraction Schema: 8 Fields

Schema design started from the question: "what do we want to decide?" — not "what can we extract?"

Field	Description	Type
Title	Document title	string
DocType	Document type (patent / paper / report / other)	string
Authors	Authors / applicants	string
Year	Publication / filing year	integer
FieldMajor	Primary technology domain (Materials Science, Energy Engineering, etc.)	string
TRL	Technology Readiness Level (1–9)	integer
RelevanceScore	Relevance to our business (high / medium / low)	string
Summary	2–3 sentence technical summary	string

About TRL (Technology Readiness Level): Originally developed by NASA and adopted by EU Horizon programs. Levels 1–3 = basic research, 4–6 = demonstration, 7–9 = operational/production. Quantifying maturity makes it easy to distinguish research-stage from production-ready technologies at a glance.

Power Apps: 4 Screens for Different Use Cases

Screen	Features
Screen 1: Search	Free-text search by title, author, or domain. Filtering by DocType, TRL, RelevanceScore. Real-time queries against SharePoint list.
Screen 2: Detail View	Full field display for individual documents. Review StructFlow-extracted Summary. Visual TRL and relevance indicators.
Screen 3: Metrics Comparison	TRL distribution by technology domain. Document count by relevance. Year-over-year trend charts.
Screen 4: Tech Dashboard	Advanced visualization via embedded HTML component. Dynamic charts with Chart.js. Integrated full-text search over all 18 documents.

HTML Dashboard: Standalone Strategic View

Alongside Power Apps, we built a standalone HTML dashboard (techlit_dashboard.html) that runs entirely in the browser — no Power Apps dependency. This makes it easy to share with executives or external stakeholders.

// StructFlow extraction results stored directly as a data array
const data = [
  {
    id: 17,
    title: 'Immobilized Photocatalyst Paper',
    docType: 'patent',
    authors: '[Author name masked]',
    year: 2002,
    fieldMajor: 'Chemistry',
    trl: 4,
    relevance: 'high',
    url: ''  // SourceFileUrl — will become a live link once populated
  },
  // ... 18 documents total
];

The dashboard includes 4 charts (technology domain distribution, TRL distribution, year-over-year trend, document type breakdown) plus a searchable full-document table. Passing StructFlow's structured data to Chart.js generates all analysis charts automatically.

Results

StructFlow Extraction Accuracy

15 of 18 documents (83%) had all fields extracted correctly. The remaining 3 showed mixed-language inconsistencies in FieldMajor (English vs. Japanese) — no impact on usability, but a clear prompt improvement opportunity.

Power Automate Automation

Average processing time per document: ~67 seconds (including StructFlow polling). Full run of all 18 documents: ~20 minutes. Both scheduled and trigger-based execution confirmed stable.

Technology Domain Visibility

Environmental Science was the largest category (6 documents), followed by Materials Science (4). TRL distribution: basic research (1–3) was highest at 8 documents; operational stage (7–9) had 3. For the first time, we had a quantitative picture of the entire portfolio.

Relevance Scoring

15 of 18 documents scored RelevanceScore = High. By embedding our business context (translation & localization) into the StructFlow system prompt, we achieved automatic screening aligned with our actual evaluation criteria.

Biggest insight: from "searching" to "discovering"
Technical documents that could previously only be searched by filename can now be filtered across TRL, technology domain, and relevance simultaneously. Being able to instantly narrow down to "TRL ≥ 4 AND Materials Science AND High relevance" fundamentally changes the speed of technology strategy decisions.

Lessons Learned

1. Lock down SharePoint column types before you start

Column types (hyperlink vs. single line of text) can't be changed after the fact. Confirm the format Power Automate will write, then finalize column design before building.

2. Design the StructFlow schema from your decision criteria

Don't start from "what can we extract?" — start from "what do we need to decide?" Including evaluation-axis fields like TRL and RelevanceScore from the beginning dramatically increases dashboard value.

3. Separate Power Apps and HTML dashboard by purpose

Power Apps suits day-to-day search and update operations; the HTML dashboard suits sharing and presentations. Using two UIs on the same data source expands both the audience and the use cases.

4. Control FieldMajor normalization via the prompt

Mixed English/Japanese values like "Materials Science" and "材料科学" split aggregations. Adding an explicit instruction to the StructFlow system prompt ("always output technology domains in English") prevents this drift.

Next Steps

Item	Details
Populate SourceFileUrl	Add source URLs to the SharePoint list so users can reach original documents in one click from the dashboard
Schedule automation	Move from manual triggers to fully automated monthly runs with auto-distribution to leadership
Normalize FieldMajor	Add English-only instruction to StructFlow prompt and reprocess all 18 documents
Scale to 100+ documents	Add ExtractDoc as a pre-processing step to handle scanned PDFs
RefineLoop integration	Auto-translate and improve summaries of foreign-language documents (English, Chinese) into Japanese

Kawamura International is a translation and localization company documenting its AI process experiments in public. StructFlow, RefineLoop, RenderOCR — and whatever comes next.

DEV Community