mermaid`graph LR
subgraph Data-Collection
direction TB
a["START"] -->|Collect UVM PDFs|b[Raw PDFs]
end
subgraph Get boxes
direction TB
b -->|OCR + bounding boxes\nusing\nmask_rcnn_X_101_32x8d_FPN_3x|c(Boxed OCR)
c -->|Annotate boxes|d(More\nannotated\nboxes)
d -->|Get better boxes\nusing\nfaster_rcnn_R_50_FPN_3x|c
d -->|Train\nusing\nlayoutlmv3-base|e(Predict\nbox categories)
end
subgraph Get course entries NER
direction TB
b -->|OCR|f(OCRed data)
f -->|Annotate text|g(Annotated\ncourse\nentries)
g -->|Fine-tune\nLlama 2 7B|h(Fine-tuned Llama 2)
end
subgraph Bayesian-Model
direction TB
e -->i(Clean\ncourse data)
h -->i
end
`