Technical Paper

Scalability assumptions

1.8 Scalability assumptions

This section expresses capacity through workload variables. The architecture evaluates user count, receipt density, line-item count, retry rate, and batch policy together.

SymbolMeaning
UDaily active users
rAverage receipts per user per day
aShare of receipts submitted for processing
LAverage line items per receipt
vShare of verified receipts eligible for rewards
eAverage event records per receipt
ρ_ocrOCR retry rate
ρ_llmLLM retry or self-consistency rate
BOn-chain settlement batch size

Daily processed receipt volume:

text
R_d = U × r × a

Approximate monthly hot-data growth:

text
Rows_m ≈ 30 × R_d × (1 + L + e)

Daily model-call volume:

text
OCR_d ≈ R_d × (1 + ρ_ocr)
LLM_d ≈ R_d × (1 + ρ_llm)

Daily reward and settlement volume:

text
Verified_d ≈ R_d × v
Onchain_batches_d ≈ ceil(Verified_d / B)

Public variable-cost form:

text
Cost_d ≈ OCR_d × c_ocr + LLM_d × c_llm + Storage_d × c_storage + Settlement_d × c_chain

Capacity decisions are made by measuring U, r, L, retry rates, and batch policy together. Numeric thresholds and provider-specific cost coefficients remain in operational planning.