Question 1

How long does an in-house BSA typically take to reach 85% accuracy?

Accepted Answer

Production-quality bank statement analysis at 85% accuracy across the realistic Indian statement mix (private, PSU, co-operative, and Account Aggregator payloads) typically takes 18 to 24 months from first commit to first underwriting-grade output. The first 9 months get teams to a working extractor on clean private-bank native PDFs at around 70 to 78% accuracy. The next 6 to 9 months are absorbed by degraded scans, password-protected statements, multi-account aggregation, and the long tail of PSU and regional bank formats that do not follow modern layout conventions. The final lift to 85% is where teams stall: without dedicated R&D and a labelled corpus, in-house BSAs commonly plateau between 78% and 82%.

Question 2

What does a 4-person BSA team actually cost annually fully loaded?

Accepted Answer

A 4-FTE BSA team at typical Indian engineering compensation (one OCR/document-AI lead, two ML/parser engineers, and one operations engineer for the labelled-corpus pipeline) runs roughly ₹1.4 to ₹1.6 crore per year fully loaded, which works out to about ₹35 to ₹40 lakh per engineer. Fully loaded means base pay, variable pay, benefits, equipment, workspace, and manager allocation. This figure excludes the labelled corpus cost (analyst time and statement procurement), GPU infrastructure for any deep models, and the parallel platform team needed for API, security, and audit trail. Add 30 to 40% on top once those are realistically included.
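As a rough worked example of the arithmetic in this answer, the sketch below multiplies out the team cost and applies the 30 to 40% uplift for the excluded items. The function and variable names are illustrative only, and the ₹1.6 crore upper bound is taken directly from the quoted range rather than derived from a per-engineer figure.

```python
# All amounts are in lakh (1 crore = 100 lakh) so the arithmetic stays exact.
LAKH_PER_CRORE = 100


def team_cost_lakh(headcount: int, cost_per_engineer_lakh: int) -> int:
    """Annual fully loaded cost of the core team, in lakh."""
    return headcount * cost_per_engineer_lakh


def with_uplift(base_lakh: int, uplift_pct: int) -> float:
    """Add a percentage uplift for corpus, GPU, and platform-team costs."""
    return base_lakh * (100 + uplift_pct) / 100


def to_crore(amount_lakh: float) -> float:
    return amount_lakh / LAKH_PER_CRORE


# Low end: 4 engineers at 35 lakh each (figures quoted in the answer).
base_low = team_cost_lakh(4, 35)
# High end: the quoted upper bound of 1.6 crore, expressed in lakh.
base_high = 160

# Pair the low base with the 30% uplift and the high base with the 40% uplift
# to get the widest plausible range.
total_low = with_uplift(base_low, 30)
total_high = with_uplift(base_high, 40)

print(f"Team only: Rs {to_crore(base_low):.2f} to {to_crore(base_high):.2f} crore per year")
print(f"With excluded items: Rs {to_crore(total_low):.2f} to {to_crore(total_high):.2f} crore per year")
```

Under these assumptions the all-in figure lands at roughly ₹1.8 to ₹2.2 crore per year, before any multi-year projection.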

Question 3

Why are co-operative banks more expensive to support than private banks?

Accepted Answer

Three reasons. First, layout fragmentation: private banks converged on a handful of statement layouts over the last decade, while co-operative banks publish in dozens of regional templates, often with headers that mix Devanagari or other regional scripts. Second, image quality: co-operative bank statements are frequently scanned or printed from dot-matrix systems, requiring image cleanup and degraded-scan OCR pipelines that are unnecessary for native private-bank PDFs. Third, change cadence: co-operative banks change layouts more frequently and without notice, so parsers drift silently. The combined effect is roughly 3x the engineering effort per bank covered, and the maintenance tax is permanent, not one-time.

Question 4

Does it make sense to build for proprietary signal IP?

Accepted Answer

Almost never for the extraction layer; sometimes for the scoring layer on top. The extraction layer, which turns a PDF or AA payload into normalised, categorised, validated transactions, is a solved engineering problem where building in-house is unlikely to beat a credible vendor on accuracy. The scoring layer, which turns normalised transactions into a lender's specific risk signals, policy rules, and decisioning logic, is where proprietary IP belongs. Most successful lenders license the analyzer for extraction and build their own scoring on top of the normalised output. Building the analyzer to protect scoring IP invests in the wrong layer: the IP that matters lives in scoring, not extraction.

Question 5

What is the typical break-even volume for buy-vs-build?

Accepted Answer

There is no clean volume break-even because the build cost is dominated by fixed coverage and maintenance, not throughput. A team building for 5,000 statements per month spends roughly the same as a team building for 50,000, because the bank coverage matrix and the format-drift maintenance burden are the same in both cases. The real break-even depends on coverage and the accuracy ceiling: if the lender can tolerate sub-80% accuracy on a narrow private-bank-only mix, building can be defensible. The moment the portfolio extends into PSU lending, MSME, or co-operative-banked borrowers, the build path's coverage tax makes the buy path materially cheaper at any volume.

Bank Statement Analyzer: Build vs Buy Cost Calculator

How this works

Describe your statement mix

Describe the team you would assemble

Read the three-year TCO

Your build profile

How to read the output

The realistic build timeline


