HACKOBAR_item
[arXiv]score: 0.24

Information Extraction from Electricity Invoices with General-Purpose Large Language Models

April 30, 2026
A new arxiv study benchmarks Gemini 1.5 Pro and Mistral-small on structured information extraction from Spanish electricity invoices using the IDSEM dataset, testing 19 hyperparameter configurations across 6 prompting strategies with zero fine-tuning. The headline finding: prompt engineering dominates over hyperparameter tuning, with few-shot strategies outperforming zero-shot baselines by over 19 F1 percentage points while parameter sweeps yield marginal gains. Enterprise ML teams automating document processing pipelines should prioritize prompt design investment over model tuning cycles. This reinforces emerging evidence that in-context learning rivals fine-tuned extractors for domain-specific semi-structured documents.
cs.CL