● builder: You can apply this distillation pipeline to build deployable small models for finance NLP without requiring large labeled datasets.
● researcher: Clustering-based seed selection outperforms random sampling for synthetic data generation in low-resource domain-specific NLP tasks.
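
To make the idea of clustering-based seed selection concrete, here is a minimal sketch in plain Python. It is not the pipeline's actual implementation: the summary does not say which clustering algorithm, embedding model, or selection rule is used. The sketch assumes each candidate example is already embedded as a numeric vector, uses k-means with a deterministic farthest-point initialisation, and picks the example closest to each centroid as a seed. The function name `select_seeds` and its parameters are hypothetical.

```python
def _dist2(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def _mean(points):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(points)
    return [sum(col) / n for col in zip(*points)]


def select_seeds(vectors, k, iters=20):
    """Return sorted indices of up to k seed examples, one per cluster.

    Assumption: `vectors` are embeddings of the candidate examples.
    The clustering step is ordinary k-means, chosen for illustration only.
    """
    if not 0 < k <= len(vectors):
        raise ValueError("k must be between 1 and len(vectors)")

    # Deterministic farthest-point initialisation: start from the first
    # vector, then repeatedly add the vector farthest from all centroids.
    centroids = [list(vectors[0])]
    while len(centroids) < k:
        far = max(vectors, key=lambda v: min(_dist2(v, c) for c in centroids))
        centroids.append(list(far))

    for _ in range(iters):
        # Assign every vector to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for v in vectors:
            j = min(range(k), key=lambda c: _dist2(v, centroids[c]))
            clusters[j].append(v)
        # Recompute centroids; keep the old one if a cluster is empty.
        updated = [_mean(m) if m else centroids[j] for j, m in enumerate(clusters)]
        if updated == centroids:
            break
        centroids = updated

    # The seed for each cluster is the example nearest its centroid.
    # Duplicates collapse, so fewer than k indices can be returned.
    seeds = {
        min(range(len(vectors)), key=lambda i: _dist2(vectors[i], c))
        for c in centroids
    }
    return sorted(seeds)
```

The random-sampling baseline that this is compared against would be `random.sample(range(len(vectors)), k)`. The clustering variant instead spreads the seeds across distinct regions of the embedding space, which is the intuition behind the claim above.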