Seeking arXiv cs.CL endorsement, local LLM clinical NLP benchmark (Ollama, 5 models)

Hey, I am an independent researcher looking for a cs.CL endorsement for my first arXiv paper.

What I did: Ran 5 open-weight models locally via Ollama (Q4_K_M) on an L40S — Phi-3.5-mini, Mistral-7B, BioMistral-7B, Llama-3.1-8B, and Llama-3.3-70B, across 4 different FHIR serialisation strategies for medication reconciliation. 4,000 inference runs, 200 synthetic patients, exact-match F1 evaluation. Note: how you format the input data matters as much as which model you pick.

If you’re an active arXiv cs.CL author and willing to endorse, please DM me, happy to share the draft and endorsement code.

Thanks.