How to transcribe medical prescriptions with AILately I was chatting with a business man who has done amazing things in the medical imaging field. While conversing, he told me that he was working with his technical team on a recurrent problem that doctors have. This recurrent problem is the transcription of medical notes and medical prescription. He told me that they tried document AI APIs from Azure and other big cloud providers but the results are not up to their requirements. Specifically, he said that non textual parts of the prescriptions are hard to transcribe. For example, if you have YES or NO questions in the doctor's notes, then the document API failed to "understand" them. Another example is, if you have checkboxes in the prescriptions then the document API failed to categorize them into checked or unchecked. Fast forward a few weeks later, he asked me if I heard about Qwen2-VL models. At that time, I had heard about them and saw some people test them on different vision or language tasks, but I hadn't done any testing from my side yet. He asked if these models could address the shortcomings that they noticed when using document AI APIs from the big cloud providers. Immediately, I asked him to provide me with some samples where the cloud APIs failed so that I can test them with Qwen2-VL models. Upon testing, I was blown away by the results! These models are truly powerful in understanding documents. They can understand machine written text, human handwritten text and all sorts of symbols and structures like checkboxes, YES/NO questions, ...etc. Below you can see a dummy prescription image. I asked the model to transcribe it and I asked it to add an "X" next to the choices that were checked. What the model gave me: Truly impressive! I can hear you say, well, this can be done by OpenAI models, so what's the big fuss about? Well, Qwen2-VL has open sourced some of its models! This is an extremely important point, especially in healthcare applications. So basically, this model can be downloaded and integrated into a specific workflow that's later deployed directly in a doctor's clinic. Btw, this is a business opportunity if you can execute on it! β Secret Way to Learn from the Best People in Medical ImagingHere's a secret way to learn from the best people doing medical imaging stuff.
That's it for this week's edition, I hope you enjoyed it! |
π Learn how to build AI systems for medical imaging domain by leveraging tools and techniques that I share with you! | π‘ The newsletter is read by people from: Nvidia, Baker Hughes, Harvard, NYU, Columbia University, University of Toronto and more!
Hello Reader, Welcome to another edition of PYCAD newsletter where we cover interesting topics in Machine Learning and Computer Vision applied to Medical Imaging. The goal of this newsletter is to help you stay up-to-date and learn important concepts in this amazing field! I've got some cool insights for you below β DICOM Viewer with Volume Rendering What you're seeing below is a DICOM viewer with 4 views: axial, coronal, sagittal and volume rendering. All integrated into one web app!We've...
Hello Reader, I haven't sent the newsletter in a while! Sorry about that! I have been extremely busy with our agency PYCAD. We've been working on several projects for our clients and it's taking most of my time. Since this is the first newsletter I send in 2025, I thought what's better than sharing the new trends in AI for MedTech! So below, you'll see some of the trends and interesting news that are coming from the field! Top Healthcare Tech Trends for 2025 When it comes to healthcare...
Hi Reader, Another week and another PYCAD edition! This week I would like to share with you 2 things: An important insight that every medical professional should know about AI. My experience renting online servers to train my machine learning models, and how you can benefit from it. Without further ado, let's jump right into it! Medical Professionals Do NOT know This About AI When I see medical professionals online posting about how they use AI in their work, itβs usually one of 2 categories:...