How to transcribe medical prescriptions with AILately I was chatting with a business man who has done amazing things in the medical imaging field. While conversing, he told me that he was working with his technical team on a recurrent problem that doctors have. This recurrent problem is the transcription of medical notes and medical prescription. He told me that they tried document AI APIs from Azure and other big cloud providers but the results are not up to their requirements. Specifically, he said that non textual parts of the prescriptions are hard to transcribe. For example, if you have YES or NO questions in the doctor's notes, then the document API failed to "understand" them. Another example is, if you have checkboxes in the prescriptions then the document API failed to categorize them into checked or unchecked. Fast forward a few weeks later, he asked me if I heard about Qwen2-VL models. At that time, I had heard about them and saw some people test them on different vision or language tasks, but I hadn't done any testing from my side yet. He asked if these models could address the shortcomings that they noticed when using document AI APIs from the big cloud providers. Immediately, I asked him to provide me with some samples where the cloud APIs failed so that I can test them with Qwen2-VL models. Upon testing, I was blown away by the results! These models are truly powerful in understanding documents. They can understand machine written text, human handwritten text and all sorts of symbols and structures like checkboxes, YES/NO questions, ...etc. Below you can see a dummy prescription image. I asked the model to transcribe it and I asked it to add an "X" next to the choices that were checked. What the model gave me: Truly impressive! I can hear you say, well, this can be done by OpenAI models, so what's the big fuss about? Well, Qwen2-VL has open sourced some of its models! This is an extremely important point, especially in healthcare applications. So basically, this model can be downloaded and integrated into a specific workflow that's later deployed directly in a doctor's clinic. Btw, this is a business opportunity if you can execute on it! Secret Way to Learn from the Best People in Medical ImagingHere's a secret way to learn from the best people doing medical imaging stuff.
That's it for this week's edition, I hope you enjoyed it! |
π Learn how to build AI systems for medical imaging domain by leveraging tools and techniques that I share with you! | π‘ The newsletter is read by people from: Nvidia, Baker Hughes, Harvard, NYU, Columbia University, University of Toronto and more!
Hello Reader, Welcome to another edition of PYCAD newsletter where we cover interesting topics in Machine Learning and Computer Vision applied to Medical Imaging. The goal of this newsletter is to help you stay up-to-date and learn important concepts in this amazing field! I've got some cool insights for you below β What We Learned This Year (Medical Imaging Edition) As this year wraps up, I wanted to share a few quick lessons from the projects we worked on, especially around building web...
Hello Reader, Welcome to another edition of PYCAD newsletter where we cover interesting topics in Machine Learning and Computer Vision applied to Medical Imaging. The goal of this newsletter is to help you stay up-to-date and learn important concepts in this amazing field! I've got some cool insights for you below β Zoom That Works Everywhere If you canβt zoom any pane in your web DICOM viewer, youβre doing extra work for no reason. Think of it like this: when something is small, you bring it...
Hello Reader, Welcome to another edition of PYCAD newsletter where we cover interesting topics in Machine Learning and Computer Vision applied to Medical Imaging. The goal of this newsletter is to help you stay up-to-date and learn important concepts in this amazing field! I've got some cool insights for you below β A Quick Look at Our Volume Measurement Tool One of the tools weβve been working on is a simple way to estimate 3D volumes directly inside the viewer. You start by drawing a...