|
Hi Reader, Welcome to the PYCAD newsletter, where every week you receive doses of machine learning and computer vision techniques and tools to help you learn how to build AI solutions to empower the most vulnerable members of our society, patients. |
Generating images from text is cool. But do you know what's cooler? Animating those images like the gif below!
So how can you animate images that were generated using text to image models?
Introducing AnimateDiff.
This is a new approach that can be built on top of text to image models such as stable diffusion.
Here's how it works from a high level perspective:
Given a base text2image model (e.g., Stable Diffusion), AnimateDiff method first trains a motion modeling module on video datasets to encourage it to distill motion priors.
In simpler terms, it tries to model what motion is in videos.
During this stage, only the parameters of the motion module are updated, thereby preserving the feature space of the base text2image model.
At inference, the once-trained motion module can turn any personalized model tuned upon the base text2image model into an animation generator.
Then it can produce diverse and personalized animated images via iteratively denoising process.
Below is a diagram of this process.
This model can make your still images full of life and without any finetuning of the text-to-image model.
You can read more about AnimateDiff in their original paper. You can also check the code here.
​
So lately I've been exploring different ways to make this newsletter more useful for you as a reader. In order to do that, I thought I should directly ask you!
So Reader, how can AIFEE newsletter be more useful for you?
In order to help you answer this question, you can ask yourself the following questions:
Answering any of these questions will be a tremendously appreciated feedback!
Believe me, I don't take your feedback lightly. In fact, this newsletter subscribers are the first people I share my ideas with when it comes to doing any sort of content creation.
Although I have a larger following on LinkedIn, I just love and prefer bouncing ideas with you because I believe we truly have a beautiful community here and I want this community to thrive. That can only be done if I can truly help you achieve your goals with AI and machine learning.
That's it for this week's edition, I hope you enjoyed it!
👉 Learn how to build AI systems for medical imaging domain by leveraging tools and techniques that I share with you! | 💡 The newsletter is read by people from: Nvidia, Baker Hughes, Harvard, NYU, Columbia University, University of Toronto and more!
Hello Reader, Welcome to another edition of PYCAD newsletter where we cover interesting topics in Machine Learning and Computer Vision applied to Medical Imaging. The goal of this newsletter is to help you stay up-to-date and learn important concepts in this amazing field! I've got some cool insights for you below ↓ Zoom That Works Everywhere If you can’t zoom any pane in your web DICOM viewer, you’re doing extra work for no reason. Think of it like this: when something is small, you bring it...
Hello Reader, Welcome to another edition of PYCAD newsletter where we cover interesting topics in Machine Learning and Computer Vision applied to Medical Imaging. The goal of this newsletter is to help you stay up-to-date and learn important concepts in this amazing field! I've got some cool insights for you below ↓ A Quick Look at Our Volume Measurement Tool One of the tools we’ve been working on is a simple way to estimate 3D volumes directly inside the viewer. You start by drawing a...
Hello Reader, Welcome to another edition of PYCAD newsletter where we cover interesting topics in Machine Learning and Computer Vision applied to Medical Imaging. The goal of this newsletter is to help you stay up-to-date and learn important concepts in this amazing field! I've got some cool insights for you below ↓ How We Build Our DICOM Viewers Using Plugins One thing we focus on when building DICOM viewers is keeping every feature as a separate plugin. This gives the app a clean structure...