top of page
Writer's picturemansour ansari

The AI Revolution: A New Focus and a World of Possibilities


As an explorer on the digital frontier, a novice, I might add, but enthusiastically learning, my journey has taken me through many fascinating landscapes. From my deep dive into Quantum Computing to my recent pivot into the fast-paced world of AI, it's been an exhilarating ride. Don't worry; I'm still passionately engaged with Quantum Computing – the news, learning tools, and exciting developments continue to be a significant part of my work. But the lightening speed at which AI is advancing has captured my curiosity, and I feel the need to keep pace.


Today, I find myself enchanted by the creative capabilities of AI, with tools like Midjourney and others acting as my paintbrush on a canvas that stretches as far as the imagination can reach. It's like finding my creativity on a power boost, supercharged and rearing to create marvels that, until now, lived only in the realms of dreams.


Over the next few posts, I will share my journey of discovering the whispers that can sway the AI, the secret techniques that help create, innovate, and fascinate. I'm thrilled to explore the boundaries of what's possible today and even more excited about the infinite potential of tomorrow. Imagine what we might achieve when Quantum Computing takes the driver's seat in powering image generation currently dependent on high-speed GPU farms. The possibilities are mind-boggling.


In this post, we dive into the exciting process of building consistent images and giving them life. So sit back, and let's embark on this incredible AI adventure together.


It's time to swap faces!

In a rapidly evolving digital era, the boundaries between reality and virtuality are fading. In the limelight today is an astonishing fusion of artificial intelligence, creativity, and cinema, which could potentially transform the way we create and consume content. Here, I share my recent experiment that brings together Midjourney AI, a face-swapping plugin, and D-ID, an AI for creating video from still images.


Midjourney AI is a phenomenal tool that lets you craft intricate, high-definition images. For this experiment, I wanted to recreate my image in a new setting, specifically as the character from the iconic Harrison Ford's "Lost Ark" cinematic poster. With Midjourney AI and a face-swapping plugin, I was able to superimpose my face onto the poster with stunning precision.

But the magic didn't stop there. I wanted to bring this image to life. With the help of D-ID, an AI capable of turning still images into moving pictures, I transformed this still image into a video. I supplied some sample text, and D-ID made 'me' speak those words, all within the confines of Harrison Ford's face.


The next exciting step in this process is training an AI to emulate my voice. The goal? To replace the AI-generated voice in the video with my own, bringing another layer of personalization and authenticity to the content. Although this might seem like a significant undertaking, it's a powerful illustration of how we can harness AI for innovative and cost-effective content creation.


This kind of technology allows for amazing possibilities, particularly in the realms of marketing and education, where high-quality, engaging content is key. The scope for creativity is truly infinite, and this process is just the beginning.


A Snowy Day Adventure


Next project: A Snowy Day Adventure


Here is the children's rhyme


The street corner stood quiet, under the blanket of night,

With a lone snowman watching, beneath the soft starlight.

And high above the town, where the little girl sleeps,

A bird's eye view of the neighborhood, as the silent night creeps.


A day full of magic, full of winter's cheer,

A memory she would hold, and forever endear.

With each falling snowflake, in the heart of the night,

Dreams danced in her head, until the morning light.

Once upon a time, in a town covered in white,

A little girl woke up to a magical sight.

Big eyes full of wonder, hair tied in a tail,

Peered out of her window at the fresh snowy trail.


Through the pane of her window, she saw children at play,

Building snowmen and laughing on this fine winter's day.

Pulling on her boots, and buttoning her coat,

She thought, "a day in the snow, now that gets my vote!"


Down in the neighborhood streets, full of winter's delight,

Children played, and snowballs took flight.

She rolled a ball small, and one that was grand,

Two perfect snowmen, standing tall in the land.


As the day waned and the chill began to bite,

She thought of something warm, and hot chocolate felt right.

At the local coffee shop, sipping her treat,

She watched the snow fall, oh what a sweet feat!


Returning to her snowman, as the daylight began to dim,

She felt a small tug, a whimsical whim.

She bid her snow friends goodnight, under the setting sun,

And decided it was time to go home, the day's fun was done.


In her cozy living room, with mom and dad,

They laughed and they talked about the fun they had.

Homework by the fireplace, a task not so dire,

Beside the warmth and love, of the family fire.


Night came with her jammies, and a paper so white,

She tried to draw her snowman, under the soft moonlight.

Her eyes grew heavy, as she began to doze,

Dreaming of her snowman, in a peaceful repose.


These are some of the images I created to build the story.

So, to bring the children's rhyme to life, I harnessed the power of ChatGPT4. With its help, I was able to generate a consistent narrative that felt both engaging and authentic. With the story in hand, I turned to Midjourney, creating a sequence of images that reflected each twist and turn of our tale.

The next step was editing. Here, Sumo Paint, an affordable alternative to PhotoShop, came to the rescue. This allowed me to handle the layering, clean-up, regeneration, and scaling with precision and efficiency.

The assembled images were then combined with voiceover and background music, a task deftly handled by Pictory AI and the Philmora Video Editing system.

What's truly remarkable is the hardware needed to pull this off - a Desktop Windows 10 Pro with 32 Gig RAM. There was no need for a powerful GPU to render these images; the cloud-based process took care of it all. While there is a cost for the cloud services and access to the GPU power, the collaboration of these tools makes it all worthwhile.



Next project: A Snowy Day Adventure





8 views

Comments


bottom of page