Stability AI releases "Stable Video Diffusion," an AI that generates video from text or images

Stability AI, the developer of the image generation AI "Stable Diffusion," has released "Stable Video Diffusion," a latent video diffusion model that can generate high-resolution video from text or images.

Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets — Stability AI

Stable Video Diffusion is being released as a research preview, and its source code is available in a GitHub repository.

GitHub - Stability-AI/generative-models: Generative Models by Stability AI
https://github.com/Stability-AI/generative-models

The weights needed to run the model locally are available on Hugging Face.

stabilityai/stable-video-diffusion-img2vid-xt · Hugging Face
https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt

Stable Video Diffusion has been released as two image-to-video models, one generating 14 frames and the other 25 frames, and it can produce video at a customizable frame rate between 3 fps and 30 fps.
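To get a feel for what these numbers mean in practice, here is a minimal sketch that works out how long a clip lasts for a given frame count and frame rate. It assumes the frame rate acts purely as a playback rate for a fixed number of generated frames, which simplifies how the model itself uses the setting. The function name `clip_duration_seconds` and the `MODEL_FRAMES` table are hypothetical names used only for this illustration and are not part of any Stability AI code.

```python
# Illustrative arithmetic only: clip length = frames / frame rate.
# Assumption: fps is treated as a plain playback rate for a fixed frame count.

MIN_FPS = 3    # lower bound of the frame rate range stated in the article
MAX_FPS = 30   # upper bound of the frame rate range stated in the article

# Frame counts of the two released image-to-video models (from the article).
MODEL_FRAMES = {"14-frame model": 14, "25-frame model": 25}


def clip_duration_seconds(num_frames: int, fps: float) -> float:
    """Return the playback length in seconds of num_frames shown at fps."""
    if num_frames <= 0:
        raise ValueError("num_frames must be positive")
    if not MIN_FPS <= fps <= MAX_FPS:
        raise ValueError(f"fps must be between {MIN_FPS} and {MAX_FPS}")
    return num_frames / fps


def duration_range(num_frames: int) -> tuple[float, float]:
    """Return (shortest, longest) clip length over the supported fps range."""
    return (
        clip_duration_seconds(num_frames, MAX_FPS),
        clip_duration_seconds(num_frames, MIN_FPS),
    )


if __name__ == "__main__":
    for name, frames in MODEL_FRAMES.items():
        shortest, longest = duration_range(frames)
        print(f"{name}: {shortest:.2f} s at {MAX_FPS} fps, "
              f"{longest:.2f} s at {MIN_FPS} fps")
```

Under this simplification, the 14-frame model yields clips of roughly 0.5 to 4.7 seconds and the 25-frame model roughly 0.8 to 8.3 seconds, so both models produce short clips regardless of the chosen frame rate.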

Entering the prompt "Ice dragon in the mountains" generates an animation that matches the description.

"Astronaut walking on the moon"

"Two blue jays on the top of building"

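Stability AI has published the results of comparing Stable Video Diffusion with Runway Research's GEN-2 and pika.art's PikaLabs, based on user ratings of video quality (vertical axis). One chart covers Stable Video Diffusion generating 14 frames (shown in purple).

Another chart covers Stable Video Diffusion XT, which can generate 25 frames (also shown in purple).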

License: Attribution-NonCommercial-NoDerivatives

두우우부
