Shap-E 3D Generation

Digest, Science/technology, 2023-05-18 16:00, Singapore

The week before last, OpenAI open-sourced Shap-E, a 3D generation model. The paper is available at: https://arxiv.org/pdf/2305.02463.pdf

Today I tried it out in Colab:

Installation

!git clone https://github.com/openai/shap-e.git
%cd shap-e
!pip install -e .
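
Before going further, it can help to confirm that the install worked and that Colab has actually given you a GPU. The following is a minimal sketch, not part of the official instructions: `check_environment` is a hypothetical helper name, and the only assumption is that the package is importable as `shap_e`, which is the module name the scripts in this post import from.

```python
import importlib.util


def check_environment():
    """Report whether shap_e and torch are importable and whether CUDA is visible."""
    report = {
        "shap_e_installed": importlib.util.find_spec("shap_e") is not None,
        "torch_installed": importlib.util.find_spec("torch") is not None,
        "cuda_available": None,  # stays None when torch is not installed
    }
    if report["torch_installed"]:
        import torch  # imported lazily so the check also runs without torch

        report["cuda_available"] = torch.cuda.is_available()
    return report


print(check_environment())
```

If `cuda_available` comes back False, the scripts in this post still run, because they fall back to the CPU, but sampling will be much slower.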

Text-3D

Let's start by generating a small car to see how it looks.

import torch
from shap_e.diffusion.sample import sample_latents
from shap_e.diffusion.gaussian_diffusion import diffusion_from_config
from shap_e.models.download import load_model, load_config
from shap_e.util.notebooks import create_pan_cameras, decode_latent_images, gif_widget

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
xm = load_model('transmitter', device=device)  # decodes latents into renderable 3D representations
model = load_model('text300M', device=device)  # text-conditional latent diffusion model
diffusion = diffusion_from_config(load_config('diffusion'))

batch_size = 4
guidance_scale = 15.0
prompt = "a car"

latents = sample_latents(
    batch_size=batch_size,
    model=model,
    diffusion=diffusion,
    guidance_scale=guidance_scale,
    model_kwargs=dict(texts=[prompt] * batch_size),
    progress=True,
    clip_denoised=True,
    use_fp16=True,
    use_karras=True,
    karras_steps=64,
    sigma_min=1e-3,
    sigma_max=160,
    s_churn=0,
)

render_mode = 'nerf'  # you can change this to 'stf'
size = 64  # this is the size of the renders; higher values take longer to render.

cameras = create_pan_cameras(size, device)
for i, latent in enumerate(latents):
    images = decode_latent_images(xm, latent, cameras, rendering_mode=render_mode)
    display(gif_widget(images))
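
The arguments `use_karras`, `karras_steps`, `sigma_min` and `sigma_max` point to the noise schedule from Karras et al. (2022). The sketch below computes that schedule for the values used above. It assumes Shap-E follows the paper's formula with the paper's default `rho = 7`; `karras_sigmas` is a hypothetical name and this is not Shap-E's own code.

```python
def karras_sigmas(n, sigma_min, sigma_max, rho=7.0):
    """Noise levels ordered from sigma_max down to sigma_min (Karras et al., 2022)."""
    min_inv_rho = sigma_min ** (1.0 / rho)
    max_inv_rho = sigma_max ** (1.0 / rho)
    # Interpolate linearly in sigma^(1/rho) space, then raise back to the power rho.
    return [
        (max_inv_rho + i / (n - 1) * (min_inv_rho - max_inv_rho)) ** rho
        for i in range(n)
    ]


# Same values as the sampling call: karras_steps=64, sigma_min=1e-3, sigma_max=160.
sigmas = karras_sigmas(64, 1e-3, 160)
print(sigmas[:3], sigmas[-3:])
```

The schedule takes large steps at high noise and increasingly small steps near `sigma_min`. Under this assumption, raising `karras_steps` gives a finer schedule at the cost of more model evaluations.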

Of course, you can generate other things as well; the official samples are a good source of ideas:

https://github.com/openai/shap-e/blob/main/samples.md

The generated 3D models can be saved in PLY format by running this script:

from shap_e.util.notebooks import decode_latent_mesh

for i, latent in enumerate(latents):
    # Decode each latent into a triangle mesh and write it out as a PLY file.
    with open(f'example_mesh_{i}.ply', 'wb') as f:
        decode_latent_mesh(xm, latent).tri_mesh().write_ply(f)

After that, the PLY files are saved in the shap-e folder; right-click a file and choose download to save it locally.
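
To sanity-check an exported file before downloading it, you can read the PLY header, which lists the storage format and the number of vertices and faces. This is a minimal sketch using only the standard library; `read_ply_header` is a hypothetical helper, and the demo writes its own tiny one-triangle file so that it runs without Shap-E.

```python
import os
import tempfile


def read_ply_header(path):
    """Return (format, {element_name: count}) parsed from a PLY file header."""
    fmt = None
    counts = {}
    with open(path, "rb") as f:
        if f.readline().strip() != b"ply":
            raise ValueError(f"{path} is not a PLY file")
        for raw in f:
            parts = raw.decode("ascii", errors="replace").split()
            if not parts:
                continue
            if parts[0] == "end_header":
                break  # everything after this line is vertex/face data
            if parts[0] == "format":
                fmt = parts[1]
            elif parts[0] == "element":
                counts[parts[1]] = int(parts[2])
    return fmt, counts


# Demo on a hand-written ASCII PLY containing a single triangle.
demo = (
    "ply\n"
    "format ascii 1.0\n"
    "element vertex 3\n"
    "property float x\n"
    "property float y\n"
    "property float z\n"
    "element face 1\n"
    "property list uchar int vertex_index\n"
    "end_header\n"
    "0 0 0\n1 0 0\n0 1 0\n"
    "3 0 1 2\n"
)
with tempfile.NamedTemporaryFile("w", suffix=".ply", delete=False) as tmp:
    tmp.write(demo)
fmt, counts = read_ply_header(tmp.name)
os.remove(tmp.name)
print(fmt, counts)
```

On a real export you would call `read_ply_header('example_mesh_0.ply')` instead. A face count of zero would suggest the mesh came out empty.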

Image-3D

If you have an image, Shap-E can turn it into a 3D model.

Upload the image to the shap-e folder.

Then run this script:

import torch

from shap_e.diffusion.sample import sample_latents
from shap_e.diffusion.gaussian_diffusion import diffusion_from_config
from shap_e.models.download import load_model, load_config
from shap_e.util.notebooks import create_pan_cameras, decode_latent_images, gif_widget
from shap_e.util.image_util import load_image

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

xm = load_model('transmitter', device=device)  # decodes latents into renderable 3D representations
model = load_model('image300M', device=device)  # image-conditional latent diffusion model
diffusion = diffusion_from_config(load_config('diffusion'))

batch_size = 4
guidance_scale = 3.0

image = load_image("car.png")

latents = sample_latents(
    batch_size=batch_size,
    model=model,
    diffusion=diffusion,
    guidance_scale=guidance_scale,
    model_kwargs=dict(images=[image] * batch_size),
    progress=True,
    clip_denoised=True,
    use_fp16=True,
    use_karras=True,
    karras_steps=64,
    sigma_min=1e-3,
    sigma_max=160,
    s_churn=0,
)

render_mode = 'nerf' # you can change this to 'stf' for mesh rendering
size = 64 # this is the size of the renders; higher values take longer to render.

cameras = create_pan_cameras(size, device)
for i, latent in enumerate(latents):
    images = decode_latent_images(xm, latent, cameras, rendering_mode=render_mode)
    display(gif_widget(images))
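
The image script uses `guidance_scale = 3.0`, while the text script used 15.0. Neither script explains the parameter. The sketch below shows how a guidance scale is typically applied in classifier-free guidance, which I am assuming is what Shap-E does here. `apply_guidance` and the toy numbers are hypothetical, and plain lists stand in for model outputs.

```python
def apply_guidance(cond, uncond, guidance_scale):
    """Classifier-free guidance: extrapolate from the unconditional prediction
    towards (and past) the conditional one."""
    return [u + guidance_scale * (c - u) for c, u in zip(cond, uncond)]


# Toy predictions standing in for the model's conditional/unconditional outputs.
cond = [1.0, 2.0]
uncond = [0.0, 1.0]

for scale in (1.0, 3.0, 15.0):
    print(scale, apply_guidance(cond, uncond, scale))
```

A scale of 1.0 returns the conditional prediction unchanged. Larger values push the sample harder towards the prompt or image, usually at some cost in diversity.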

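And with that, a 3D model based on the image is generated.

The result is painfully ugly, but I trust the open-source community will gradually improve this AI tool.

It feels like AI is going to take over all the work, so what will be left for us humans? 👇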

http://mp.weixin.qq.com/s?__biz=MzkwOTMzMzk0MQ==&mid=2247485234&idx=1&sn=51697f3cc9fe23f68e2fe0c66e3a98a9

Renee 创业随笔 (Renee's Startup Notes)
