Video Inbetweening using 3D Convolutions | TF Hub

最新推荐文章于 2022-08-01 00:10:56 发布

XianxinMao

最新推荐文章于 2022-08-01 00:10:56 发布

阅读量269

点赞数

分类专栏： Neural Network 神经网络与深度学习 Tensorflow 文章标签： 3d tensorflow 人工智能深度学习 python

本文链接：https://blog.csdn.net/xianxinmao/article/details/120942906

版权

Tensorflow 同时被 3 个专栏收录

75 篇文章

订阅专栏

Neural Network

61 篇文章

订阅专栏

神经网络与深度学习

29 篇文章

订阅专栏

video_inbetweening_conv3d

From Here to There: Video Inbetweening Using Direct 3D Convolutions, 2019

Current Hub characteristics:

has models for BAIR Robot pushing videos and KTH action video dataset (though this colab uses only BAIR)
BAIR dataset already available in Hub. However, KTH videos need to be supplied by the users themselves
only evaluation (video generation) for now
batch size and frame size are hard-coded

import tensorflow as tf

import matplotlib.pyplot as plt
import numpy as np
import seaborn as sns
import tensorflow_hub as hub
import tensorflow_datasets as tfds

from tensorflow_datasets.core import SplitGenerator
from tensorflow_datasets.video.bair_robot_pushing import BairRobotPushingSmall

import tempfile
import pathlib
from tensorflow_docs.vis import embed
import imageio


def to_gif(images):
    converted_images = np.clip(images * 255, 0, 255).astype(np.uint8)
    imageio.mimsave('./animation.gif', converted_images, fps=1)
    return embed.embed_file('./animation.gif')


TEST_DIR = "/home/csdn/Downloads/Tensorflow_tutorial/TF_Hub/CV/bair_robot_pushing_small/softmotion30_44k/test/"

# Since the dataset builder expects the train and test split to be downloaded,
# patch it so it only expects the test data to be available
builder = BairRobotPushingSmall()
test_generator = SplitGenerator(name='test', gen_kwargs={"filedir": str(TEST_DIR)})
builder._split_generators = lambda _: [test_generator]
builder.download_and_prepare()

# @title Load some example data (BAIR).
batch_size = 16

# If unable to download the dataset automatically due to "not enough disk space", please download manually to Google Drive and
# load using tf.data.TFRecordDataset.
ds = builder.as_dataset(split="test")
test_videos = ds.batch(batch_size)
first_batch = next(iter(test_videos))
input_frames = first_batch['image_aux1'][:, ::15]
input_frames = tf.cast(input_frames, tf.float32)

所以 [:, ::15]，取的索引分别是 0 和 15，因为 30 超过了 0 - 29 的索引范围，同时下方的两个图也进行了验证

# @title Visualize loaded videos start and end frames.

print('Test videos shape [batch_size, start/end frame, height, width, num_channels]: ', input_frames.shape)
sns.set_style('white')
plt.figure(figsize=(4, 2 * batch_size))

for i in range(batch_size)[:4]:
    plt.subplot(batch_size, 2, 1 + 2 * i)
    plt.imshow(input_frames[i, 0] / 255.0)
    plt.title('Video {}: First frame'.format(i))
    plt.axis('off')
    plt.subplot(batch_size, 2, 2 + 2 * i)
    plt.imshow(input_frames[i, 1] / 255.0)
    plt.title('Video {}: Last frame'.format(i))
    plt.axis('off')
plt.show()

hub_handle = 'https://tfhub.dev/google/tweening_conv3d_bair/1'
module = hub.load(hub_handle).signatures['default']

filled_frames = module(input_frames)['default'] / 255.0

# Show sequences of generated video frames.

# Concatenate start/end frames and the generated filled frames for the new videos.
generated_videos = np.concatenate([input_frames[:, :1] / 255.0, filled_frames, input_frames[:, 1:] / 255.0], axis=1)
to_gif(generated_videos[1])

for video_id in range(4):
    fig = plt.figure(figsize=(10 * 2, 2))
    for frame_id in range(1, 16):
        ax = fig.add_axes([frame_id * 1 / 16., 0, (frame_id + 1) * 1 / 16., 1],
                          xmargin=0, ymargin=0)
        ax.imshow(generated_videos[video_id, frame_id])
        ax.axis('off')
    fig_name = 'generated_videos_' + str(video_id) + '.png'
    plt.savefig(fig_name)

把填充生成的 video frames 保存成 gif