PyTorch Encoder/Decoder

Apple’s new AI model recreates 3D objects with realistic lighting effects from a single image

Apple researchers have created an AI model that reconstructs a 3D object from a single image while keeping lighting effects consistent across viewing angles.

IEEE

SED++: A Simple Encoder-Decoder for Improved Open-Vocabulary Semantic Segmentation

Abstract: Open-vocabulary semantic segmentation aims to partition an image into distinct semantic regions based on an open set of categories. Existing approaches primarily rely on image-level ...

GitHub

USR 2.0

MediaPipe is the default (CPU-based). For higher-accuracy landmark detection, install the ibug packages. This requires a CUDA GPU.

IEEE

Evaluation of Encoder-Only Transformer for Multi-Step Traffic Flow Prediction

Abstract: Traffic flow prediction is critical for Intelligent Transportation Systems to alleviate congestion and optimize traffic management. The existing basic Encoder-Decoder Transformer model for ...

GitHub

Official Implementation of Dyn-O: Building Structured World Models with Object-Centric Representations (NeurIPS 2025)

The code is validated on Python 3.10.14 + CUDA 11.8 + PyTorch 2.4.0. It should work with newer Python, CUDA, and PyTorch versions, but this is not guaranteed. conda create -n ...
