This study introduces an optical neural network (ONN)-based autoencoder for efficient image processing, utilizing specialized optical matrix-vector multipliers for both encoding and decoding tasks. To ...
Generating complex scene layouts faces challenges such as cross-modal semantic alignment bias and low efficiency in modeling dynamic spatiotemporal relationships. Existing methods have limitations on ...