AnalysisAI Models
8 days ago
Wavelet as Tokenizer: Shared token schema for audio, images, video
The paper introduces a preliminary continuous-token model using a one-level Haar DWT for audio, images, and video. It aims to replace separate modality-specific latent grids with a shared wavelet token schema. Results are preliminary.
·
8 days ago