Abstract: Vision-Language Models (VLMs) have enabled a variety of real-world applications. The large parameter count of VLMs incurs substantial memory and computation overhead, which poses significant ...
Once stolen from actor Nicolas Cage, it eclipsed the record price set in November when a copy of "Superman No. 1" sold for $9 ...
Abstract: Enabling robots to perform everyday tasks has become increasingly important. Task planning, which decomposes task instructions into executable action sequences, is crucial for equipping ...