VITA-1.5 By AiBard123 January 7, 2025 - 2 min read VITA-1.5 is a powerful open-source interactive multimodal large language model that supports real-time vision and speech interaction.
YouTube Summary Extension By AiBard123 January 6, 2025 - 2 min read YouTube Summary Extension is a Chrome extension that uses AI to generate concise summaries of YouTube videos and supports multiple AI providers.
open-pi-zero By AiBard123 January 6, 2025 - 2 min read open-pi-zero is based on Physical Intelligence's pi0 model and is implemented with an MoE architecture and a pretrained 3B PaliGemma VLM.
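Open Canvas By AiBard123 January 6, 2025 - 2 min read Open Canvas is an open-source web application that aims to make document writing more efficient through collaboration with agents, with support for memory and custom actions.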
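flexrag By AiBard123 January 6, 2025 - 2 min read FlexRAG is a flexible and efficient framework designed for multimodal retrieval-augmented generation tasks, supporting simple configuration and high-performance applications.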
LatentSync By AiBard123 January 6, 2025 - 2 min read LatentSync is a lip-sync framework based on audio-conditioned latent diffusion models that improves temporal consistency and synchronization accuracy.