Stars
Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
llama3 implementation one matrix multiplication at a time
SEED-Story: Multimodal Long Story Generation with Large Language Model
StyleShot: A SnapShot on Any Style. A model that can transfer any style to any content, generating high-quality, personalized stylized images without per-image fine-tuning!
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Reformats a source table into a target table by applying LLM-based transformations.
A programming framework for agentic AI 🤖
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Universal and Transferable Attacks on Aligned Language Models
🧠 A curated list of awesome ChatGPT resources, including libraries, SDKs, APIs, and more. 🌟 Please consider supporting this project by giving it a star.
The platform for building AI from enterprise data
Official documentation for DemoGPT, a powerful framework for developing language model-powered applications. Explore comprehensive guides and references to unlock the potential of language models w…
A curated list of awesome resources, tools, and other shiny things for GPT prompt engineering.
WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)
Create 🦜️🔗 LangChain apps by just using prompts 🌟 Star to support our work!
Ecoute is a live transcription tool that provides real-time transcripts for both the user's microphone input (You) and the user's speakers output (Speaker) in a textbox. It also generates a suggest…
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
CodeTF: One-stop Transformer Library for State-of-the-art Code LLM
ChatGPT directly uses internal documents for Question Answering.