(LLM) Models

48 bookmarks

Custom sorting

Qwen Chat

Qwen Chat offers comprehensive functionality spanning chatbot, image and video understanding, image generation, document processing, web search integration, tool utilization, and artifacts.

·chat.qwen.ai·Mar 20, 2025

Qwen Chat

Manus

Manus is a general AI agent that turns your thoughts into actions. It excels at various tasks in work and life, getting everything done while you rest.

·manus.im·Mar 12, 2025

Manus

open-thoughts/OpenThinker-32B · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

·huggingface.co·Feb 14, 2025

open-thoughts/OpenThinker-32B · Hugging Face

DeepSeek v3 - Advanced AI & LLM Model Online

DeepSeek v3 is a powerful AI-driven LLM with 671B parameters, offering API access and research paper. Try our online demo for state-of-the-art performance.

·deepseekv3.org·Jan 28, 2025

DeepSeek v3 - Advanced AI & LLM Model Online

Introducing Llama 3.1: Our most capable models to date

Bringing open intelligence to all, our latest models expand context length, add support across eight languages, and include Meta Llama 3.1 405B— the first frontier-level open source AI model.

·ai.meta.com·Jul 23, 2024

Introducing Llama 3.1: Our most capable models to date

Collaborative Control for Geometry-Conditioned PBR Image Generation

null

·unity-research.github.io·Feb 14, 2024

Collaborative Control for Geometry-Conditioned PBR Image Generation

LGM

null

·me.kiui.moe·Feb 14, 2024

LGM

QLoRa: Fine-Tune a Large Language Model on Your GPU

Fine-tuning models with billions of parameters is now possible on consumer hardware

·towardsdatascience.com·Jun 8, 2023

QLoRa: Fine-Tune a Large Language Model on Your GPU

What are Large Language Models (LLMs)?

In this article, we will understand the concept of Large Language Models (LLMs) and their importance in natural language processing.

·analyticsvidhya.com·Aug 17, 2023

What are Large Language Models (LLMs)?

XLNet

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

·huggingface.co·Aug 17, 2023

XLNet

DistilBERT

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

·huggingface.co·Aug 17, 2023

DistilBERT

microsoft/DeBERTa: The implementation of DeBERTa

The implementation of DeBERTa. Contribute to microsoft/DeBERTa development by creating an account on GitHub.

·github.com·Aug 17, 2023

microsoft/DeBERTa: The implementation of DeBERTa

XLM-RoBERTa

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

·huggingface.co·Aug 17, 2023

XLM-RoBERTa

RoBERTa: An optimized method for pretraining self-supervised NLP systems

Facebook AI’s RoBERTa is a new training recipe that improves on BERT, Google’s self-supervised method for pretraining natural language processing systems. By training longer, on more data, and dropping BERT’s next-sentence prediction RoBERTa topped the GLUE leaderboard.

·ai.meta.com·Aug 17, 2023

RoBERTa: An optimized method for pretraining self-supervised NLP systems

How Huawei PanGu-Alpha is Revolutionizing AI and Machine Learning

How Huawei PanGu-Alpha is Revolutionizing AI and Machine Learning TS2 SPACE

·ts2.space·Aug 17, 2023

How Huawei PanGu-Alpha is Revolutionizing AI and Machine Learning

LG’s hyperscale AI EXAONE 2.0 to be launched for drug development this year - Pulse by Maeil Business News Korea

LG AI Research has unveiled EXAONE 2.0, a hyperscale artificial intelligence (AI) language model that can be used for expert applications in the development of new materials or medicines. During LG’s AI Talk Concert event

·pulsenews.co.kr·Aug 17, 2023

LG’s hyperscale AI EXAONE 2.0 to be launched for drug development this year - Pulse by Maeil Business News Korea

AI21 Studio

A powerful language model, with an API that makes you smile

·ai21.com·Aug 17, 2023

AI21 Studio

ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for...

Pre-trained models have achieved state-of-the-art results in various Natural Language Processing (NLP) tasks. Recent works such as T5 and GPT-3 have shown that scaling up pre-trained language...

·arxiv.org·Aug 17, 2023

ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for...

GPT-NeoX

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

·huggingface.co·Aug 17, 2023

GPT-NeoX

GPT-J

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

·huggingface.co·Aug 17, 2023

GPT-J

GPT-Neo — EleutherAI

A set of 3 decoder-only LLMs with 125M, 1.3B, and 2.7B parameters trained on the Pile.

·eleuther.ai·Aug 17, 2023

GPT-Neo — EleutherAI

Product

Our API platform offers our latest models and guides for safety best practices.

·openai.com·Aug 17, 2023

Product

Meta Open-Sources 175 Billion Parameter AI Language Model OPT

Meta AI Research released Open Pre-trained Transformer (OPT-175B), a 175B parameter AI language model. The model was trained on a dataset containing 180B tokens and exhibits performance comparable with GPT-3, while only requiring 1/7th GPT-3's training carbon footprint.

·infoq.com·Aug 17, 2023

Meta Open-Sources 175 Billion Parameter AI Language Model OPT

OPT

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

·huggingface.co·Aug 17, 2023

OPT

google/mt5-base · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

·huggingface.co·Aug 17, 2023

google/mt5-base · Hugging Face

google-research/multilingual-t5

Contribute to google-research/multilingual-t5 development by creating an account on GitHub.

·github.com·Aug 17, 2023

google-research/multilingual-t5

Exploring Google’s T5 Text-To-Text Transformer Model | T5_transformer – Weights & Biases

In this article, we'll explore the architecture and mechanisms behind Google’s T5 Transformer model, from the unified text-to-text framework to the comparison of T5 results.

·wandb.ai·Aug 17, 2023

Exploring Google’s T5 Text-To-Text Transformer Model | T5_transformer – Weights & Biases