License: Apache
Development language: Python
Operating system: Cross-platform
Software type: Open-source software
Open-source organization: None
Region: China
Submitter: game
Intended for: Unknown
Recorded: 2023-05-19

Software Introduction

VisualGLM-6B is an open-source multimodal dialog language model that supports images, Chinese, and English. The language model is based on ChatGLM-6B and has 6.2 billion parameters; the image part bridges the visual model and the language model by training BLIP2-Qformer, bringing the overall model to 7.8 billion parameters.
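
The Q-Former bridge mentioned above can be understood as a small set of learnable query vectors that cross-attend to the vision encoder's output features and produce a fixed-length sequence of soft prompt tokens for the language model. A minimal NumPy sketch of that idea follows; all names, shapes, and weights here are illustrative assumptions, not VisualGLM-6B's actual implementation:

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def qformer_bridge(image_feats, queries, W_k, W_v, W_proj):
    """One cross-attention step: learnable queries attend to image features,
    then the result is projected into the language model's embedding space."""
    d = queries.shape[-1]
    K = image_feats @ W_k                         # (n_patches, d)
    V = image_feats @ W_v                         # (n_patches, d)
    attn = softmax(queries @ K.T / np.sqrt(d))    # (n_query, n_patches)
    attended = attn @ V                           # (n_query, d)
    return attended @ W_proj                      # (n_query, d_lm)

rng = np.random.default_rng(0)
n_patches, d, d_lm, n_query = 257, 64, 128, 32    # toy sizes, not the real ones
image_feats = rng.standard_normal((n_patches, d))  # frozen vision-encoder output
queries = rng.standard_normal((n_query, d))        # learnable in the real model
W_k, W_v = rng.standard_normal((d, d)), rng.standard_normal((d, d))
W_proj = rng.standard_normal((d, d_lm))

prompt_tokens = qformer_bridge(image_feats, queries, W_k, W_v, W_proj)
print(prompt_tokens.shape)  # (32, 128): a fixed-length image prefix for the LM
```

However many patches the vision encoder emits, the language model always receives the same fixed number of image tokens, which is what lets the pretrained text model consume images without architectural changes.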

VisualGLM-6B is pre-trained on 30M high-quality Chinese image-text pairs from the CogView dataset together with 300M filtered English image-text pairs, with Chinese and English weighted equally. This training approach better aligns visual information to the semantic space of ChatGLM. In the subsequent fine-tuning stage, the model is trained on long visual question-answering data to generate answers that match human preferences.

Related News
2023/06/21 14:28

Ant Group confirms it is developing a language and multimodal large model, internally named "Zhenyi"

According to an exclusive report by the Science and Technology Innovation Board Daily, Ant Group's technology R&D team is developing a language and multimodal large model, internally named "Zhenyi". The project is highly valued by Ant Group's management and has been underway for several months. A multimodal large model is one trained on a combination of modalities such as text, images, video, and audio. OpenAI co-founder Ilya Sutskever has previously said, "The long-term goal of AI is to build a multimodal neural network, that is, AI that can learn the concepts between different modes, so as to better understand the world."
