WizardLM LLaMA based fine-tuning big language model

Collection three

WizardLM is participating 2021 OSC China Open Source Project Selection , please vote for it!

WizardLM in 2021 OSC China Open Source Project Selection {{projectVoteCount} has been obtained in, please vote for it!

2021 OSC China Open Source Project Selection It is in hot progress. Come and vote for your favorite open source project!

2021 OSC China Open Source Project Selection>>> Midfield Review

WizardLM won the 2021 OSC China Open Source Project Selection "The Best Popularity Project" !

Authorization Agreement unknown

development language Python View source code »

operating system Cross platform

Software type Open source software

Classification Neural network/artificial intelligence 、 LLM (Large Language Model)

Open source organizations nothing

region Unknown

deliverer game

intended for unknown

Recording time 2023-05-09

Software Home Page

Software documentation

Official download

Fast download

overview
information
Blog
Q&A
security information

Software Introduction

WizardLM is a fine tuned 7B LLaMA model. It uses a large number of instructions with different difficulties to follow the dialogue to fine tune. The novelty of this model is that it uses LLM to automatically generate training data.

The WizardLM model uses a new method called Evol Instruct (a new method that uses LLM generation of human beings to independently batch generate open instructions of various difficulty levels and technical ranges to improve the LLM ability) to train through 70k computer generated instructions. This method generates instructions with different difficulty levels.

Evol Inspect uses the following five operations to expand prompts:

Add Constraint
deepen
Concretization
Add reasoning steps
Complex input

These operations are sequentially applied to the initial instruction to make it more complex, and the reply is generated by LLM.

Expand to read the full text

code

Gitee index of is

exceed Items for

comment

Click to lead the topic 📣 Post and join the discussion 🔥

No content temporarily

Blogged

No more

No content temporarily

Issued a question and answer

No more

No content temporarily

Awesome software

Change

HeidiSQL -Database management client

Fooltrader -Quantitative analysis trading system

flink - Apache Flink

L3AF -Lightweight eBPF project

age -File encryption tool

MathOCR -Mathematical formula recognition system

Datav.js -Visual JavaScript Library

Spark IM -IM client

Miniflux 2 -Minimal news reader

enGrid -CFD application grid generation software

CentOS -RHEL derived Linux distribution

exa -Ls command alternatives

ply -Linux Dynamic Tracker

LLaVA -Large multi-modal model for end-to-end training

Neutralinojs -Lightweight desktop application development framework

Gummi -Easy LaTeX editing tool

Foundation construction -Xiuxian Game

OpenRA -"Command and Conquer: Red Alert" game engine

Invest Alchemy -Investment trading system

BBDown -Command Line Bili Bili Downloader

MOSS -Large language model of dialogue

UPX -Super compression tool

More yarn black body

Enlightenment -Bilingual Multimodal Large Language Model

Worker file manager -File manager under UN * X system

GPT-2 -Large language model based on transformer

MiLM-6B -Xiaomi AI large model

Tianji -Website analyzer+status monitor+service status reporting

Stanford CoreNLP -Natural language analysis tool written in Java

AliOS Things -Lightweight IoT embedded operating system

Hot content

More highlights

Top 10 distinctive highlights of the new ORM mybatis mp

The first anniversary of open source, the new version of Qinghua was released

MooTool 1.6.1 is released, and developers always have small tools

Jboot v4.1.6 release, based on JFinal microservice framework

Blazork8s released a new version, updated AI execution command logic, and added multiple AI functions

Kangaroo database tool v5.0 has been released

Open Source Daily | Out of the box ChatTTS installation package; Scaling Law is an empirical formula; Two baby dads AI revive old toys; Academicians of the Chinese Academy of Engineering talk about AI; The story of autonomous kernel MCU is hard to tell? TikTok "American Special Edition" recommended algorithm

SQLE 3.2405.0 Release

Maximizing the Effective Throughput of LLM Serving

State fine-tuning, PointRWKV, Chinese documents online... The latest news of RWKV community in May is coming!

Java AI has a bright future

Databases that do not need data

This PHP application server looks a bit trendy! FrankenPHP

The Code R&D management platform - the innovative realization of the integration of daily reports and working hours

The easy-to-use ORM framework mybatis-mp 1.5.3 was released

GRPC 1.64.1 release, cross language RPC framework

Malware is rampant in pirated Microsoft Office

Bun's May update: performance improvement and memory optimization

Fedora Traditional Artistic Energy - Anaconda, the default web UI installer, skipped the ticket again

PDM -- Modern Python Package Manager

Java virtual machine runtime data area

The use of script engine in Java in games

Simulate login process

Thread safe container class

Protostuff serialization analysis

Analysis of Paxos algorithm

Eclipse submits the project to github

RDD programming

Three ways for Eclipse to connect Hadoop analysis

Eclipse translation plug-in

ZAB protocol and Paxos algorithm

Master Slave of Message Middleware Activemq

Android sdk manager updates download slowly or cannot be downloaded

Netty Custom Protocol

Maven Assembly Build Release Package

Use exceptions only for exceptions

Android problem summary

If other types are more suitable, try to avoid using strings

Redis Transaction Processing

Spark local mode operation

What is 2PC/3PC

Linux enable root user

Some understanding of coroutine

Basic type takes precedence over packing basic type

Asynchronous programming RxJava - Introduction

Introduction and use of EasyProtocol

Netty's IP filtering

Eclipse remote connection Hbase

Netty Implements Shadowsocks Client

Create gradle project in Eclipse

Eclipse plug-in protobuf dt

Server side architecture of social games

Probability algorithm for sending props in the game

About Game Cache

Implementation of RPC function based on JMS

Installation and use of Redis

Netty emulates the Redis server

Java stream io and block io

Maven package contains source code

An integration test based on ELK5.1 (Elastic Search, Logstash, Kibana)

The for each loop takes precedence over the traditional for loop

Selection under parallel programming

Performance Comparison of Java Compression Algorithms

Thrift Agent Hbase

An experience and analysis of Dubbo

HttpClient executes Https request and analysis

Hessian experience and analysis

Installation and use of M2E and Extras

Redis Storage Object

Java source code, reverse code and complement code

From ACID to CAP/BASE

Redis script implements distributed locks

If you need accurate answers, avoid using float and double

Java Instrumentation for hot replacement

Excel2db excel converted to binary file

Java Random Analysis

Hexo carries a personal blog on github

Jee's problems after using maven

Eclipse plug-in development instance

Redis window environment

Broker Cluster of Message Middleware Activemq

Redis cluster

Getting Started with Bash

Java I/O analysis (before jdk1.4)

BlockingQueue for Java Concurrent Package Learning

Reactor and Proactor modes

Java proxy - Javassist

Java character set and encoding

Debugging Android Programs with Real Machine in Eclipse

Kafka Quick Start

Introduction to Java distributed applications

Github manages Eclipse distributed projects

Performance comparison and analysis of Disruptor and LinkedBlockingQueue

MySQL procedures and functions (records)

Infobright uses

Android+eclipse+maven environment construction

Comparison and analysis of lvs, haproxy and nginx load balancing

The problem that ToolProvider. getSystemJavaCompiler() is empty

Netty Encoding and Decoding Based on Protobuf Protocol

Spring integrates Hessian and analysis

Popular comments of the whole site

Monkeys think of apes 2024-05-31 18:31

You can cheat your brother. Just don't cheat yourself

Ma Nong Little Fatty Brother 2024-05-16 14:40

I give you six seconds. I give you six moves with the same effect in the martial arts contest, which shows the invincibility and confidence of the master

monkey_cici 2024-05-09 00:25

My I9 CPU, 64GB memory module and 3080Ti computer are inferior to the top configuration of 19999 on a tablet

MrChen89 2024-04-29 09:18

There are a group of people like this. I don't know what they have experienced. When it comes to HW, I can't say anything good, even if it's neutral

-SORA- 2024-04-30 17:07

When this happened in a foreign country, the comment area suddenly became very objective and rational**

Yokesily 2024-06-02 15:11

So designed

Small and beautiful software development 2024-06-01 05:06

Cheat one's job

One code Yma 2024-05-09 09:58

Recently, I often go to interviews. People who hate Ali background most regard me as a fool, even though I am a fool

Voice of God 2024-06-01 20:47

By default, injection ($) and splicing are turned off. If you want to use it, you need to sign the birth and death form and press the fingerprint.

Code craftsman 2024-06-01 11:22

I also said "user controllable parameters"

Brother Xiao Yang 2024-06-01 20:39

Isn't Ali developed? What are you afraid of? There's no need for every family to set up a set

Xiao Xu Middle aged 2024-06-01 06:49

thank

jalena 2024-05-31 23:57

I can imagine that I will also receive the CVE repair request next week..... I don't use the key!!!!!!!!!

hanf 2024-05-31 17:45

If only the design and architecture are similar, what's the point? Good things must be learned, and you can't prove that the design is not the same. As for the source code, you also said that neither Oracle nor Damon is open source, and you can't prove it. There are many people who question Dream, but so far, no one has come up with strong evidence. You should at least provide evidence to copy

Xiao Xu Middle aged 2024-06-01 07:03

good

Francesca 2024-05-19 18:00

Wine runs the Android emulator of Windows. Chrome OS is installed in the Android emulator. Linux environment is installed in chrome OS. Linux environment is installed in the Linux environment. Wine is installed in the Android emulator

osc_566335 2024-04-28 14:44

This is also called floor washing? Does it mean that Tesla will not wash the floor if it releases all the source code? Some people HWptds? That is to say, the language is ambiguous, which will also rise to the washing ground? Are some people too focused? Think the people he pays attention to must be staring at?

All the way north GP 2024-04-25 14:55

America, the future of mankind

Monkeys think of apes 2024-05-31 18:31

You can cheat your brother. Just don't cheat yourself

haol666 2024-05-31 18:56

This story is powerful, I take it seriously, until I see the end.

Bright 2024-05-19 23:25

What a fool! I killed myself. How can people deal with me later.

Bright Stars 2 2024-05-31 23:28

Remove Unsafe? You don't want netty anymore?

Hakuna 2024-05-31 18:28

It is compatible with Oracle, but does not know "just" or "just". Those who can be compatible with Oracle and do well are real men and real warriors. You should know that compatibility means that even bugs must be compatible, and you have no other code that can not be copied. It's all based on real skills and understanding of oracle.

zhy 2024-05-16 13:16

At the end of Shannon is Nong

oldpig 2024-04-28 09:59

”Huawei contributed all the source code "?, the title is completely inconsistent with the content.

Ning Jinnong 2024-06-01 21:04

Correct it. The example of loading the library is wrong. It should be # library=@ loading the dynamic library, "./yards to the treasurer. dll"

zzeric 2024-04-28 20:01

Although France is the parent community, the core developers of OCCT on github are all Russians. Without Russians, the French parent community cannot continue to operate. So Huawei took over, moved to China, changed its name and resumed open source and community operations. What's the problem?

One code Yma 2024-05-06 09:14

My technical article was moved by CSDN. Why didn't anyone step on the sewing machine? This kind of report is a joke to me. The monsters with background are fine, and the monsters without background fight to death

Yoona520 2024-05-17 16:34

Zhou Hongyi is now living more and more like a clown. If he stays behind the scenes, he has to become an online celebrity. Can you learn from Lei Jun?

Li Yinghui 2024-05-09 16:40

Buddhism has a good word, evil opinion. In dealing with the world, it is meaningless to draw conclusions from preset positions; It is also important to receive good logic training.

Apizza 2024-06-01 17:52

You can switch from lodash to radash in 2024!!!

The seven in one little King Kong 2024-06-02 15:54

Those people only use resources, others are not developed by NPM...

looly 2024-06-02 14:32

@Qingmiao Hutool has also been mentioned some loopholes that I think are relatively "low-level", or I think are not loopholes. At first, I was also very angry, but after thinking it through, I found that CVE's idea was that once you did not actively remind users that there was a pit, the user fell into the pit is your fault, that is, your vulnerability. For example, as a traffic policeman, you should remind everyone who crosses the road to pay attention to safety, and ask him to answer whether he knows. Once you don't remind someone and are hit by a car, you can't get away from it. Similarly, when using frameworks and tools, you should provide at least one parameter to remind users that there may be SQL injection vulnerabilities. Note that it is not in the comments, but in the method parameters, which is the user's responsibility. Therefore, it is not comprehensive to provide solutions in comments or documents.

Chief taxi captain 2024-05-17 11:17

I suggest that 360 open source all its products, and then become the leading enterprise in the domestic open source industry through open source, leading everyone to compete with foreign enterprises

kangaroo 2024-06-01 22:23

The next version focuses on improving existing functions * improving internal power and qi * and continues to move towards the goal of Grand Master.

infoworld 2024-05-11 15:12

Universities should use open source free software instead of commercial ones. In this way, hands and feet will not be tied technically.

kakai 2024-05-10 10:21

The world only knows that Android was created by Google. Several people know that Android is only a product acquired by Google. Similarly, what is the problem with Huawei's contribution to the collection of OGG open source work and integration into its own proprietary product line?

osc_92224065 2024-04-29 10:57

Long term oppressed outsourcing of state-owned enterprises

xiaoqibabby 2024-05-15 17:36

The bank is strongly required to be responsible for

sunday12345 2024-05-15 18:31

What does the bank do? It's blamed on the remote desktop. Persimmons really pick up soft pinches~?

Love to eat raw pears 2024-06-01 19:18

Don't expect programmers to have a deep understanding of the document. I still think that since the tool hides the details of $#, some necessary security checks are necessary. Many people do not use MybatisPlus directly, but use various so-called rapid development platforms. The MyBatisPlus rapid development platform Snowy, Guns, etc., has an impression that many versions have the problem of using Wrapper directly to splice the Request parameter. I remember that JeecgBoot was opened a lot of CVEs last year or the year before last because of the Wrapper splicing problem. Do you know the author of ibeetl? Many CVE blaming holes have been opened before. The problem is similar. The lack of basic knowledge "script editing permission" is actively handed over to the front end. What a low-level error or even low-energy behavior. However, I accepted it with an open mind and added a white list check.

Qin Liming 2024-05-11 09:12

be devoid of any sense of shame

young crops 2024-06-01 16:21

There is no tipping point. There are also many official documents stating that SQL fragments involving direct string splicing need to be controlled by the user, and specific solutions are also provided. If you say that the value part is injected, then we are also 100% free of any dispute. This obvious SQL fragment is unrealistic for ORM to explain without your control, Since SQL allows splicing fragments, there must be some scenarios that cannot be forced into non SQL strings. It is also very simple. Have you ever thought about why not force them???

Single structure 2024-05-11 10:09

Selected as Open Source China's disgrace pillar

Yeah, for 2024-05-17 13:42

That's too right. Old Zhou can't control Google, but he can control 360. Do not do to others what you do not want. All 360 products should be opened first.

Shen Lang Panda 2024-06-01 08:16

You can directly ask questions in the project work order. The comment area is not suitable for answering such questions

Xiao Xu Middle aged 2024-05-31 19:13

Very good

osc_25732934 2024-06-01 19:30

It seems that the current version of the Foreign Function&Memory API is not as fast as that of jni, or even worse. In addition, before vallhala comes out, all interactions between java and c have to get an additional memory. Even if it comes out, it may not be possible to directly throw a copy of binary data into memory as a structure. When the two apis are completely stable, the day lily is cold

Happy LeapFrog 2024-05-18 09:18

But the question is: "What's the use of this for ordinary Android users?" Now the answer seems to be: "Almost nothing.".

Shuimu Yi'an 2024-05-20 09:58

The news should be read continuously. I'm waiting for the third news besides rustdesk and teamviewer. Localized remote desktop software is far ahead.

GDWhisperer 2024-05-15 17:23

I transferred tens of thousands of yuan to my own account, which was under risk control. How did I do this? The bank should be responsible for this**

Starry Night Destiny 2024-06-01 21:49

It feels like Mybatis. It's OK to provide users with optional security solutions. It's useless for users to complain about this problem

sweet potato chips 2024-05-31 22:08

Glue code consumes few resources

Love to eat raw pears 2024-06-01 11:48

Why is this so-called "vulnerability" not a vulnerability? Spring, MyBatis and other frameworks can accept all kinds of CVE criticism, while MyBatisPlus has to dump the pot and accuse programmers of being too low-level# There is a difference. The premise is that you write XML, MyBatisPlus encapsulates Wrapper and claims to simplify code. Since it encapsulates and hides $#, it is not appropriate to do some necessary security checks? Instead of doubting the authority of CVE, you should know that SQL ->MyBatis ->MyBatisPlus ->various back-end scaffolds have multiple layers, each layer is simplifying, and each layer is throwing away the upper layer of the boiler. Who dares to use them. The programmers who use MyBatisPlus can't be expected to be at a high level. Every programmer wants to save effort. The front-end parameters can be directly obtained by HttpServletRequest from the back-end. Wrapper splicing can be found everywhere. If something goes wrong, is it the front-end or the framework? According to Qingmiao, can the injection vulnerability of the previous log4j and the deletion vulnerability of the Druid be used to eliminate low-level programmers?

Rocket ship 2024-05-31 19:22

It's a ghost anyway.

CodeDoger 2024-05-02 20:48

35 It's too old to go to work and too early to retire at 60

People are addicted to food 2024-06-01 13:53

History history combination

gamedot 2024-05-17 11:14

Old Zhou is deeply concerned about Huawei's great cause of open source. He is not a Huawei person, but has Huawei's soul.

Dogo_Little People 2024-06-02 12:24

Not everyone will go to see the document in full detail. As a general basic framework, the method naming should consider not only readability but also understandability. At least, it should also establish a cognition for developers. LambdaQueryWrapper is recommended. The official only briefly said that QueryWrapper may lead to SQL injection risks, There are no detailed examples (many people don't understand what SQL injection is). Now I met a jerk and submitted it to CVE to see who is the most powerful

-SORA- 2024-06-01 09:30

American characters

Others are still watching

More highlights

Top 10 distinctive highlights of the new ORM mybatis mp

The first anniversary of open source, the new version of Qinghua was released

MooTool 1.6.1 is released, and developers always have small tools

Jboot v4.1.6 release, based on JFinal microservice framework

Blazork8s released a new version, updated AI execution command logic, and added multiple AI functions

Kangaroo database tool v5.0 has been released

SQLE 3.2405.0 Release

Maximizing the Effective Throughput of LLM Serving

State fine-tuning, PointRWKV, Chinese documents online... The latest news of RWKV community in May is coming!

Java AI has a bright future

Databases that do not need data

This PHP application server looks a bit trendy! FrankenPHP

The Code R&D management platform - the innovative realization of the integration of daily reports and working hours

The easy-to-use ORM framework mybatis-mp 1.5.3 was released

GRPC 1.64.1 release, cross language RPC framework

Malware is rampant in pirated Microsoft Office

Bun's May update: performance improvement and memory optimization

Fedora Traditional Artistic Energy - Anaconda, the default web UI installer, skipped the ticket again

PDM -- Modern Python Package Manager

Summary of Fourteen Tips for PHP Beginners

What is Node.js

The most convenient IP location query in history

How can programmers improve efficiency

Why do we migrate from NodeJS to Ruby on Rails

Pay attention to scalability when designing Web applications

JavaScript realizes automatic jump after x seconds

7 Legends about html5

Three things should never be put in the database

Implement function overloading and parameter default values in JavaScript

How to learn a new PHP framework

Java 8 Lambda expression: simulate multiple inheritance of Mixin implementation class

What is Node - Learn Node

Technical preparations for websites with millions of visitors

What skills a qualified programmer needs to master

10 things you may not know about PHP

The process of creating pages on github

The beauty of code - how to write elegant PHP code

Research on the unavailability of javascript

What a qualified programmer should do every day, every week, every month, every year

Does the rise of Javascript mean the demise of LAMP

Ten Steps to Become an Excellent Web Developer

26 points for improving java performance

A day of four programmers

Continuous assignment operation that Javascript may not fully understand after 10 years of writing

How to increase the number of websites

Choose Apache or Tomcat for website construction server

Four key points of website code writing

Top 10 Interview Skills

How to quickly find the element in the middle of a single linked list with unknown length

The difference between equals and==in java

Five Tips for Improving the Security of PHP Websites

Best Web Chinese Default Font

Eight Mistakes in Domain Name Resolution

Ten reasons for turning to Spine.js

10 Practical Typography Skills to Improve the Readability of Web Pages

What books should a qualified programmer read

Mobile Web Interface Style - CSS3

Several things to pay attention to when coding Python

Web.py 0.3 Novice Guide

Explanation on the number of concurrent connections in website construction

Six problems that Java must understand

15 Things You Should Know About Win 8 RT

How to close the follow mode in artDialog

A day of four programmers

Good code is cheap code

Default Web Font Style

Do you really understand HTML

Why is the for loop hateful?

Top 10 reasons to use HTML5 now

6 Daliyou makes you like jQuery

Analysis of the underlying knowledge of website construction Socket and Http

Decrypting Redis persistence

28 new features, techniques and technologies of HTML5 that you must know

The mentality of website promotion

Ten Steps to Become an Excellent Web Developer

17 Tips to Improve Your Hibernate 4 Development Ability

How the browser renders text

Eight isolation levels necessary for Web development

MVC mode is not easy to use? Why not try MOVE

What a qualified programmer should do every day, every week, every month, every year

Fact: Red Boy is not the natural son of the Bull Demon King

Be a programmer with Chinese characteristics

What skills a qualified programmer needs to master

Ajax Request and Browser Cache in Website Construction

Three elements of code - winning the hearts of interviewers

10 CoffeeScript single line code tricks that make your friends admire you

PHP programmers are most likely to make 10 mistakes

The trilogy of learning technology: WHAT, HOW, WHY

About returning null values

How to get the div of artDialog

Common shortcut keys for Eclipse programmers

Why++[[]] [+[]]+[+[]]=10?

Write less code

How much is your code worth

5 tips to improve your SEO ability

Top 10 reasons to use HTML5 now

Don't tell me you know Javascript

Death Penalty in the Kingdom of Nouns (Translation) - A Story of Hello World

Avoid six common HTML5 incorrect uses

Software Author

Authors who have not been certified for this software

Use WeChat to log in quickly

Ministry of Industry and Information Technology

Open Source Software Promotion Alliance

Designated official community

Community norms

Yue ICP Bei No. 12009483

Top

WizardLM LLaMA based fine-tuning big language model

Software Introduction

code

comment

{{formatAllHtml(o.title)}}

{{formatAllHtml(o.title)}}

Awesome software

Hot content

Popular comments of the whole site

Others are still watching

Software Author

Hot News

OSCHINA Community

Online tools

Introduction

QQ group

Public account

Video number

Software Introduction

code

comment

Awesome software

Hot content

Popular comments of the whole site

Others are still watching

Software Author

Recommendation of similar software

Hot News

OSCHINA Community

Online tools

Introduction

QQ group

Public account

Video number