Handwritten Java HashMap core source code

In the previous chapter, we wrote the LinkedList core source code. In this chapter, we will write the Java HashMap core source code. Let's first understand the principle of HashMap. HashMap literally means hash+map, map means mapping, and HashMap means mapping with hash. don't get it? No problem. Let's explain the principle of HashMap in detail.

HashMap Usage Analysis

 //1 deposit HashMap<String,String> map = new HashMap<>(); map.put("name","tom"); //2 out System.out.println(map.get("name"));// Output tom

It is so simple to use.

Analysis of HashMap Principle

We know that the Object class has a hashCode() method, which returns the hashCode value of the object, which can be understood as returning the memory address of the object. For the time being, regardless of whether the memory address or anything else is returned, what is the number returned by the hashCode() method? We don't care

First, we just need to remember that this function returns a number. The second HashMap uses an array to store data

1 How does HashMap store name and tom? Let's use a diagram to demonstrate

It can be seen from the above figure that: Note: The size of the array in the figure above is 7, which can be any number. However, we have drawn 7 elements here. We will use the size of the array as 7 to illustrate the principle of HashMap.

The size of the array is 7, and the index range of the array is [0, 6]
Get the hashCode of the key, that is, "name". This is a number. No matter how many this number is, if 7 is remaindered, the range must be [0, 6], which is exactly the same as the index of the array.
If the value of "name". hashCode()% 7 is 2, then the value, that is, the location where "tom" should be stored, is 2
Data [2]="tom", stored in the array. Is it very clever.

2 Let's see how to get it again? Also use a diagram to demonstrate the underlying principle, as follows

It can be seen from the above figure that:

First, obtain the hashCode value of the key, that is, "name"
Use the hashCode value to fetch the remainder of the size 7 of the array, which is the same as running when saving, and must also be 2
Take the value from the second position of the array, that is, String value=data [2]

Note: Several points need attention

The value returned by the hashCode () method of an object is the same when called at any time
Take the remainder of a number n, the range is [0, n - 1]

Note: Several problems need to be solved

When saving, what if the hashCodes of different keys take the remainder of the array exactly the same, that is, they are all mapped to the same position in the array? This is the hash conflict problem, such as 9 % 7 == 2 ， 16 % 7 == 2 Both are equal to 2 Answer: The data structure stored in the array is a node, and the node has the next attribute. If the hash conflicts, the single linked list will be stored. The same is true when fetching, and the linked list will be traversed
What if the array is full? Answer: Same as ArrayList, expand capacity and remap
Directly use the hashCode() value for mapping, and the probability of hash conflict is very large. What should I do? Answer: referring to the implementation in HashMap in JDK, there is a hash() function, and then run the value of hashCode() and map it

It can be seen from the above that HashMap uses an array to store data. If there is already a value on the mapping location, the linked list will be used to store the data in front of the current location. The array+linked list structure is the underlying structure of HashMap If the element stored in our array is QEntry, as shown below:

Handwritten HashMap core source code

The principle has been analyzed above. Next, we will use the least code to prompt the principle of HashMap. We call it the QHashMap class. At the same time, the elements in the array need to define a class, which we define inside the QHashMap class. It's called QEntry

QEntry is defined as follows:

 //Element classes stored in the underlying array public static class QEntry<K, V> { K key;//Store the key V value;//Store value Int hash;//The hash value corresponding to the key //When the hash conflicts, that is, there is already an element in the mapping location //Then the newly added element is used as the chain header, and the stored element is placed at the back //That is, it is saved in next. One sentence: when adding new elements, add them in the header QEntry<K, V> next;   public QEntry(K key, V value, int hash, QEntry<K, V> next) { this.key = key; this.value = value; this.hash = hash; this.next = next; } }

With the definition of the QEntry class, let's see what properties are required in the QHashMap class? The QHashMap class is defined as follows:

 public class QHashMap<K, V> { //The size of the default array private static final int DEFAULT_INITIAL_CAPACITY = 16; //The default expansion factor. When there are more elements in the data, hash conflicts are also easy to occur //Therefore, you need to expand the capacity before the array is used up //0.75 means that when the number of elements reaches 75% of the array size, the capacity will be expanded //For example, the size of the array is 100. When the number of elements in the array increases to 75, it will start to expand private static final float DEFAULT_LOAD_FACTOR = 0.75f; //Array for storing elements private QEntry[] table; //Number of elements in the array private int size; ...... }

It only needs two constants and two variables. Let's take a look at the constructor of QHashMap. For simplicity, only one default constructor is implemented

 public QHashMap() { //Create an array with the default size of 16 table = new QEntry[DEFAULT_INITIAL_CAPACITY]; //At this time, the number of elements is 0 size = 0; }

Let's see how QHashMap stores data map.put("name","tom") The put() function is implemented as follows:

 /** *1. The parameter key and value are easy to understand *2 Return V, we know that HashMap has one feature, *If map.put ("name", "tom") is called multiple times; map.put("name","lilei"); *The following values will overwrite the previous ones. If this happens, the old value will be returned, and "tom" will be returned here */ public V put(K key, V value) { //1. For simplicity, key does not support null if (key == null) { throw new RuntimeException("key is null"); } //Instead of using key. hashCode() directly, we perform another operation on key. hashCode() as the hash value //I copied the hash () method directly from the HashMap source code. Do not care about the hash() algorithm itself //Just know that hash () inputs a number and returns a number. int hash = hash(key.hashCode()); //Use the hash value of the key and the size of the array to make a mapping to get the location that should be stored int index = indexFor(hash, table.length); //Check whether the key of an existing element in the array is equal to the key in the parameter //If equal, replace the old value with a new one, and then return the old value QEntry<K, V> e = table[index]; while (e !=  null) { //First compare whether hashes are equal, then compare whether objects are equal, or compare equals methods //If they are equal, it means that there is the same key. At this time, the old value should be updated to the new value, and the old value should be returned if (e.hash == hash && (key == e.key || key.equals(e.key))) { V oldValue = e.value; e.value = value; return oldValue; } e = e.next; } //If the key without element in the array is equal to the passed key //Save the elements in the current position QEntry<K, V> next = table[index]; //Next may or may not be null, regardless of whether it is null //Next should be the next node of the new element (next is passed to the constructor of QEntry) //Then the new element is saved in the index location table[index] = new QEntry<>(key, value, hash, next); //If you need to expand, the number of elements is greater than table. length * 0.75 (don't ask why it is 0.75, experience) if (size++ >= (table.length * DEFAULT_LOAD_FACTOR)) { resize(); } return null; }

The comments are very detailed. Here are several functions, the hash () function, which is copied directly from the HashMap source code, so you don't need to bother with this algorithm. IndexFor(), passing in the hash and array size to know where we should go to find or save the source code of these two functions as follows:

 //The HashCode is calculated, and the implementation of HashMap in JDK is directly copied static int hash(int h) { h ^= (h >>> 20) ^ (h >>> 12); return h ^ (h >>> 7) ^ (h >>> 4); } //Find where the key falls in the array according to h static int indexFor(int h, int length) { //Or return h&(length-1) has better performance //Here we take the remainder of length in the easiest way to understand. The range is [0, length - 1] //Exactly the range of all indexes of the table array H=h>0? H: - h;//Prevent negative numbers return h % length; }

There is also an expansion function. When the number of elements is greater than table. length * 0.75, we start to expand the source code of resize() as follows:

 //Capacity expansion, the number of elements is greater than table. length * 0.75 //Expand the array to twice the original size private void resize() { //Create a new array whose size is twice the size of the original array int newCapacity = table.length * 2; QEntry[] newTable = new QEntry[newCapacity]; QEntry[] src = table; //Traverse the old array and remap it to the new array for (int j = 0;  j < src.length; j++) { //Get old array elements QEntry<K, V> e = src[j]; //Release old array src[j] = null; //Because e is a linked list, there may be multiple nodes, which are mapped by looping through while (e !=  null) { //Save the next node of e QEntry<K, V> next = e.next; //E The current node is mapped in a new array int i = indexFor(e.hash, newCapacity); //The newTable [i] position may or may not be null //Whether it is null or not, it will be the next node of the e node e.next = newTable[i]; //Save e in the location of i of the new array newTable[i] = e; //Continue the same processing of the next node of e e = next; } } //All nodes are mapped to the new array. Don't forget to assign the new array to table table = newTable; }

Compared with the put () function, get () is much simpler. Just find the position of the corresponding array through the hash value, and then traverse the linked list to find that the key in an element is equal to the key passed. The source code of the put () method is as follows:

 //Get the value according to the key public V get(K key) { //Also for simplicity, key does not support null if (key == null) { throw new RuntimeException("key is null"); } //Hash the key int hash = hash(key.hashCode()); //Map with hash value to get the data at which position in the array int index = indexFor(hash, table.length); //Save the element at index for traversal //Because e is a linked list, we need to traverse the linked list //Find the QEntry equal to the key and return the value QEntry<K, V> e = table[index]; while (e !=  null) { //Compare whether hash values are equal if (hash == e.hash && (key == e.key || key.equals(e.key))) { return e.value; } //If not, continue to find the next e = e.next; } return null; }

The above is the core source code of QHashMap, which we have not deleted. The following is the source code of the entire QHashMap class

The complete source code of QHashMap is as follows:

 public class QHashMap<K, V> { //The size of the default array private static final int DEFAULT_INITIAL_CAPACITY = 16; //The default expansion factor. When the size of the array is greater than or equal to the current capacity * 0.75, it starts to expand private static final float DEFAULT_LOAD_FACTOR = 0.75f; //The underlying layer uses an array to store data private QEntry[] table; //Array size private int size; //A dot node, the unit stored in the array public static class QEntry<K, V> { K key; V value; int hash; QEntry<K, V> next; public QEntry(K key, V value, int hash, QEntry<K, V> next) { this.key = key; this.value = value; this.hash = hash; this.next = next; } } public QHashMap() { table = new QEntry[DEFAULT_INITIAL_CAPACITY]; size = 0; } //Get the value according to the key public V get(K key) { //Also for simplicity, key does not support null if (key == null) { throw new RuntimeException("key is null"); } //Hash the key int hash = hash(key.hashCode()); //Map with hash value to get the data at which position in the array int index = indexFor(hash, table.length); //Save the element at index for traversal //Because e is a linked list, we need to traverse the linked list //Find the QEntry equal to the key and return the value QEntry<K, V> e = table[index]; while (e !=  null) { //Compare whether hash values are equal if (hash == e.hash && (key == e.key || key.equals(e.key))) { return e.value; } //If not, continue to find the next e = e.next; } return null; } /** *1. The parameter key and value are easy to understand *2 Return V, we know that HashMap has one feature, *If map.put ("name", "tom") is called multiple times; map.put("name","lilei"); *The following values will overwrite the previous ones. If this happens, the old value will be returned, and "tom" will be returned here */ public V put(K key, V value) { //1. For simplicity, key does not support null if (key == null) { throw new RuntimeException("key is null"); } //Instead of using key. hashCode() directly, we perform another operation on key. hashCode() as the hash value //I copied the hash () method directly from the HashMap source code. Do not care about the hash() algorithm itself //Just know that hash () inputs a number and returns a number. int hash = hash(key.hashCode()); //Use the hash value of the key and the size of the array to make a mapping to get the location that should be stored int index = indexFor(hash, table.length); //Check whether the key of an existing element in the array is equal to the key in the parameter //If equal, replace the old value with a new one, and then return the old value QEntry<K, V> e = table[index]; while (e !=  null) { //First compare whether hashes are equal, then compare whether objects are equal, or compare equals methods //If they are equal, it means that there is the same key. At this time, the old value should be updated to the new value, and the old value should be returned if (e.hash == hash && (key == e.key || key.equals(e.key))) { V oldValue = e.value; e.value = value; return oldValue; } e = e.next; } //If the key without element in the array is equal to the passed key //Save the elements in the current position QEntry<K, V> next = table[index]; //Next may or may not be null, regardless of whether it is null //Next should be the next node of the new element (next is passed to the constructor of QEntry) //Then the new element is saved in the index location table[index] = new QEntry<>(key, value, hash, next); //If you need to expand, the number of elements is greater than table. length * 0.75 (don't ask why it is 0.75, experience) if (size++ >= (table.length * DEFAULT_LOAD_FACTOR)) { resize(); } return null; } //Capacity expansion, the number of elements is greater than table. length * 0.75 //Expand the array to twice the original size private void resize() { //Create a new array whose size is twice the size of the original array int newCapacity = table.length * 2; QEntry[] newTable = new QEntry[newCapacity]; QEntry[] src = table; //Traverse the old array and remap it to the new array for (int j = 0;  j < src.length; j++) { //Get old array elements QEntry<K, V> e = src[j]; //Release old array src[j] = null; //Because e is a linked list, there may be multiple nodes, which are mapped by looping through while (e !=  null) { //Save the next node of e QEntry<K, V> next = e.next; //E The current node is mapped in a new array int i = indexFor(e.hash, newCapacity); //The newTable [i] position may or may not be null //Whether it is null or not, it will be the next node of the e node e.next = newTable[i]; //Save e in the location of i of the new array newTable[i] = e; //Continue the same processing of the next node of e e = next; } } //All nodes are mapped to the new array. Don't forget to assign the new array to table table = newTable; } //The HashCode is calculated, and the implementation of HashMap in JDK is directly copied static int hash(int h) { h ^= (h >>> 20) ^ (h >>> 12); return h ^ (h >>> 7) ^ (h >>> 4); } //Find where the key falls in the array according to h static int indexFor(int h, int length) { //Or return h&(length-1) has better performance //Here we take the remainder of length in the easiest way to understand. The range is [0, length - 1] //Exactly the range of all indexes of the table array H=h>0? H: - h;//Prevent negative numbers return h % length; } }

The above is the principle of QHashMap. Let's write a test code to see whether our QHashMap can run normally. The test code is as follows:

 public static void main(String[] args) { QHashMap<String, String> map = new QHashMap<>(); map.put("name", "tom"); map.put("age", "23"); map.put("address", "beijing"); String oldValue = map.put("address", "shanghai"); // The key returns the old value and saves the new value System.out.println(map.get("name")); System.out.println(map.get("age")); System. out. println ("old value="+oldValue); System. out. println ("new value="+map. get ("address")); }

The output is as follows:

 tom twenty-three Old value=beijing New value=shanghai

Through the above simple implementation of QHashMap, there are still many functions that have not been implemented, such as remove, clear, containsKey(), and traversal. Interested readers can implement them by themselves

Small and beautiful software development 2024-06-01 05:06

Cheat one's job

Li Yinghui 2024-05-09 16:40

Buddhism has a good word, evil opinion. In dealing with the world, it is meaningless to draw conclusions from preset positions; It is also important to receive good logic training.

Xiao Xu Middle aged 2024-05-31 19:13

Very good

-SORA- 2024-06-01 09:30

American characters

Rocket ship 2024-05-31 19:22

It's a ghost anyway.

osc_92224065 2024-04-29 10:57

Long term oppressed outsourcing of state-owned enterprises

kangaroo 2024-06-01 22:23

The next version focuses on improving existing functions * improving internal power and qi * and continues to move towards the goal of Grand Master.

osc_25732934 2024-06-01 19:30

It seems that the current version of the Foreign Function&Memory API is not as fast as that of jni, or even worse. In addition, before vallhala comes out, all interactions between java and c have to get an additional memory. Even if it comes out, it may not be possible to directly throw a copy of binary data into memory as a structure. When the two apis are completely stable, the day lily is cold

Code craftsman 2024-06-01 11:22

I also said "user controllable parameters"

Chief taxi captain 2024-05-17 11:17

I suggest that 360 open source all its products, and then become the leading enterprise in the domestic open source industry through open source, leading everyone to compete with foreign enterprises

Yoona520 2024-05-17 16:34

Zhou Hongyi is now living more and more like a clown. If he stays behind the scenes, he has to become an online celebrity. Can you learn from Lei Jun?

CodeDoger 2024-05-02 20:48

35 It's too old to go to work and too early to retire at 60

One code Yma 2024-05-06 09:14

My technical article was moved by CSDN. Why didn't anyone step on the sewing machine? This kind of report is a joke to me. The monsters with background are fine, and the monsters without background fight to death

haol666 2024-05-31 18:56

This story is powerful, I take it seriously, until I see the end.

young crops 2024-06-01 16:21

There is no tipping point. There are also many official documents stating that SQL fragments involving direct string splicing need to be controlled by the user, and specific solutions are also provided. If you say that the value part is injected, then we are also 100% free of any dispute. This obvious SQL fragment is unrealistic for ORM to explain without your control, Since SQL allows splicing fragments, there must be some scenarios that cannot be forced into non SQL strings. It is also very simple. Have you ever thought about why not force them???

osc_566335 2024-04-28 14:44

This is also called floor washing? Does it mean that Tesla will not wash the floor if it releases all the source code? Some people HWptds? That is to say, the language is ambiguous, which will also rise to the washing ground? Are some people too focused? Think the people he pays attention to must be staring at?

Happy LeapFrog 2024-05-18 09:18

But the question is: "What's the use of this for ordinary Android users?" Now the answer seems to be: "Almost nothing.".

sunday12345 2024-05-15 18:31

What does the bank do? It's blamed on the remote desktop. Persimmons really pick up soft pinches~?

Xiao Xu Middle aged 2024-06-01 06:49

thank

Love to eat raw pears 2024-06-01 11:48

Why is this so-called "vulnerability" not a vulnerability? Spring, MyBatis and other frameworks can accept all kinds of CVE criticism, while MyBatisPlus has to dump the pot and accuse programmers of being too low-level# There is a difference. The premise is that you write XML, MyBatisPlus encapsulates Wrapper and claims to simplify code. Since it encapsulates and hides $#, it is not appropriate to do some necessary security checks? Instead of doubting the authority of CVE, you should know that SQL ->MyBatis ->MyBatisPlus ->various back-end scaffolds have multiple layers, each layer is simplifying, and each layer is throwing away the upper layer of the boiler. Who dares to use them. The programmers who use MyBatisPlus can't be expected to be at a high level. Every programmer wants to save effort. The front-end parameters can be directly obtained by HttpServletRequest from the back-end. Wrapper splicing can be found everywhere. If something goes wrong, is it the front-end or the framework? According to Qingmiao, can the injection vulnerability of the previous log4j and the deletion vulnerability of the Druid be used to eliminate low-level programmers?

kakai 2024-05-10 10:21

The world only knows that Android was created by Google. Several people know that Android is only a product acquired by Google. Similarly, what is the problem with Huawei's contribution to the collection of OGG open source work and integration into its own proprietary product line?

MrChen89 2024-04-29 09:18

There are a group of people like this. I don't know what they have experienced. When it comes to HW, I can't say anything good, even if it's neutral

Yeah, for 2024-05-17 13:42

That's too right. Old Zhou can't control Google, but he can control 360. Do not do to others what you do not want. All 360 products should be opened first.

infoworld 2024-05-11 15:12

Universities should use open source free software instead of commercial ones. In this way, hands and feet will not be tied technically.

Simple code 2024-06-02 20:15

Does JBoot solve the problem that the join template in JFinal only supports Java 8? Is the dependency on Javax to be changed to Jakarta?

zhy 2024-05-16 13:16

At the end of Shannon is Nong

Ma Nong Little Fatty Brother 2024-05-16 14:40

I give you six seconds. I give you six moves with the same effect in the martial arts contest, which shows the invincibility and confidence of the master

Shuimu Yi'an 2024-05-20 09:58

The news should be read continuously. I'm waiting for the third news besides rustdesk and teamviewer. Localized remote desktop software is far ahead.

xiaoqibabby 2024-05-15 17:36

The bank is strongly required to be responsible for

-SORA- 2024-04-30 17:07

When this happened in a foreign country, the comment area suddenly became very objective and rational**

monkey_cici 2024-05-09 00:25

My I9 CPU, 64GB memory module and 3080Ti computer are inferior to the top configuration of 19999 on a tablet

Dogo_Little People 2024-06-02 12:24

Not everyone will go to see the document in full detail. As a general basic framework, the method naming should consider not only readability but also understandability. At least, it should also establish a cognition for developers. LambdaQueryWrapper is recommended. The official only briefly said that QueryWrapper may lead to SQL injection risks, There are no detailed examples (many people don't understand what SQL injection is). Now I met a jerk and submitted it to CVE to see who is the most powerful

Yokesily 2024-06-02 15:11

So designed

gamedot 2024-05-17 11:14

Old Zhou is deeply concerned about Huawei's great cause of open source. He is not a Huawei person, but has Huawei's soul.

Starry Night Destiny 2024-06-01 21:49

It feels like Mybatis. It's OK to provide users with optional security solutions. It's useless for users to complain about this problem

Monkeys think of apes 2024-05-31 18:31

You can cheat your brother. Just don't cheat yourself

Francesca 2024-05-19 18:00

Wine runs the Android emulator of Windows. Chrome OS is installed in the Android emulator. Linux environment is installed in chrome OS. Linux environment is installed in the Linux environment. Wine is installed in the Android emulator

Single structure 2024-05-11 10:09

Selected as Open Source China's disgrace pillar

All the way north GP 2024-04-25 14:55

America, the future of mankind

The seven in one little King Kong 2024-06-02 15:54

Those people only use resources, others are not developed by NPM...

sweet potato chips 2024-05-31 22:08

Glue code consumes few resources

One code Yma 2024-05-09 09:58

Recently, I often go to interviews. People who hate Ali background most regard me as a fool, even though I am a fool

zzeric 2024-04-28 20:01

Although France is the parent community, the core developers of OCCT on github are all Russians. Without Russians, the French parent community cannot continue to operate. So Huawei took over, moved to China, changed its name and resumed open source and community operations. What's the problem?

People are addicted to food 2024-06-01 13:53

History history combination

Ning Jinnong 2024-06-01 21:04

Correct it. The example of loading the library is wrong. It should be # library=@ loading the dynamic library, "./yards to the treasurer. dll"

Shen Lang Panda 2024-06-01 08:16

You can directly ask questions in the project work order. The comment area is not suitable for answering such questions

Brother Xiao Yang 2024-06-01 20:39

Isn't Ali developed? What are you afraid of? There's no need for every family to set up a set

GDWhisperer 2024-05-15 17:23

I transferred tens of thousands of yuan to my own account, which was under risk control. How did I do this? The bank should be responsible for this**

jalena 2024-05-31 23:57

I can imagine that I will also receive the CVE repair request next week..... I don't use the key!!!!!!!!!

looly 2024-06-02 14:32

@Qingmiao Hutool has also been mentioned some loopholes that I think are relatively "low-level", or I think are not loopholes. At first, I was also very angry, but after thinking it through, I found that CVE's idea was that once you did not actively remind users that there was a pit, the user fell into the pit is your fault, that is, your vulnerability. For example, as a traffic policeman, you should remind everyone who crosses the road to pay attention to safety, and ask him to answer whether he knows. Once you don't remind someone and are hit by a car, you can't get away from it. Similarly, when using frameworks and tools, you should provide at least one parameter to remind users that there may be SQL injection vulnerabilities. Note that it is not in the comments, but in the method parameters, which is the user's responsibility. Therefore, it is not comprehensive to provide solutions in comments or documents.

Xiao Xu Middle aged 2024-06-01 07:03

good

Hakuna 2024-05-31 18:28

It is compatible with Oracle, but does not know "just" or "just". Those who can be compatible with Oracle and do well are real men and real warriors. You should know that compatibility means that even bugs must be compatible, and you have no other code that can not be copied. It's all based on real skills and understanding of oracle.

Bright 2024-05-19 23:25

What a fool! I killed myself. How can people deal with me later.

Love to eat raw pears 2024-06-01 19:18

Don't expect programmers to have a deep understanding of the document. I still think that since the tool hides the details of $#, some necessary security checks are necessary. Many people do not use MybatisPlus directly, but use various so-called rapid development platforms. The MyBatisPlus rapid development platform Snowy, Guns, etc., has an impression that many versions have the problem of using Wrapper directly to splice the Request parameter. I remember that JeecgBoot was opened a lot of CVEs last year or the year before last because of the Wrapper splicing problem. Do you know the author of ibeetl? Many CVE blaming holes have been opened before. The problem is similar. The lack of basic knowledge "script editing permission" is actively handed over to the front end. What a low-level error or even low-energy behavior. However, I accepted it with an open mind and added a white list check.

oldpig 2024-04-28 09:59

”Huawei contributed all the source code "?, the title is completely inconsistent with the content.

Bright Stars 2 2024-05-31 23:28

Remove Unsafe? You don't want netty anymore?

Apizza 2024-06-01 17:52

You can switch from lodash to radash in 2024!!!

Voice of God 2024-06-01 20:47

By default, injection ($) and splicing are turned off. If you want to use it, you need to sign the birth and death form and press the fingerprint.

Qin Liming 2024-05-11 09:12

be devoid of any sense of shame