Memory allocation is the key topic of the whole book. The hard parts are the cooperation between I/O and the CPU, and process synchronization.

process

Differences between processes and threads

  • A process is the unit of resource allocation; a thread is the unit of CPU scheduling
  • Threads share the process's resources and address space, but each thread has its own stack

Process scheduling algorithms

FCFS (first come, first served), priority scheduling, round-robin (time slice), multilevel feedback queue

Process state transition

Ready, Running, Blocked

  • When its time slice is used up, a process goes from Running to Ready (not to Blocked)
  • A process becomes Blocked when it lacks a resource it needs to continue
  • Only a process in the Ready state can be switched to Running

Process Synchronization

The purpose of process synchronization is to ensure that, when the process currently accessing a critical resource is switched out by the scheduler, no other process can access that critical resource.

Methods: semaphores, disabling interrupts, hardware instructions
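As an illustration of semaphore-based synchronization, here is a minimal sketch (the thread count and loop count are made up) in which a binary semaphore protects a shared counter:

```python
import threading

counter = 0
mutex = threading.Semaphore(1)  # binary semaphore guarding the critical section

def worker(n):
    global counter
    for _ in range(n):
        mutex.acquire()   # P operation: enter the critical section
        counter += 1      # access to the critical resource
        mutex.release()   # V operation: leave the critical section

threads = [threading.Thread(target=worker, args=(10000,)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(counter)  # 40000: no increments are lost
```

Without the acquire/release pair, the read-modify-write on the counter could interleave across threads and lose updates.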

memory allocation

The name memory allocation already tells you what this is about.
What is memory for? To run, a job must be loaded into memory. Two questions arise:

  • Is the job loaded whole or in discrete pieces: all of it, or only the part needed at the start?
  • Is the memory partitioning fixed, or can it change dynamically (partitions split and merged)?

Based on these two problems, there are different memory allocation algorithms.

Fixed partition allocation

Memory is divided into N blocks. The blocks may differ in size from one another, but each block's size is fixed and cannot change.

Dynamic partition allocation

"Dynamic" means that the sizes of the memory blocks can change.

At first, the memory is unpartitioned; each task carves out exactly as much as it needs.

But when a job is reclaimed, it leaves a free block behind. Over time, the memory fills with free blocks, which can be connected in a free-partition linked list. When a new job is loaded into memory, this raises the free-partition allocation problem:

  • First fit: scan by address from the beginning
  • Next fit: scan by address, starting from where the last allocation ended
  • Best fit: choose the smallest partition that is large enough
  • Worst fit: choose the largest partition
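A small sketch of two of these policies (the free list and request sizes are invented); each partition is a (start address, size) pair:

```python
# Hypothetical free-partition list: (start_address, size) pairs, sorted by address.
free_list = [(0, 100), (150, 30), (200, 60), (300, 20)]

def first_fit(free, size):
    """Allocate from the first partition large enough; return its start address."""
    for i, (start, length) in enumerate(free):
        if length >= size:
            if length == size:
                free.pop(i)                              # partition used up entirely
            else:
                free[i] = (start + size, length - size)  # split: keep the tail
            return start
    return None                                          # no partition fits

def best_fit(free, size):
    """Allocate from the smallest partition that still fits."""
    fits = [(length, i) for i, (_, length) in enumerate(free) if length >= size]
    if not fits:
        return None
    _, i = min(fits)
    start, length = free[i]
    if length == size:
        free.pop(i)
    else:
        free[i] = (start + size, length - size)
    return start

print(first_fit(list(free_list), 25))  # 0   (takes the 100-unit block at address 0)
print(best_fit(list(free_list), 25))   # 150 (the smallest fit is the 30-unit block)
```

Next fit and worst fit differ only in where the scan starts and which fitting partition is picked.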

Page/Segment Allocation

So far we have been wrestling with how to partition memory.

Paging/segmentation partitions the job instead: the job is divided into many blocks.

The difference between pages and segments:

Pages are fixed in size, equal to the physical block size of memory (memory is divided into equal physical blocks). The division is done automatically by memory management and is invisible to the user.

Segments vary in size and are divided according to the programmer's wishes; a segment is a logical unit of information.


Another problem: once the job's blocks are loaded into memory discretely, how do we know which job block sits in which memory block? This is the role of the page/segment table.

The page table is the mapping between the job's logical addresses and memory's physical addresses.

For example, on a 32-bit machine the logical address is 32 bits, while the number of physical address bits is determined by the actual memory size.

Differences between the page table and the segment table

(figure: page table structure)

(figure: segment table structure)

Why is the segment table a two-dimensional structure?

Because a segment number corresponds to two elements (segment length and base address),

while a page number corresponds to only one element (the block number).
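A numeric sketch of the page-table mapping, assuming (for illustration) 4 KB pages and an invented page table from page numbers to physical frame numbers:

```python
PAGE_SIZE = 4096     # assumed page size: 4 KB, i.e. a 12-bit offset
OFFSET_BITS = 12

# Hypothetical page table: page number -> physical frame number.
page_table = {0: 5, 1: 9, 2: 3}

def translate(logical_addr):
    page = logical_addr >> OFFSET_BITS       # high bits: page number
    offset = logical_addr & (PAGE_SIZE - 1)  # low bits: offset within the page
    if page not in page_table:
        raise KeyError("page fault")         # page not resident in memory
    return (page_table[page] << OFFSET_BITS) | offset

print(hex(translate(0x1234)))  # page 1 maps to frame 9, giving 0x9234
```

The offset passes through unchanged; only the page number is looked up in the table.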

Demand paging allocation (virtual memory)

Since the job is divided into discrete blocks, by the principle of program locality we do not need to load all of the job's blocks into memory at once. Memory is small and cannot accommodate everyone, so each job is assigned only a certain number of physical blocks.

Consider the following questions (these five questions are the key to understanding the whole process):

  • Is the number of physical blocks allocated to a job fixed or dynamic? (page allocation algorithm)
  • Which pages of the job should be brought into memory? (page fetch algorithm)
  • If the job's physical blocks are full and a page must be brought in, which page gets replaced? (page replacement algorithm)
  • If the job has a free physical block, where in memory does an incoming page go? (mapping between logical and physical addresses)
  • What if a page's contents are modified while in memory? (data consistency)

These five questions apply equally to cache allocation: just replace page/physical block with cache block (a cache block is not necessarily the same size as a physical block). So let's start from what the two have in common, and give special attention to where they differ.

TLB (translation lookaside buffer), page table, cache, memory

  • Cached data lives in the cache; the page table lives in memory
  • The page table maps a job's page numbers to memory block numbers; the TLB is a small, fast copy of recently used page table entries, and in the same spirit the cache keeps tags recording which memory block each line holds
  • Differences between cache allocation and memory allocation:

    • Cache contents are copies of memory blocks, so data consistency must be maintained. Memory contents come from disk (the job's image); when a page is replaced it can simply be written back to disk, so no special consistency machinery is needed
    • A job is allocated relatively few physical blocks, so in memory a page can go straight into any free block without further ado, while the mapping between cache and memory is more complex

Page allocation algorithm

For memory allocation, the number can be fixed: the job is given a fixed number of physical blocks at the start. When those blocks are full, the contents of one of them must be replaced. This is fixed allocation.

It can also be dynamic and can be divided into two situations:

  • At the start, no physical blocks are allocated to the job. Memory maintains a linked list of free physical blocks; when a page of the job needs to enter memory, an empty block is simply taken from the list. If the whole memory is full, a page of some other process is evicted. This is dynamic allocation with global replacement.
  • At the start, a certain number of physical blocks are allocated, and memory still maintains a free-block list. When a page needs to enter memory and the job's own blocks are full, the job replaces one of its own pages first. If the job page-faults frequently, constantly replacing its own blocks, the system takes pity and allocates it another block from the free list. If the job rarely faults, its block count can be reduced. This is dynamic allocation with local replacement.

For memory allocation, we usually use the fixed allocation algorithm.

For the cache, because the cache is very small and all jobs share the same cache space, it is dynamic allocation with global replacement.


These three schemes are a lot like parenting styles.

The first gives the child a fixed monthly allowance. If the money runs out and the child wants something else, the child has to sell some of their own things; no extra money is given.

The second is plain indulgence: whatever the child wants to buy, money is handed over. If the family runs short, Dad spends less on cigarettes.

The third gives a fixed amount at the start; when it runs out, the child must sell things to buy more. But if the parents see the child selling things off too frequently, they add to the allowance, and if the child hardly ever needs to sell anything, the allowance can be reduced.

Page call in algorithm

At the start, the cache/memory is empty. You can decide up front to bring in some pages/cache blocks that are likely to be used later; this is the prepaging policy.

Or you can bring nothing in at first and wait for a page fault/cache miss before loading; this is the demand paging policy.

Mapping relationship

This part gave me a hard time.

The problem it solves: when a data block is to be brought into the cache, it cannot simply sit in any free slot; there is a rule. For example, if the block is the third member of its group in memory, it must sit in the third group of the cache. Even if the first group has free seats, it cannot go there.

This mapping rule also means that when the cache is full, only a member of the designated group can be replaced. If that group has several members, the page replacement algorithm chooses among them.

The rules are as follows

The cache has M lines, grouped into N groups.

Main memory is also grouped, with N members per group.

Each data block in main memory corresponds to one group in the cache.


If the cache is divided into M groups (N = M, one line per group), the mapping is direct-mapped: each memory block has exactly one possible line.

If the cache forms a single group (N = 1), the mapping is fully associative: a block may go into any line.

Otherwise, it is set-associative.
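The placement rule can be sketched as follows (a hypothetical 16-line cache; only the group arithmetic matters):

```python
LINES = 16  # assumed cache size in lines

def candidate_lines(block, groups):
    """Which cache lines may hold memory block `block` when the cache has `groups` groups."""
    lines_per_group = LINES // groups
    g = block % groups                 # the block's fixed group
    return list(range(g * lines_per_group, (g + 1) * lines_per_group))

print(candidate_lines(3, 16))  # direct-mapped: exactly one candidate line
print(candidate_lines(3, 8))   # 2-way set-associative: two candidates in group 3
print(candidate_lines(3, 1))   # fully associative: any of the 16 lines
```

The fewer the groups, the more freedom a block has in choosing a line, at the price of a more expensive lookup.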


Why do we have such a complex mapping relationship?

Memory allocation is so simple by comparison: any free physical block will do, the page goes straight in, done. Why does the cache insist that a block may only enter a specific group?

The reason: the page table lives in memory, where space is plentiful. The cache's tag table lives in the cache itself, which is poor in space, so the smaller the tags, the better.

What is the relationship between tag size and the mapping?

Each cache line is structured much like a page table entry, with the page number replaced by the block's identity. A line can be as simple as:

data block number + data block contents

The data block number marks which memory data block this is. (Special note: a data block is not necessarily the same size as a physical block, so the precise name for this field is the tag.)

The mapping relationship helps shorten the tag, because the mapping itself already carries some of the information.

For example: a cache of 16 lines, divided into 8 groups of 2 lines each; main memory divided into 32 groups of 8 members each.

Main memory then has 32 × 8 = 2^8 data blocks, so a block number needs 8 bits. Block 3 of main memory must go into group 3 (3 mod 8 = 3).

The block number thus splits into two parts: the remaining part (5 bits) and the group number (3 bits).

The group number can be read directly from the block's position in the cache, so there is no need to store it in the tag (the mapping relationship supplies this information). The tag only needs to store the remaining part, shrinking from 8 bits to 5 bits.

It also follows that the more groups there are, the smaller the tag (direct mapping needs the fewest tag bits). But the downside is conflicts: for example, memory blocks 3 and 11 both map to group 3 and will preempt each other.
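The bit arithmetic of the worked example above (256 memory blocks, 8 cache groups) can be checked directly; the split into 3 group bits and 5 tag bits is:

```python
GROUP_BITS = 3   # 8 groups -> 3 bits of the block number select the group

def split_block_number(block):
    group = block & ((1 << GROUP_BITS) - 1)  # low 3 bits: group number (block mod 8)
    tag = block >> GROUP_BITS                # high 5 bits: stored as the tag
    return tag, group

print(split_block_number(3))   # (0, 3): block 3 goes to group 3
print(split_block_number(11))  # (1, 3): block 11 also goes to group 3, hence conflict
```

Blocks 3 and 11 land in the same group and are distinguished only by their tags.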

Page replacement algorithm

Whether allocation is fixed or dynamic, when all physical blocks are occupied and a new page must come in, someone has to be evicted.

Textbooks discuss page replacement in the context of memory allocation, but cache allocation uses the same algorithms (when a group has multiple members). A careful exam question will cover both.

The algorithms are optimal replacement (OPT), FIFO, LRU (least recently used), and Clock. The details will be expanded separately later.
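A minimal simulation of FIFO and LRU (the reference string and frame count below are made up) shows how the fault counts differ:

```python
def count_faults(refs, frames, policy):
    """Count page faults for a reference string under FIFO or LRU replacement."""
    resident, order, faults = [], [], 0
    for page in refs:
        if page in resident:
            if policy == "lru":            # refresh recency on a hit
                order.remove(page)
                order.append(page)
            continue
        faults += 1
        if len(resident) == frames:
            victim = order.pop(0)          # FIFO: oldest load; LRU: least recent use
            resident.remove(victim)
        resident.append(page)
        order.append(page)
    return faults

refs = [1, 2, 3, 4, 1, 2, 5, 1, 2, 3, 4, 5]
print(count_faults(refs, 3, "fifo"))  # 9
print(count_faults(refs, 3, "lru"))   # 10
```

With 4 frames, the same string gives FIFO 10 faults: more frames, more faults, the classic Belady's anomaly.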

Data consistency

This problem is not unique to cache allocation; memory allocation simply handles it by default with write-back (a page is written to disk only when it is replaced).

Because the cache holds copies of some memory blocks, the data in the two places must be kept consistent.

So when data in the cache is modified (a write operation), what should happen to memory?
  • On a miss:

    • Load the block into the cache first, then modify it in the cache
    • Modify the data in memory first, then read it into the cache
  • On a hit:

    • Modify cache and memory at the same time (write-through)
    • Modify only the cache; the data is written back to memory when the block is replaced (write-back)

Notice that methods two and three keep cache and memory consistent at all times, while methods one and four trade that consistency for speed.
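A toy model of the two hit policies (all the structures here are invented) makes the difference concrete: write-through updates memory on every write, while write-back only marks the line dirty and updates memory at eviction time.

```python
memory = {0: 10}
cache = {0: {"data": 10, "dirty": False}}  # one resident line, a copy of memory[0]

def write(addr, value, policy):
    cache[addr]["data"] = value
    if policy == "write-through":
        memory[addr] = value               # memory stays consistent on every write
    else:                                  # write-back
        cache[addr]["dirty"] = True        # defer the memory update

def evict(addr):
    line = cache.pop(addr)
    if line["dirty"]:
        memory[addr] = line["data"]        # write back only on replacement

write(0, 42, "write-back")
print(memory[0])   # still 10: memory is stale until eviction
evict(0)
print(memory[0])   # 42
```

Write-back saves memory traffic at the cost of a window during which memory is stale.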

summary

Memory allocation = fixed allocation + local replacement + unrestricted placement (any free block) + page replacement + write-back consistency.

Cache allocation = dynamic allocation + global replacement + one of several mapping schemes + page replacement among the members of the mapped group + one of several consistency policies.

If you can follow the process above, you should see how closely memory allocation and cache allocation parallel each other.

file management

Hard link and soft link

I keep forgetting these two concepts; it is hopeless.

  • Soft link: like creating a shortcut on Windows. When the original program is uninstalled, the shortcut remains, but opening it prompts "the application cannot be found". The directory entry of a soft-link file contains only the path name of the target file, and the entry is marked as a "link".
  • Hard link: the directory entries of the hard link and the target file point to the same inode (index allocation) or the same FCB (linked allocation). Deleting one link decrements the file's reference count by 1; only when the count reaches 0 is the file actually deleted.

Physical allocation of files on disk

Directory entry, inode node, FCB

Operating systems have many terms, and they all sound impressive.

Both the inode and the FCB are ways of filling in a directory entry. A directory entry is the information describing a file (including its name and its location on disk).

An inode is an FCB with the file-name information removed; the remaining information, bundled together, gets a new name.

Contiguous allocation

The file is divided into equal-sized disk blocks and stored contiguously on disk. The FCB records the starting block number and the size.

Implicit link allocation

Besides file content, each disk block sets aside a small area as a pointer to the next disk block. The FCB only needs to record the starting block number.

Explicit link allocation

The pointer information is gathered into a FAT (file allocation table). The table is one-dimensional: the row number is a physical block number, and the row's content is the number of the next block in the chain.

The FCB only records the starting physical block number; the rest of the chain is found by querying the FAT.
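Following a file's chain through the FAT is then a simple table walk (the table contents below are invented; -1 marks end of file):

```python
# Hypothetical FAT: fat[i] holds the block that follows block i; -1 ends the chain.
fat = {2: 5, 5: 9, 9: -1, 3: 7, 7: -1}

def file_blocks(start):
    """Return all physical blocks of the file whose first block is `start`."""
    blocks = []
    block = start
    while block != -1:
        blocks.append(block)
        block = fat[block]     # the link lives in the table, not inside the block
    return blocks

print(file_blocks(2))  # [2, 5, 9]
```

Because the links live in the table rather than in the blocks, the chain can be followed without reading the data blocks themselves.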

Index Allocation

Each file has its own index table, stored on disk; the table lists all the physical blocks of the file on disk.

The FCB records the physical block number of the index table.

IO system

Program interrupt

When the CPU reads disk data, the disk outputs slowly while the CPU reads fast. The disk has a data register that holds only one byte.

While the disk outputs, the CPU does not busy-wait; instead, whenever the data register fills, the disk calls the CPU (raises an interrupt request).

DMA

On top of interrupt-driven I/O, add a DMA controller. At the start, the CPU tells the DMA controller how many bytes to read. Each time the disk's data register fills, the DMA controller directly requests the bus and transfers the data to memory (this competes with the CPU for bus cycles). Once the specified number of bytes has been read, the DMA controller raises an interrupt so the CPU can handle the follow-up work.

Reference article

https://zhuanlan.zhihu.com/p/23755202

Last modification: March 25, 2019