Memory Mapped Files

When first seeing such usage, then many developers may think that it is pretty silly, because they expect an implementation like:

A memory mapped file API could work that way, but on all common platforms it works completely differently in a way that makes much more sense.

Memory mapped files are tied closedly to the virtual memory systems that all main stream OS has today and the associated paging mechanism.

while an OS *with* virtual memory may run multiple applications conceptually like:

But what happens when there is not enough physical memory (RAM) for all programs?

Well in that case the OS use disk as backing storage. Heap data and stack data can be in pagefile instead of RAM. Code already exist on disk.

Today most computers has so much RAM that actual paging of data is rare. But the mechanism still exist in the OS.

So with a memory mapped file then typical only the actual used parts of the file will be in memory. And that works well with big files.

Demo:

Memory mapped files make most sense for big files, so we will demo with a 1 GB file.

The demo will test the time to count the number of lines. It is obviously not a realistic example, but it illustrates the technique.

For memory mapped file the test code will read all bytes sequentially using a a random access API.

Code:

File generation:

Read all lines into memory:

package mapfile;

import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.List;

public class Read1 {
    private static void test(String fnm) throws IOException {
        System.out.printf("%s %s / %s %s\n", System.getProperty("java.vendor"), System.getProperty("java.version"), System.getProperty("os.name"), System.getProperty("os.version"));
        long t1 = System.currentTimeMillis();
        List<String> lines = Files.readAllLines(Paths.get(fnm));
        System.out.printf("%d lines\n", lines.size());
        long t2 = System.currentTimeMillis();
        System.out.printf("Read all lines : %d ms\n", t2 - t1);
    }
    public static void main(String[] args) throws IOException {
        for(int i = 0; i < 3; i++) {
            test(args[0]);
        }
    }
}

using System;
using System.IO;

namespace Read1
{
    public class Program
    {
        private static void Test(string fnm)
        {
            Console.WriteLine("{0} / {1}", Environment.Version, Environment.OSVersion);
            DateTime dt1 = DateTime.Now;
            string[] lines = File.ReadAllLines(fnm);
            Console.WriteLine("{0} lines", lines.Length);
            DateTime dt2 = DateTime.Now;
            Console.WriteLine("Read all lines : {0} ms", (dt2 - dt1).Ticks / TimeSpan.TicksPerMillisecond);
        }
        public static void Main(string[] args)
        {
            for(int i = 0; i < 3; i++)
            {
                Test(args[0]);
            }
        }
    }
}

import mmap
import time
import sys
import platform

def test(fnm):
    print('%s / %s' % (sys.version, platform.platform()))
    with open(fnm, 'r') as f:
        t1 = time.time()
        lines = f.readlines()
        print('%d lines' % (len(lines)))
        t2 = time.time()
        print('Read all lines : %d ms' % ((t2 - t1) * 1000.0))

for i in range(3):
    test(sys.argv[1])

Read one line at a time into memory:

package mapfile;

import java.io.BufferedReader;
import java.io.FileReader;
import java.io.IOException;

public class Read2 {
    private static void test(String fnm) throws IOException {
        System.out.printf("%s %s / %s %s\n", System.getProperty("java.vendor"), System.getProperty("java.version"), System.getProperty("os.name"), System.getProperty("os.version"));
        long t1 = System.currentTimeMillis();
        try(BufferedReader br = new BufferedReader(new FileReader(fnm))) {
            int n = 0;
            @SuppressWarnings("unused")
            String line;
            while((line = br.readLine()) != null) {
                n++;
            }
            System.out.printf("%d lines\n", n);
        }
        long t2 = System.currentTimeMillis();
        System.out.printf("Read one line at a time : %d ms\n", t2 - t1);
    }
    public static void main(String[] args) throws IOException {
        for(int i = 0; i < 3; i++) {
            test(args[0]);
        }
    }
}

#include <stdio.h>

#include "high_res_timer.h"

static void test(const char *fnm)
{
    FILE *fp;
    char line[1000];
    TIMECOUNT_T t1, t2;
    int n;
    t1 = GET_TIMECOUNT;
    fp = fopen(fnm, "r");
    n = 0;
    while(fgets(line, sizeof(line), fp))
    {
        n++;
    }
    printf("%d lines\n", n);
    fclose(fp);
    t2 = GET_TIMECOUNT;
    printf("Read one line at a time : %d ms\n", (int)((t2 - t1) * 1000 / UNITS_PER_SECOND));
}

int main(int argc, char *argv[])
{
    int i;
    for(i = 0; i < 3; i++)
    {
        test(argv[1]);
    }
    return 0;
}

program Read2(input, output);

{$APPTYPE CONSOLE}

uses
  SysUtils, Windows;

procedure test(fnm : string);

var
  f : text;
  line : string;
  t1, t2 : Cardinal;
  n : integer;

begin
  t1 := GetTickCount;
  assign(f, fnm);
  reset(f);
  n := 0;
  while not eof(f) do begin
    readln(f, line);
    n := n + 1;
  end;
  writeln(n:1,' lines');
  close(f);
  t2 := GetTickCount;
  writeln('Read one line at a time : ', (t2 - t1):1, ' ms');
end;

var
  i : integer;

begin
  for i := 1 to 3 do begin
    test(paramStr(1));
  end;
end.

[inherit('sys$library:pascal$lib_routines')]
program Read2(input, output);

type
    string = varying[32767] of char;

procedure test(fnm : string);

var
  f : text;
  line : string;
  t1, t2 : Cardinal;
  n : integer;

begin
  t1 := Clock;
  open(f, fnm, old);
  reset(f);
  n := 0;
  while not eof(f) do begin
    readln(f, line);
    n := n + 1;
  end;
  writeln(n:1,' lines');
  close(f);
  t2 := Clock;
  writeln('Read one line at a time : ', (t2 - t1):1, ' ms');
end;

var
  i : integer;
  cmdlin : string;

begin
  lib$get_foreign(resultant_string:=cmdlin.body,resultant_length:=cmdlin.length);
  for i := 1 to 3 do begin
    test(cmdlin);
  end;
end.

using System;
using System.IO;

namespace Read2
{
    public class Program
    {
        private static void Test(string fnm)
        {
            Console.WriteLine("{0} / {1}", Environment.Version, Environment.OSVersion);
            DateTime dt1 = DateTime.Now;
            using(StreamReader sr = new StreamReader(fnm))
            {
                int n = 0;
                string line;
                while((line = sr.ReadLine()) != null)
                {
                    n++;    
                }
                Console.WriteLine("{0} lines", n);
            }
            DateTime dt2 = DateTime.Now;
            Console.WriteLine("Read one line at a time : {0} ms", (dt2 - dt1).Ticks / TimeSpan.TicksPerMillisecond);
        }
        public static void Main(string[] args)
        {
            for(int i = 0; i < 3; i++)
            {
                Test(args[0]);
            }
        }
    }
}

import mmap
import time
import sys
import platform

def test(fnm):
    print('%s / %s' % (sys.version, platform.platform()))
    with open(fnm, 'r') as f:
        t1 = time.time()
        n = 0
        for line in f:
            n = n + 1
        print('%d lines' % (n))
        t2 = time.time()
        print('Read one line at a time : %d ms' % ((t2 - t1) * 1000.0))

for i in range(3):
    test(sys.argv[1])

Using memory mapped file:

Different languages/technologies have different API's available for memory mapped files.

*) Note that today the *nix API is widely available. It can be used on Windows using Cygwin. It can be used on VMS as well.

Pascal also use platform API's. Delphi use the Win32 API. VMS Pascal us ethe VMS API.

package mapfile;

import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.channels.FileChannel;
import java.nio.channels.FileChannel.MapMode;
import java.nio.file.Paths;

public class Read3 {
    private static final int OFFSET = 0;
    private static final int SIZE = 1000000000;
    private static void test(String fnm) throws IOException {
        System.out.printf("%s %s / %s %s\n", System.getProperty("java.vendor"), System.getProperty("java.version"), System.getProperty("os.name"), System.getProperty("os.version"));
        long t1 = System.currentTimeMillis();
        try(FileChannel fc = FileChannel.open(Paths.get(fnm))) {
            ByteBuffer bb = fc.map(MapMode.READ_ONLY, OFFSET, SIZE);
            bb.rewind();
            int n = 0;
            for(int ix = 0; ix < SIZE; ix++) {
                byte b = bb.get(ix);
                if(b == '\n') {
                    n++;
                }
            }
            System.out.printf("%d lines\n", n);
        }
        long t2 = System.currentTimeMillis();
        System.out.printf("Map file to memory : %d ms\n", t2 - t1);
    }
    public static void main(String[] args) throws IOException {
        for(int i = 0; i < 3; i++) {
            test(args[0]);
        }
    }
}

#include <stdio.h>
#include <stdlib.h>

#include <windows.h>

#include "high_res_timer.h"

#define OFFSET 0
#define SIZE 1000000000

static void test(const char *fnm)
{
    HANDLE f;
    HANDLE mf;
    char *b;
    TIMECOUNT_T t1, t2;
    int n, ix;
    t1 = GET_TIMECOUNT;
    f = CreateFile(fnm, GENERIC_READ, 0, NULL, OPEN_EXISTING, FILE_ATTRIBUTE_NORMAL, NULL);
    if(f == INVALID_HANDLE_VALUE)
    {
        printf("Error opening file\n");
        exit(1);
    }
    mf = CreateFileMapping(f, NULL, PAGE_READONLY, 0, SIZE, NULL);
    if(mf == NULL)
    {
        printf("Error mapping file\n");
        exit(1);
    }
    b = MapViewOfFile(mf, FILE_MAP_READ, 0, OFFSET, SIZE);
    n = 0;
    for(ix = 0; ix < SIZE; ix++)
    {
        if(b[ix] == '\n')
        {
            n++;
        }
    }
    printf("%d lines\n", n);
    CloseHandle(mf);
    CloseHandle(f);
    t2 = GET_TIMECOUNT;
    printf("Map file to memory : %d ms\n", (int)((t2 - t1) * 1000 / UNITS_PER_SECOND));
}

int main(int argc, char *argv[])
{
    int i;
    for(i = 0; i < 3; i++)
    {
        test(argv[1]);
    }
    return 0;
}

#include <stdio.h>
#include <stdlib.h>
#include <unistd.h>
#include <fcntl.h>
#include <sys/mman.h>

#include "high_res_timer.h"

#define OFFSET 0
#define SIZE 1000000000

static void test(const char *fnm)
{
    int fd;
    char *b;
    TIMECOUNT_T t1, t2;
    int n, ix;
    t1 = GET_TIMECOUNT;
    fd = open(fnm, O_RDONLY);
    if(fd == -1)
    {
        printf("Error opening file\n");
        exit(1);
    }
    b = mmap(NULL, SIZE, PROT_READ, MAP_PRIVATE, fd, OFFSET);
    if(b == MAP_FAILED)
    {
        printf("Error mapping file\n");
        exit(1);
    }
    n = 0;
    for(ix = 0; ix < SIZE; ix++)
    {
        if(b[ix] == '\n')
        {
            n++;
        }
    }
    printf("%d lines\n", n);
    munmap(b, SIZE);
    close(fd);
    t2 = GET_TIMECOUNT;
    printf("Map file to memory : %d ms\n", (int)((t2 - t1) * 1000 / UNITS_PER_SECOND));
}

int main(int argc, char *argv[])
{
    int i;
    for(i = 0; i < 3; i++)
    {
        test(argv[1]);
    }
    return 0;
}

#include <stdio.h>
#include <string.h>
#include <stdlib.h>

#include <fabdef.h>
#include <secdef.h>
#include <starlet.h>

#define PAGE_SIZE 8192
#define PAGELET_SIZE 512
#define DISKBLOCK_SIZE 512

#define BY2U(n, unit_size) (n - 1) / unit_size + 1

#include "high_res_timer.h"

#define OFFSET 0
#define SIZE 1000000000

static void test(char *fnm)
{
    struct FAB myfab;
    void *range[2];
    unsigned short chan;
    long int stat;
    char *b;
    TIMECOUNT_T t1, t2;
    int n, ix;
    t1 = GET_TIMECOUNT;
    myfab = cc$rms_fab;
    myfab.fab$l_fna = fnm;
    myfab.fab$b_fns = strlen(fnm);
    myfab.fab$l_fop = FAB$M_UFO;
    myfab.fab$b_fac = FAB$M_GET;
    stat = sys$open(&myfab, 0, 0);
    if((stat & 1) == 0)
    {
        printf("Error opening file\n");
        exit(1);
    }
    range[0] = 0;
    range[1] = 0;
    chan = myfab.fab$l_stv;
    stat = sys$crmpsc(range, range, 0, SEC$M_EXPREG, 0, 0, 0, chan, BY2U(SIZE, PAGELET_SIZE), BY2U(OFFSET, DISKBLOCK_SIZE), 0, 0);
    if((stat & 1) == 0) 
    {
        printf("Error mapping file\n");
        exit(1);
    }
    b = range[0];
    n = 0;
    for(ix = 0; ix < SIZE; ix++)
    {
        if(b[ix] == '\n')
        {
            n++;
        }
    }
    printf("%d lines\n", n);
    sys$deltva(range, 0, 0);
    sys$close(&myfab, 0, 0);
    t2 = GET_TIMECOUNT;
    printf("Map file to memory : %d ms\n", (int)((t2 - t1) * 1000 / UNITS_PER_SECOND));
}

int main(int argc, char *argv[])
{
    int i;
    for(i = 0; i < 3; i++)
    {
        test(argv[1]);
    }
    return 0;
}

program Read3;

{$APPTYPE CONSOLE}

uses
  SysUtils, Windows;

procedure test(fnm : string);

const
  OFFSET = 0;
  SIZE = 1000000000;

type
  data = packed array [1..SIZE] of char;

var
  f : THandle;
  mf : THandle;
  b : ^data;
  t1, t2 : Cardinal;
  n, ix : integer;

begin
  t1 := GetTickCount;
  f := CreateFile(PChar(fnm), GENERIC_READ, 0, nil, OPEN_EXISTING, FILE_ATTRIBUTE_NORMAL, 0);
  if f = INVALID_HANDLE_VALUE then begin
    writeln('Error opening file');
    halt;
  end;
  mf := CreateFileMapping (f, nil, Page_ReadOnly, 0, SIZE, nil);
  if mf = 0 then begin
    writeln('Error mapping file');
    halt;
  end;
  b := MapViewOfFile(mf, FILE_MAP_READ, 0, OFFSET, SIZE);
  n := 0;
  for ix := 1 to SIZE do begin
    if b^[ix] = chr(10) then begin
      n := n + 1;
    end;
  end;
  writeln(n:1,' lines');
  CloseHandle(mf);
  CloseHandle(f);
  t2 := GetTickCount;
  writeln('Read one line at a time : ', (t2 - t1):1, ' ms');
end;

var
  i : integer;

begin
  for i := 1 to 3 do begin
    test(paramStr(1));
  end;
end.

[inherit('sys$library:pascal$lib_routines', 'sys$library:starlet')]
program Read3(input, output);

type
    string = varying[32767] of char;
    word = [word] 0..65535;

procedure test(fnm : string);

const
  PAGE_SIZE = 8192;
  PAGELET_SIZE = 512;
  DISKBLOCK_SIZE = 512;
  OFFSET = 0;
  SIZE = 1000000000;

function BY2U(n : integer; unit_size : integer) : integer;

begin
  BY2U := (n - 1) div unit_size + 1;
end;

type
  data = packed array [1..SIZE] of char;

var
  myfab : fab$type;
  range : array [1..2] of pointer;
  chan : word;
  stat : integer;
  b : ^data;
  t1, t2 : Cardinal;
  n, ix : integer;

begin
  t1 := Clock;
  myfab.fab$b_bid := FAB$C_BID;
  myfab.fab$b_bln := FAB$C_BLN;
  myfab.fab$l_fna := iaddress(fnm.body);
  myfab.fab$b_fns := fnm.length;
  myfab.fab$l_fop := FAB$M_UFO;
  myfab.fab$b_fac := FAB$M_GET;
  myfab.fab$w_ifi := 0;
  myfab.fab$b_shr := 0;
  myfab.fab$l_nam := 0;
  stat := $open(fab := myfab);
  if not odd(stat) then begin
    writeln('Error opening file');
    halt;
  end;
  range[1] := nil;
  range[2] := nil;
  chan := myfab.fab$l_stv;
  stat := $crmpsc(inadr := range, retadr := range, flags := SEC$M_EXPREG, chan := chan, pagcnt := BY2U(SIZE, PAGELET_SIZE), vbn := BY2U(OFFSET, DISKBLOCK_SIZE));
  if not odd(stat) then begin
    writeln('Error mapping file');
    halt;
  end;
  b := range[1];
  n := 0;
  for ix := 1 to SIZE do begin
    if b^[ix] = chr(10) then begin
      n := n + 1;
    end;
  end;
  writeln(n:1,' lines');
  $deltva(inadr := range);
  $close(fab := myfab);
  t2 := Clock;
  writeln('Read one line at a time : ', (t2 - t1):1, ' ms');
end;

var
  i : integer;
  cmdlin : string;

begin
  lib$get_foreign(resultant_string:=cmdlin.body,resultant_length:=cmdlin.length);
  for i := 1 to 3 do begin
    test(cmdlin);
  end;
end.

using System;
using System.IO;
using System.IO.MemoryMappedFiles;

namespace Read3
{
    public class Program
    {
        private const int OFFSET = 0;
        private const int SIZE = 1000000000;
        private static void Test(string fnm)
        {
            Console.WriteLine("{0} / {1}", Environment.Version, Environment.OSVersion);
            DateTime dt1 = DateTime.Now;
            using (MemoryMappedFile mmf = MemoryMappedFile.CreateFromFile(fnm, FileMode.Open))
            {
                using(MemoryMappedViewAccessor mmva = mmf.CreateViewAccessor(OFFSET, SIZE, MemoryMappedFileAccess.Read))
                {
                    int n = 0;
                    for(int ix = 0; ix < SIZE; ix++)
                    {
                        byte b = mmva.ReadByte(ix);
                        if(b == '\n')
                        {
                            n++;
                        }
                    }
                    Console.WriteLine("{0} lines", n);
                }
            }
            DateTime dt2 = DateTime.Now;
            Console.WriteLine("Map file to memory : {0} ms", (dt2 - dt1).Ticks / TimeSpan.TicksPerMillisecond);
        }
        public static void Main(string[] args)
        {
            for(int i = 0; i < 3; i++)
            {
                Test(args[0]);
            }
        }
    }
}

import mmap
import time
import sys
import platform

SIZE = 1000000000

def test(fnm):
    print('%s / %s' % (sys.version, platform.platform()))
    with open(fnm, 'r+b') as f:
        t1 = time.time()
        mm = mmap.mmap(f.fileno(), SIZE, access=mmap.ACCESS_READ)
        n = 0
        for ix in range(SIZE):
            mm.seek(ix)
            b = mm.read_byte()
            if b == ord('\n') or b == '\n': # little trick - the first works with Python 3.x - the second works with Python 2.x
                n = n + 1
        print('%d lines' % (n))
        t2 = time.time()
        print('Map file to memory : %d ms' % ((t2 - t1) * 1000.0))

for i in range(3):
    test(sys.argv[1])

Results:

*) On VMS Itanium the file is only 100 MB not 1 GB to compensate for old HW, old SW and traditional slow IO on VMS. The VMS file is in Stream LF record format - not Variable record format.

We see that the benfits of memory mapped files does not vary much for the OS tested (but it may for other OS!).

We see that the benfits of memory mapped files does not vary much for disk type (SSD vs HDD). This is somewhat unexpected, but could be due to the fact that even though the file is big (1 GB) then it can still be cached in memory.

Conclusion:

If one need to process big files and performance is important then memory mapped files are woth considering.

But decision will depend on language and platform. Doing some research is a must before going with memory mapped files.

Also note that even though memory mapped files may be an efficient technique, then it is also a very low level technique exposing the physical file format to the application. Example: for a text file it will be visible to the application code whether the platform use CRLF like Windows, LF like *nix or a more complex format. That should also be considered before going with memory mapped files.

Database:

The demo above was accessing the file in readonly mode. That may be the most common use case, but it is also possible to access the file as readwrite.

When having a readwrite memory mapped file, then it can be used as a no effort database. All changes to memory data structures get persisted to the file by the OS without any update/save code in the application.

The example will be using C, because more high level languages typical already have features to make very easy persistence (ORM, NoSQL databases exposing a collection API etc.).

The example will be using the *nix API as it is more portable than any other C API.

Starting code:

Obviously data_add_item should check for overflow, but that is besides what we are trying to show.

Traditional approach:

Now comes a new requirement: persist the data so they are kept between runs. And without writing the entire file every time (not a problem for this example as data is only 6 KB, but if data was 6 TB, then it would be a problem).

It works fine, but it is a very intrusive change as persisting code has to be inserted everywhere in data updating code.

Using memory mapped file:

No changes at all to the data updating code at all. The above uses the same functions as the in-memory-only example. The code just update memory and the OS do the persisting transparently.

IPC:

If a readwrite memory mapped file is accessed in shared mode, then it can be used for communication between processes. And due to persistence then the processes is not required to be running at the same time.

Note that this may require some external synchronization to work reliable in many cases.

Language	OS	Disk type	Read all lines	Read one line at a time	Map file to memory
Java 17	Windows 10	SSD	5868	3507	636
Java 17	Windows 10	HDD	6475	3447	653
MSVC++ 19 - Win32 API	Windows 10	SSD	N/A	6243	743
MSVC++ 19 - Win32 API	Windows 10	HDD	N/A	6433	761
GCC 11 - *nix API	Windows 10 + Cygwin	SSD	N/A	7739	610
GCC 11 - *nix API	Windows 10 + Cygwin	HDD	N/A	8009	582
FPC 3 - Win32 API	Windows 10	SSD	N/A	32672	1125
FPC 3 - Win32 API	Windows 10	HDD	N/A	31484	1172
.NET 4	Windows 10	SSD	OutOfMemoryException	3750	69019
.NET 4	Windows 10	HDD	OutOfMemoryException	3633	69706
.NET 7	Windows 10	SSD	14111	3329	17510
.NET 7	Windows 10	HDD	13304	3017	16845
CPython 3.11	Windows 10	SSD	10559	12275	152126
CPython 3.11	Windows 10	HDD	10621	11486	149158
CPython 2.7	Windows 10	SSD	7371	6157	233756
CPython 2.7	Windows 10	HDD	7485	5996	234865
PyPy 3.10	Windows 10	SSD	40431	8442	2573
PyPy 3.10	Windows 10	HDD	35070	8206	2710
PyPy 2.7	Windows 10	SSD	21898	118418	4710
PyPy 2.7	Windows 10	HDD	23175	117784	4973
Java 17	Ubuntu / Linux kernel 5	SSD	OutOfMemoryError	3282	401
GCC 11 *nix API	Ubuntu / Linux kernel 5	SSD	N/A	2407	188
.NET 7	Ubuntu / Linux kernel 5	SSD	15550	2917	17315
CPython 3.10	Ubuntu / Linux kernel 5	SSD	8882	6484	160869
CPython 2.7	Ubuntu / Linux kernel 5	SSD	4332	3878	Killed
GraalPython 3.10 / Java 17	Ubuntu / Linux kernel 5	SSD	Killed	8756	55210
Java 8	VMS 8.4 Itanium (*)	HDD	Manually aborted after 8 minutes	1514	559
C - VMS API	VMS 8.4 Itanium (*)	HDD	N/A	2927	358
C - *nix API	VMS 8.4 Itanium (*)	HDD	N/A	2927	454
VMS Pascal - VMS API	VMS 8.4 Itanium (*)	HDD	N/A	31040	510
Python 3.10	VMS 8.4 Itanium (*)	HDD	8319	17141	345261

Language	Memory Mapped Files
C other native languages (Pascal etc.) Java	Good
newer .NET PyPy	OK
older .NET Python except PyPy	Bad

Memory Mapped Files

Content:

Introduction:

Theory:

Demo:

Code:

File generation:

Read all lines into memory:

Read one line at a time into memory:

Using memory mapped file:

Results:

Conclusion:

Database:

Starting code:

Traditional approach:

Using memory mapped file:

IPC:

Article history:

Other articles:

Comments: