The MASM Forum Archive 2004 to 2012

General Forums => The Workshop => Topic started by: jj2007 on June 01, 2011, 11:21:20 AM

Title: Memory-mapped files: Reading beyond the border
Post by: jj2007 on June 01, 2011, 11:21:20 AM
mov pMap, rv(MapViewOfFile, eax, FILE_MAP_READ, 0, 0, 10000h) returns a pointer to mapped memory.
This loop...
mov esi, pMap
add esi, 10000h-10
xor ecx, ecx
.Repeat
movzx eax, byte ptr [esi]
inc esi
inc ecx
.Until ecx>20

... goes bang when esi reaches the end of the allocated memory. Olly can handle this, but my code can't. So a simple len(esi) would cause an access violation if esi happens to be near the border. How would you handle this problem? SEH?
Title: Re: Memory-mapped files: Reading beyond the border
Post by: dedndave on June 01, 2011, 11:40:16 AM
that's how C programmers would do it   :P
they have something called "try" - lol

isn't there a way to get the size of the file ?
another approach - extend the file beyond the data size a little bit
Title: Re: Memory-mapped files: Reading beyond the border
Post by: clive on June 01, 2011, 12:34:02 PM
Well the lazy route is to use an SEH to catch the page fault. But the real solution is to recognize you have a limit, and handle the spanning explicitly. You need to have a len(esi) function that is aware of where the end is, and act accordingly.

(http://trus.imageg.net/graphics/product_images/pTRU1-5754458_alternate2_reg.jpg)
Title: Re: Memory-mapped files: Reading beyond the border
Post by: drizz on June 01, 2011, 02:01:06 PM
I see that some lessons are still not learned!

http://www.masm32.com/board/index.php?topic=1807.msg81031#msg81031 -- actively participated
http://www.masm32.com/board/index.php?topic=14353.0 -- participated
http://www.masm32.com/board/index.php?topic=10925.msg80337#msg80337 -- you are the thread starter

back to the drawing board.
Title: Re: Memory-mapped files: Reading beyond the border
Post by: jj2007 on June 01, 2011, 02:33:41 PM
Quote from: drizz on June 01, 2011, 02:01:06 PM
I see that some lessons are still not learned!

http://www.masm32.com/board/index.php?topic=1807.msg81031#msg81031 -- actively participated
http://www.masm32.com/board/index.php?topic=14353.0 -- participated
http://www.masm32.com/board/index.php?topic=10925.msg80337#msg80337 -- you are the thread starter

back to the drawing board.

drizz,

I am aware of my own threads, don't worry. I just had hoped that somebody had a more elegant solution than SEH, e.g. an API that makes the page behind the border writeable, and allows to poke a nullbyte at the border ::)
Title: Re: Memory-mapped files: Reading beyond the border
Post by: qWord on June 01, 2011, 02:46:07 PM
Quote from: jj2007 on June 01, 2011, 02:33:41 PMI just had hoped that somebody had a more elegant solution than SEH, e.g. an API that makes the page behind the border writeable, and allows to poke a nullbyte at the border ::)
you know VirtualProtect (http://msdn.microsoft.com/en-us/library/aa366898(v=vs.85).aspx)(+VirtualAlloc)?
Title: Re: Memory-mapped files: Reading beyond the border
Post by: drizz on June 01, 2011, 02:49:20 PM
QuoteSo a simple len(esi) would cause an access violation if esi happens to be near the border
Is null byte inside the mapped region?
Title: Re: Memory-mapped files: Reading beyond the border
Post by: jj2007 on June 01, 2011, 03:31:48 PM
Quote from: drizz on June 01, 2011, 02:49:20 PM
QuoteSo a simple len(esi) would cause an access violation if esi happens to be near the border
Is null byte inside the mapped region?

Yeah, you are right. I'll have to redesign the whole thing. I have a 2.x Gigabyte ASCII file from which I want to extract floats - and it needs to be fast as you can imagine. So I thought of streaming it in with a 64k window; but the more I think about it, the less attractive more stupid seems the idea to test for end of streaming window using a nullbyte... so I must rewrite the parser, which I wanted to avoid.

Thanks anyway :thumbu
Title: Re: Memory-mapped files: Reading beyond the border
Post by: clive on June 01, 2011, 11:30:52 PM
Quote from: jj2007
I just had hoped that somebody had a more elegant solution than SEH, e.g. an API that makes the page behind the border writeable, and allows to poke a nullbyte at the border ::)
You map 0x11000 from the file, and then advance the offset into the file 0x10000 at a time.

Each time your pointer into the mapping exceeds 0x10000 you move the mapping forward 0x10000, and dereference the pointer back by 0x10000, so the new reference falls in the 0x000-0xFFF range in the front of new mapping.

This will work for strings or packets, or whatever that might span, but don't exceed 4KB.

Adjust the mapping size, and overlap according to your needs.
Title: Re: Memory-mapped files: Reading beyond the border
Post by: jj2007 on June 02, 2011, 07:00:56 AM
Quote from: clive on June 01, 2011, 11:30:52 PMYou map 0x11000 from the file, and then advance the offset into the file 0x10000 at a time.

Thanks, Clive - good idea :U

P.S.: Memory mapped file EOF (http://social.msdn.microsoft.com/Forums/en-US/vcgeneral/thread/a9d5009f-5bdc-4711-920e-0e5bc1292c81) - another tricky question :wink
Title: Re: Memory-mapped files: Reading beyond the border
Post by: hutch-- on June 02, 2011, 07:30:05 AM
JJ,

You are safer using the file system file length rather than an EOF character. I have recently written a scanner for files where you use the normal CreateFile() ReadFile() as this handles the read length easily but it will depend on the type of data you are reading, text or binary data. If I am processing the data myself I tend to read it in larger chunks but depending on the data type you may need a strategy for handling target data that falls across the boundary that you set for chunks.
Title: Re: Memory-mapped files: Reading beyond the border
Post by: jj2007 on June 02, 2011, 08:12:22 AM
Quote from: hutch-- on June 02, 2011, 07:30:05 AM
JJ,

You are safer using the file system file length rather than an EOF character.

Hutch,

I haven't seen an EOF character since the days of DOS, and I do not plan to rely on one, don't worry :bg
Title: Re: Memory-mapped files: Reading beyond the border
Post by: FORTRANS on June 02, 2011, 12:42:31 PM
Quote from: jj2007 on June 02, 2011, 08:12:22 AM
I haven't seen an EOF character since the days of DOS, and I do not plan to rely on one, don't worry :bg

Hi,

   And it is not really needed in DOS for anything except for
CP/M compatibility.  Its main use seemed to be truncating
binary files randomly when using the COPY command.

Cheers,

Steve N.
Title: Re: Memory-mapped files: Reading beyond the border
Post by: Rockoon on June 02, 2011, 11:46:02 PM
The EOF character actually comes from the pre-DOS days of CP/M, because that OS used a file system that only stored the number of sectors allocated to the file (it had no concept of byte length)
Title: Re: Memory-mapped files: Reading beyond the border
Post by: FlySky on July 31, 2011, 02:45:09 PM
Hey guys,

I am playing with the MapViewOfFile API and somehow it's not doing what I want.

I have an 50 MB file which I map into the Memory using:

      Invoke CreateFile, [FilePath], GENERIC_READ, FILE_SHARE_READ, NULL, OPEN_EXISTING, FILE_ATTRIBUTE_NORMAL, 0
     Cmp Eax, INVALID_HANDLE_VALUE
     Jne > FileMapping
     Invoke MessageBox, [hWnd], 'File not created succesfully', 'File not created succesfully', MB_OK
FileMapping:
     Mov [hFile], Eax
     Invoke CreateFileMapping, [hFile], 0, PAGE_READONLY, 0, 0, 0
   Cmp Eax, NULL
   Jne > MapFile
   Invoke MessageBox, [hWnd], 'CreateFileMapping failed', 'CreateFileMapping failed', MB_OK
   Invoke CloseHandle, [hFile]
MapFile:
     Mov [hMap], Eax
;     Invoke GetFileSize, [hFile], NULL
;     Mov [FileSize], Eax
     Invoke MapViewOfFile, [hMap], FILE_MAP_READ, 0, 0, 0
     Cmp Eax, NULL
     Jne > StartCheckingPE
     Invoke MessageBox, [hWnd], 'MapViewOfFile failed', 'MapViewOfFile failed', MB_OK
     Invoke CloseHandle, [hMap]

Here is the thing:

The MapViewOfFile returns a pointer to the mapped view for example: 05A70000

When looking at that view I can see that at offset 3C is the offset to the PEHeader. That seems to go perfectly fine.

The thing is it somehow is not mapping the file complete / right.

For example:

The file on the disk has:

At offset: 1D86049

02186049 9C                                            pushf
0218604A 50                                            push    eax
0218604B 51                                            push    ecx
0218604C 52                                            push    edx
0218604D 53                                            push    ebx
0218604E 54                                            push    esp

When I looked at the Mapped view:

I get at the same offset: 1D86049 so at address 5A70000 + 1D86049 = 77F6049

MZ_:077F6049 db  61h ; a
MZ_:077F604A db 0D6h ; Í
MZ_:077F604B db  70h ; p
MZ_:077F604C db  3Dh ; =
MZ_:077F604D db 0DFh ; ¯
MZ_:077F604E db  2Bh ; +
MZ_:077F604F db  59h ; Y

See the bytes pattern does not match. Now when I search the pattern: 9C 50 51 52 53 54

MZ_:06DEE249 db  9Ch ; £
MZ_:06DEE24A db  50h ; P
MZ_:06DEE24B db  51h ; Q
MZ_:06DEE24C db  52h ; R
MZ_:06DEE24D db  53h ; S
MZ_:06DEE24E db  54h ; T
MZ_:06DEE24F db  55h ; U
MZ_:06DEE250 db  56h ; V
MZ_:06DEE251 db  57h ; W

Now when I substract: 06DEE249 - 5A70000 = 137E249

This is an completely different offset. Is there any limitation on the MapViewOfFile API? I don't understand why it doesn't map the file right.

Title: Re: Memory-mapped files: Reading beyond the border
Post by: qWord on July 31, 2011, 03:06:03 PM
Quote from: FlySky on July 31, 2011, 02:45:09 PMThe file on the disk has:

At offset: 1D86049

02186049 9C                                            pushf
0218604A 50                                            push    eax
0218604B 51                                            push    ecx
0218604C 52                                            push    edx
0218604D 53                                            push    ebx
0218604E 54                                            push    esp
You got this information from a hex editor or from a debugger/disassembler?
Title: Re: Memory-mapped files: Reading beyond the border
Post by: FlySky on July 31, 2011, 03:58:02 PM
Got it with IDA Pro loading the file from my disk.
Title: Re: Memory-mapped files: Reading beyond the border
Post by: qWord on July 31, 2011, 04:23:30 PM
The relative offsets of a loaded PE are not equal to the file offsets. If you want to read or write data in a PE's section, you must parse the file header: Microsoft PE and COFF Specification (http://msdn.microsoft.com/en-us/windows/hardware/gg463119)
Title: Re: Memory-mapped files: Reading beyond the border
Post by: dedndave on July 31, 2011, 08:49:57 PM
here it is in PDF format
http://www.masm32.com/board/index.php?topic=13135.0