NetMon Notes

1. NetMon +
1.1. Kernel Drivers: Visual Studio SetUp
1.3. Firewalls
1.4. Forensics
➪ 1.5. NetMon Notes +
1.5.1. BCDedit

2014-12-29 22:01:27 chip Page 1151 📢 PUBLIC

20150331 - Chip - Mar 31 2015

I was using WinDbg to trace through the highly experimental deferment code when WinDebug764 crashed on boot. This began a three hour tour to restore and update Windows.

20150330 - Chip - Mar 30 2015

I started working on the code to allow the user to interactively allow/deny network connections. This requires deferring operations until the user has a chance to decide. It also meant reviving the WinDebug764 VM so I could step through this code using WinDbg.

20150322 - Chip - Mar 22 2015

NetMon: Process Information

I am now able to retrieve the process name from NetMon.sys, which solves one more problem. Once I had it running I noticed a connection to South Africa by my least favorite process SvcHost.exe. I need to have the ThreadID to know which client is responsible.

OK, that took about 30 minutes. I now have the ThreadID. With this extra bit of information I was able to trace the connection to South Africa back to SvcHost.exe > Dnscache Service > dnsrslvr.dll. I can assume this was a DNS lookup that was being referred to a nameserver in South Africa.

20150320 - Chip - Mar 20 2015

I extracted the core code from GeoDB into its own library project, GeoLib. I then dropped GeoDB.cpp from the Control project and replaced the calls with the GeoLib functions. There was a bit of clean up, and I added a new function "GeoDB::Lookup4_Precheck()" that looks for predefined IP4 ranges (ie 192.168.X.X) and shortcuts the lookup. After about 2 hours, NetMon was running again with improved GeoDB info.

The performance is now very stable and not much (a bit) drag when running FireFox or IE. I now feel comfortable running NetMon in the background on my primary development system (VS12).

The stream capture seems to be stable as well. I captured ~300KB while browsing the Amazon site and I was able to step through all the HTTP GET requests. The stream viewer needs a lot of UI work to become usable and useful.

I found that the HTTPS streams are encoded; I had thought I was capturing the stream outside the encryption. I will need to do some research on this. The relevant docs are found in the VS13 help at
Windows Driver Device and Driver Technologies Network Devices Windows Filtering Platform Callout Drivers Developing IPSec-Compatible Callout Drivers

I bumped the VerProd version to 2.2, indicating a new stability milestone.

****

While writing up my notes during the review of the Driver\Log.cpp code, I realized that the while() loop could be replaced by a set of if-then-else statements, making infinite loops impossible. This made me rethink all the circular FIFO code I have written over the years. I wrote a non-loop version of BufRead():

Driver/Log.cpp: int Log::BufWrite(LOGHDR *pSrc, UINT EntrySz) { int Err= ERR_OK; UINT ChunkSz; BYTE *pSrc1= (BYTE*)pSrc; //First make sure EntrySz will fit into available space. ChunkSz= (BufWr<BufRd) ? BufRd-BufWr : (BufSz-BufWr)+BufRd; if(EntrySz>=ChunkSz) { Err= ERR_NO_ROOM; //A partial item will only cause problems. } else { pSrc->Signature= SIGNATURE_LOGENTRY; pSrc->ByteCt= (UINT16)EntrySz; pSrc->SeqID= ++SeqID; KeQuerySystemTime((LARGE_INTEGER*)&pSrc->Time); //I am now certain EntrySz will fit without overwriting valid data. if(BufWr<BufRd) { //Case 2: Empty space is a single contiguous block memcpy(&pBuf[BufWr],pSrc,EntrySz); if((BufWr+=EntrySz)>=BufSz) BufWr= 0; } else { //Case 1: Empty space is two blocks split by end-of-buffer. ChunkSz= min(EntrySz,BufSz-BufWr); memcpy(&pBuf[BufWr],pSrc1,ChunkSz); //Second chunk may not be necessary. if(ChunkSz<EntrySz) memcpy(&pBuf[0],&pSrc1[ChunkSz],EntrySz-ChunkSz); if((BufWr+=EntrySz)>=BufSz) BufWr-= BufSz; } } BufCt= (BufRd<BufWr) ? (BufWr-BufRd) : (BufWr + (BufSz-BufRd)); return(Err); }

BufRead() can also be rewritten using two if-then-else blocks, one for the header and the second for the body. The performance improvement is nonzero but miniscule, its advantage is rendering infinite loops logically impossible.

I can also drop BufCt completely, relying strictly on BufRd/BufWr. I would replace BufCt with an inline function to calculate it from BufRd and BufWr. BufCt was always redundant.

20150317 - Chip - Mar 17 2015

I noticed that my GeoDB data is over a year old and opened up the GeoDB project to rebuild it. I found that I had neglected to commit the source database to SCM and I never wrote up any notes on the process of obtaining the source data and how to convert it. In addition, maxmind.com has moved to a new data format. So I am working on GeoDB to bring it back to life.

The notes on GeoDB are here: GeoDB

20150316 - Chip - Mar 16 2015

It seems it is the easy things that take the longest...

I wanted to be able to control the sort order in the event list, which looked easy -- just add a call to ConnectSetSort() in response to a double-click message. My first mistake was using the column index directly and the log was sorted on the wrong column. That was an easy fix. I thought it would be a 15 minute task until I noticed that the items started falling out of order after a while. I stared at the sorting code for hours, tried different variations, rewrote it a couple times, added verbose logging, error checking, and debug dumps.

After about six hours, I finally started looking upstream and eventually found the bug in code before the re-sort is called. I was giving the re-sort routine a bad index to the updated item IIF the item was a file access that was newly created.

The event list is now sorting properly.

The driver has been rock-solid for over two days now. I left it installed for autoload, so it is loading up every time VS12 boots. I can now run VS as my normal user and connect to the driver every time, even if the control app crashes. The driver has not caused any problems, even when the hooks are in place without the control app running. In this case, the log buffer is continuously full and cycling.

===*=*===

20150315 - Chip - Mar 15 2015

NetMon can now run with normal user credentials!

This is big news, because popping up the UAC admin dialog is always jarring. Being able to run normally dramatically lowers the annoyance factor for daily use.

I accomplished this by adding a couple options the control what happens during shutdown, then checking for an existing driver before trying to install on startup. I do not need to be an admin to open a handle to a driver that is already loaded.

The new options are:
--fastexit: Exit immediately
--uninstall: Completely uninstall the driver on exit.
--reinstall: Completely reinstall the driver on start.
--nounload: Leave the driver in place on exit.
--autoload: Configure driver to load on boot.
--unhook: Unhook file and net hooks on exit.

To install the driver so that it will always be available for use by normal users, run once as admin:
> NetMonUI --fastexit --reinstall --autoload --nounload --unhook --driver \path\to\NetMon.sys
This configuration loads the NetMon driver during every boot, but it remains inactive (no hooks) until NetMonUI runs. I can now run NetMonUI as a normal user anytime.

To install the driver so that it will only be loaded on demand, run once as admin:
> NetMonUI --fastexit --reinstall --driver \path\to\NetMon.sys
To start the driver on demand, I need to run as admin:
> NetMonUI --fastexit --nounload --unhook
Now I can run NetMon as a normal user during the current session:
> NetMonUI
This configuration does not load the NetMon driver at boot, the admin must manually start the driver after each boot.

To uninstall the driver so that it never loads, run as admin:
> NetMonUI --fastexit --uninstall

Crash: IpHelper::SendQueryV() crashed on exit. I am assuming the thread was closed before getaddrinfo() had returned. I need to make sure there are no outstanding address queries when closing.

Links:
OSR IFS FAQ

=============

20150314 - Chip - Mar 14 2015

NetMon has been very stable for the past few days, the Log:BufRead bug seems to have been the guilty party.

I had a new BlueScreen pop up a few minutes after closing NetMonUI:
DRIVER_UNLOADED_WITHOUT_CANCELLING_PENDING_OPERATIONS

I was unable to force WinDbg to load the NetMon symbols, so I have no more information than:
NetMon+2948

The most aggrevating thing about WinDbg is that it seems to be a never-ending headache to load the proper symbols, and it spends so freaking much time loading Microsoft symbols that are usually of little or no interest.

I was able to trace the code, from this slim clue, back to DrvIrpPassThrough(). There must have been an orphaned IRP and NetMon was blamed.

20150313 - Chip - Mar 13 2015

Woot!

I finally found the IRQL_NOT_LESS_THAN BlueScreen bug! It naturally had nothing to do with IRQL. This error code can apparently also be caused by a bad pointer reference, (perhaps) if the memory is valid but paged out. The actual cause was a logic bug in how I was reading log entries when the entry header was split by a buffer wrap-around. Once I rewrote Log:BufRead() I was able to run NetMon with both Net and File logging active while I browsed MSN.com using Internet Explorer 11 (a stress test for anything!).

NetMon is now safe to demo!

I found the bug by loading a crash dump from C:\Windows\MiniDump\ into WinDbg, setting the sympath to the directory containing NetMon.pdb, and reloading NetMon.sys. This then pointed the fickle finger of blame directly at the memcpy() in Log:BufRead(). It only took about five minutes for me to then discover the bug, and another half hour to rewrite the function. This time checking every possible error condition.

This is the original function. The bug is that I assume the LOGHDR will be coherent. But in reality, the buffer wrap-around can happen anywhere, including inside the LOGHDR. This opened the door for a buffer overrun to occur when I try to read pEntry->ByteCt, resulting in a bogus EntrySz and a possible memcpy() beyond the end of pBuf[].

Log.cpp: Broken version int Log::BufRead(LOGHDR *pDst, UINT DstSz) { int Err= ERR_OK; UINT8 *pDst1= (UINT8*)pDst; LOGHDR *pEntry= (LOGHDR*)&pBuf[BufRd]; UINT EntrySz,ChunkSz; DebugA(DBG_FILE,FUNCA "Wr=%d Rd=%d Sz=%d",BufWr,BufRd,BufSz); if(BufCt>=sizeof(LOGHDR)) { Err= EntrySz= pEntry->ByteCt; if(DstSz < EntrySz) { Err= ERR_TOO_SMALL; } else { while(EntrySz) { ChunkSz= (BufRd < BufWr) ? (BufWr-BufRd) : BufSz-BufRd; ChunkSz= min(EntrySz,ChunkSz); if(!ChunkSz || ChunkSz>BufCt) { //NOTE: This should NEVER happen. Err= ERR_CORRUPT; break; } DebugA(DBG_FILE,FUNCA "%X <= pBuf[%d]:%d",pDst1,BufRd,ChunkSz); memcpy(pDst1,&pBuf[BufRd],ChunkSz); pDst1+= ChunkSz; EntrySz-= ChunkSz; BufCt-= ChunkSz; BufRd+= ChunkSz; if(BufRd>=BufSz) BufRd-= BufSz; } } } return(Err); }

Once again, the "Rule of Fast Computing: If it happens once in a million times, it happens every second."

Here is the fixed version. Now I treat the header no differently than the body and only use ByteCt after I have read the entire LOGHDR.

Log.cpp: int Log::BufRead(LOGHDR *pDst, UINT DstSz) { int Err= ERR_OK; UINT8 *pDst1= (UINT8*)pDst; //LOGHDR *pEntry= (LOGHDR*)&pBuf[BufRd]; //Very bad: BlueScreen when pEntry is split by buffer wrap. UINT WriteCt=0; UINT AvailCt,ReadCt,ChunkSz; memset(pDst,0,sizeof(*pDst)); DebugA(DBG_FILE,FUNCA "Wr=%d Rd=%d Sz=%d",BufWr,BufRd,BufSz); if(BufCt < sizeof(LOGHDR)) { //No data available. } else if(DstSz < sizeof(LOGHDR)) { Err= WarnA(ERR_TOO_SMALL,"Log:BufRead: DstSz[%d] is too small[%d].",DstSz,sizeof(LOGHDR)); } else { //NOTE: pBuf wrap-around can happen anywhere, including inside LOGHDR. // pDst->ByteCt is not valid until WriteCt>=sizeof(LOGHDR) while(!IsErr(Err) && (WriteCt < sizeof(LOGHDR) || WriteCt<pDst->ByteCt)) { //AvailCt is bytes availabe in pBuf[] from BufRd to either BufWr or BufSz. AvailCt= (BufRd < BufWr) ? (BufWr-BufRd) : BufSz-BufRd; //ReadCt is the number of bytes remaining to be read in the entry. //If I have not read the entire LOGHDR yet, ReadCt is LOGHDR-WriteCt //Otherwise, ReadCt is EntrySz-WriteCt ReadCt= WriteCt<sizeof(LOGHDR) ? sizeof(LOGHDR)-WriteCt : pDst->ByteCt-WriteCt; //ChunkSz the number of bytes to copy on this loop. //It is limited by both ReadCt and BufAvailCt. ChunkSz= min(ReadCt,AvailCt); //Check for errors before trying to copy memory, or risk bluescreen. if(ChunkSz==0) { //NOTE: Should never happen, but would be a kernelmode inifinite loop. Err= ErrorA(ERR_CORRUPT,"Log:BufRead: ChunkSz=0!"); } else if(ChunkSz > BufCt) { //NOTE: Should never happen, but means an accounting error. Err= ErrorA(ERR_CORRUPT,"Log:BufRead: ChunkSz[%d] > BufCt[%d]",ChunkSz,BufCt); } else if(BufRd+ChunkSz > BufSz) { //NOTE: Should never happen, but would be buffer read overrun, possible bluescreen. Err= ErrorA(ERR_CORRUPT,"Log:BufRead: Buf overrun, BufRd[%d]+ChunkSz[%d] > BufSz[%d]",BufRd,ChunkSz,BufSz); } else { //Make sure I don't overrun the destination buffer. //I think it is better to truncate the destination buffer and complete the read. //Otherwise, I would need to unwind the read and risk clogging the read pipeline. //This means the reader needs to be aware that pDst->ByteCt may be greater than //DstSz, indicating that he has received only a partial entry. The entry has been lost. if(WriteCt < DstSz) { if(WriteCt+ChunkSz < DstSz) memcpy(&pDst1[WriteCt],&pBuf[BufRd],ChunkSz); else memcpy(&pDst1[WriteCt],&pBuf[BufRd],DstSz-WriteCt); } //Always complete the read. WriteCt+= ChunkSz; BufCt-= ChunkSz; BufRd+= ChunkSz; if(BufRd>=BufSz) BufRd= 0; } } } //If error, pDst is invalid if(IsErr(Err)) { pDst->ByteCt= 0; //Errors (other than destination too small) will be persistent, //so I may as well consider the buffer to be lost. if(Err!=ERR_TOO_SMALL) BufReset(); } return(Err); }

Interesting note: While NetMon was running in Visual Studio, devenv.exe was scanning through every source file on my system -- including the Linux projects, projects that did not involve Visual Studio in any way. I'm sure Redmond this this is a convenience, but I think it is illicit snooping.

20150312 - Chip - Mar 12 2015

The NetMon project has been gathering dust for over a year...

I spent some time creating an improved scrolling text control. The standard edit control repaints the window after every update, which brings the system to a crawl when NetMon is spewing out a line of text for every file access on the system. The new version only repaints once every 500ms or in response to user input (scrollbar, text selection). This dramatically improves system performance while NetMon is running. The code is in OsUser/TextCtrl.cpp

I am now displaying the process name, which makes the event display much more useful.

This link describes how to extract the command line from a remote process.

20140111 - Chip - Jan 11 2014

I started working on the control application.

Added the ability to sort list items.
Since I am focusing on debugging the user mode app and the driver seems to be stable now, I am debugging locally in VS12. The requires building and loading NetMon.sys locally also.
I had to import the signing certificate into the ChipAdmin user.
Driver would not load.
WinDbg reported the driver loading, but NetMonUI was unable to open the device file.
This was because DrvEntry defaults to setting NO irp function handlers. I changed this so that if there is no registry setting the mask will default to 0xFFFFFFFF instead.

20140108 - Chip - Jan 08 2014

Added volume drive letter to the reported file names. Still missing the full path for relative files.

I noticed that svchost.exe was opening connections to Zhengzhou China 1.199.19.184. Hmmm...

20140107 - Chip - Jan 07 2014

I am still struggling with the "IE goes away" bug. Part of the problem is that I do not have a Known Good Reference to start from, and the IFS filter is pretty much "all or nothing" -- it is difficult to disable any part without disabling the whole thing.

I rebuilt the CodeProjects.com FsFilter project, hoping it could serve as my KGR. I based my IfsFilter on this project, so the code is very similar. I had to create a VS13 project to build it, but that went quickly since I had created my step-by-step notes. I was able to build, install, and run FsFilter in under an hour.

Good news: FsFilter does not seem to suffer from the same problem. I am able to install FsFilter, then launch IE without any problems. (I did need to decimate the debug output, printing only 1 of every 256 files opened. Otherwise WinDbg slowed the target down to an unbearably slow pace.) So now I have a KGR that works and I can start a careful comparison of FsFilter and NetMon to find the critical difference.

=====

That was quick! I modified FsFilter by zeroing out MajorFunction[IRP_MJ_CLEANUP], and FsFilter immediately demonstrated the same "IE goes away" problem.

SUCCESS! All I had to do was set the IrpMask registry setting to 0xFFFFFFFF and now IE works while NetMon is running! I was worried that the "Crash on exit" bug would reappear, but that must have been cured by one of the other fixes made in the past few days.

The NetMon driver is now feature complete! I can now focus on the NetMonUI, which is much more straight-forward and can be debugged using VS13. I will need to set up User-Mode remote debugging next.

=8^)

20140106 - Chip - Jan 06 2014

Ran into an interesting crash. IRP_MJ_READ where pIrp->AssociatedIrp.SystemBuffer was zero even though pStack->Parameters.Read.Length was 0x1000. The MSDN docs imply this would never happen if Device.Flags has DO_BUFFERED_IO set.

Ahhh... but I forgot to set DO_BUFFERED_IO when creating the Stream device. My bad. Easy catch with WinDbg! :) Hopefully this will fix the Stream code, which always was a bit flaky.

Another crash. This time it blew up in the TCP driver. There are some hints that might point to mutex usage.

I caught a BugCheck in WinDbg. It appears LogWrite() is sometimes being called at IRQL DISPATCH_LEVEL, which is an illegal condition for calling KeWaitForMutexObject() with a timeout. I believe that if I am running at DISPATCH_LEVEL or above, I can assume I am not interruptible (except by hardware events, which have nothing to do with me). Also, I know that all mutexes are cleared before a kernel thread is rescheduled. So... I am now checking KeGetCurrentIrql() and only calling KeWaitForMutexObject() if less than DISPATCH_LEVEL.

=====

I tried several alternatives and variations on my mutex code. Eventually I created a version that bypassed everything if IRQL was >= DISPATCH_LEVEL (2). This version did not crash, but it became immediately apparent that my driver was being called a lot at DISPATCH_LEVEL. It is illegal to do any scheduling or waiting at dispatch level. So I abandoned the mutex and replaced them with spin locks. And everything seems to work. I don't know whether to laugh or cry. Live and learn.

The browser (both IE and FF) is still hosed when I enable the file filter, but hopefully I have finally put a stake through the heart of the Stream and Log problems. (Fingers crossed...)

=====

Today's links:

CodeProject.com Kernel Mutexes
OSROnline.com: Use spinlocks
OSROnline.com: FastIoDispatch
AntiGhost: Filter driver project

20140105 - Chip - Jan 05 2014

After I shut everything down last night, I was doing a little bedtime reading on my Kindle when I came across this cylus.org blog post. It talks about setting up a VM-to-VM WinDbg environment. I made a mental note to investigate in the morning.

This morning I awoke early and tried it out. I had to make a couple changes, but it worked! I now have a completely self-contained development environment in VS12 which is able to debug device drivers running in DbgWin764! Yippee!

Within moments of debugging, NetMon popped into WinDbg with a page fault. Normally, this would be a BSoD with the faulting instruction deep inside NT.dll. But since I had WinDbg with full and up-to-date symbols, I could look at the stack and within seconds uncovered a very nasty problem. Can you see it?

I had assumed that UNICODE_STRING.Buffer[] would be a valid null-terminated string. It is not! I must use and rely on UNICODE_STRING.Length, as Buffer[] is not required to have a Zbyte. In this case, the end of the string ran off into unpaged memory. This caused StrLenW() to fault.

The good news: WinDbg paid for itself within moments. With a debugger, this was a trivial bug to find. Without a debugger, it could have had me scratching my head about random crashes for a very long time.

The bad news: I am embarrassed that I made such a fundamentally wrong assumption about UNICODE_STRING.

Qualified Yippee! I am now able to use the integrated WinDbg in VS2013 to debug NetMon. However, this version of WinDbg runs much, much slower than the command-line version. The target system takes many times longer to boot. Also, the output from DbgPrint() is not being drawn to the debugger immediate window properly so I cannot see the debug output from NetMon. On the other hand, it sure is nice to pop straight to the source code!

=====

Now that I am able to use WinDbg from within the VS12 VM, I can abandon the staging arena on the Host system. I no longer need to make the source code visible to the Host. All I need from the Host is a place to exchange the binaries between VS12 and the debug target.

I made a lot of progress over these two days, fixed several bugs, and put myself in a position to be much more efficient in the future. I have not seen a BSoD since this morning. The last(?) remaining show stopper is the mysterious "IE goes away" bug.

=====

Go 49ers!!

With the game tied and 3 seconds left the 49ers kick the winning field goal, which passes through the arms of the defender. The kick was only milliseconds away from being blocked.

=====

Today's Links:

OSR Forum
Nynaeve.net
Cylus.org

20140104 - Chip - Jan 04 2014

Yesterday was a day of frustration with WinDbg. With renewed vigor I took another stab at it this morning...

A glimmer of success! I was able to connect using WinDbg from the command line on the host (not using VS12). The problem seemed to be a mismatch between the "debugport" entry in the boot manager. First, I used ComPorts inside the target VM to identify the "physical" COM port available to the guest OS. ComPorts reported a single port, COM1. I then used bcdedit from an administrator DOS prompt to set the following entries to match:

X: C:> bcdedit /set {current} debug yes C:> bcdedit /set {current} debugtype serial C:> bcdedit /set {current} debugport 1 C:> bcdedit /set {current} baudrate 115200 C:> bcdedit

In the target VM settings, there should be a single Serial Port device:

Connected: YES
Connect at power on: YES
Use named pipe: \.pipecom_1
This end is the server
The other end is an application
Yield CPU: Yes.

In the Host system (not in a VM), find WinDbg in the WDK and launch it:

X: C:Program Files (x86)Windows Kits8.1Debuggersx64> > windbg -b -k com:pipe,port=\.pipecom_1,resets=0,reconnect

This will sit in the "Waiting to reconnect..." state. Reboot the target, making sure to select the correct boot configuration. If the debug port is correct, the target will pause for about ten seconds after the screen goes black and "Starting Windows" is displayed but before the Windows flag appears. This is the one and only chance WinDbg has to connect to the target. WinDbg will sometimes close itself (crash) at this point, and I will have only a couple of seconds to restart WinDbg to catch the target connection. If the flag appears, I have missed the connection. If all the stars align WinDbg will spew some text, probably complain that the symbols are not correct, and the target will freeze until I release it with the "g" command from WinDbg.

NOTE: VMware seems to move the COM ports around when there are multiple VM's running around. After getting this to work with debugport=1, ComPorts later reported only COM2 available -- while WinDbg was connected to COM1. This may be why it was so incredibly difficult to get the settings right -- I may be chasing a moving target for the configuration. If I am having trouble connecting to the target, the first thing to check is that the COM port reported by ComPorts is the same as the debugport entry in BCD. UPDATE: This may be a red herring; it is not clear whether debugport is actually used.

I tried again to run WinDbg from within the VS12 VM, but it never connected. It appears VM-to-VM connections do not work.

=====

The next task is to load up the proper symbols for NetMon.sys and source code debugging.

WinDbg needs to run on the host machine and it needs access to the symbols and source files for the project. I don't want to create a complete development environment on the Host, I want to use a single, portable environment in VS12. I can't share folders out of a VM without creating a network share, which involves the domain controller (which may not always be accessible).

My solution is to share the source code directory on the host into VS12 and run Visual Studio in the VS12 VM, loading the project from Z:NetMonNetMon.sln, which is physically located on the Host system. I compile everything inside VS12, run WinDbg from the Host, and run NetMon from the target VM.

Host directory: C:SrcnlitedHQDevSBChipWin8NetMon
This directory is shared into VS12 as "NetMon".
VS12 directory: Z:NetMon

I copy the binaries to the DbgWin764 VM to the C:NetMon directory and run it from there.

When I break into WinDbg, I can load the NetMon symbols:

X: kd> .sympath C:\Src\nlited\HQ\Dev\SB\Chip\Win8\NetMon\Out\Winx64Debug kd> !reload NetMon.sys

SUCCESS! I now have source code for NetMon.sys! Within five minutes after loading the symbols, I found two printf() bugs.

=====

The log corruption is definitely due to re-entrant calls to to Log. I re-enabled the mutex code, reread the WDK docs, and found a mistake in the parameters to KeWaitForMutex(). Once I corrected this, the log corruption errors went away and the overall performance improved dramatically. NetMon is now much more stable than it was this morning.

=====

I still see a problem where Internet Explorer hangs after some time. NetMon is still running and there are no error messages. Once IE goes away it never comes back, even after closing and restarting the program. It seems there is a deadlock in the Services Host process.

I did see "NetMon|ERR ERR[-3]: Filter::CbClassify: EXCEPTION!". NetMon continued to operate without any apparent problems, so the exception handler did its job. Of course, now I have no idea what caused the exception. Now that I have WinDbg running, it is better to crash than to handle the exception.

=====

I need to log the FastIO operations. Access to files that are cached is not being recorded.

=====

Today was a very productive day!

20140103 - Chip - Jan 03 2014

Once again attempting to use WinDbg on Win764...

Visual Studio 2013 is already installed, but not the WDK. I copied the WDK environment, which allowed me to rebuild the drivers. However, the "Driver" menu item is not present nor the kernel transport debugger.

I am installing the WDK from ColdStore W:DiskDevMSDNVS2013_WDKwdksetup.exe

The WDK 8.1 installed seemed to go smoothly, but installed to the stupid path:
C:Program Files (x86)Windows Kits8.1

DbgWin764 appeared in the "Attach to process" window, but never connected.

I eventually tried to "reprovision" the target, but it stalled at this point:

Then it failed:

After several attempts, the deployment completed without errors, but I was still stuck at "Waiting to reconnect..."

I tried to do a deployment to Cree, a (relatively) virgin target. I logged in as the local Administrator, installed the target setup from ColdStore W:DisksDevMSDNVS2013_WDKInstallersWDK Test Target Setup x64-x64_en-us.msi, then provisioned from VS12.

The setup fails on the "Configuring debuggers (x64)" step. I can watch the target reboot during the installation, so I know network communication is happening. I think the problem is that the x64 debugger is not starting up on the target.

I need to disable the firewall between the Installing .NET and Installing VC redist steps, where it prints "Attempting to connect..."

I am now trying to deploy to a Win8/32bit target...

The installation seemed clean, but the same end result: unable to connect.

20140102 - Chip - Jan 02 2014

I found the cause of the "Still calling IFS filter after unloading" crash. I was decrementing FilterCt twice during the removal process. NetMon now unloads cleanly after installing the IFS filters.

I disabled the WFP filters and IE loaded fine. So the problem is on the WFP side, not on the IFS side. IFS logging seems to be working fine now.

The following sequence works:

Start IE
Close IE
Start NetMon
Start IE
Browse

There is an eventual crash, but it seems that the "IE stalls" problem may possibly be related to IE starting up for the first time. Possibly creating new devices? I am not sure that the previous statement about WFP/IFS responsibility is true, the positive result may have been due to starting IE first and unrelated to disabling the WFP filter.

The combination that seems to work:
Start and close IE first, set registry entries to disable logging and the IFS filter, start NetMon, start IE, browse.

I tried starting NetMon first, then IE, and it worked fine. Logging and IFS are still disabled. I tried with logging enabled, and it again worked fine. So I am right back to where I was before I adding the IFS filter.

I tried with nothing disabled, and everything ran OK for about five minutes, then crashed. The blue screen usually reports the crash in the vicinity of NetMon+A5E6.

This tends to point the fickle finger of blame at the IFS filter.

20131222 - Chip - Dec 22 2013

I found the cause of the "Crash on Exit" bug: using the default handler for IRP_MJ_CLEANUP. If I leave pDriver->MajorFunction[IRP_MJ_CLEANUP] unset (0), everything works fine.

Netmon now mostly works, capturing Network and File access.

Network monitor is working.
File monitor is working.
Missing the volume letter
Still see occasional bad filenames
Some programs seem to be unable to open files
File Explorer, double-click image under doc/pictures/samples. Microsoft Picture Viewer is unable to open the file.
Log.cpp starts complaining about buffer corruption under load.
Internet Explorer stalls while trying to open msn.com
NetMonUI is displaying files using the same format as network.
ProcID is missing
Previous shutdown crash has been fixed.
NetMon does not shutdown completely.
All the hooks seem to be removed OK.
Handles are being left open somewhere.
Unable to delete NetMon.sys or NetMonUI.exe after closing.

20131217 - Chip - Dec 17 2013

Trying to compile NetMon on Hopi. Configuration problems again. The WDK8.1 does not include the kernel mode libraries and headers. I have a WDK80 that I think I copied from ChipPro.

Success! After some WDK fun (see below) and installing the nlited driver signing certificate, I rebuilt NetMon and ran it on DbgWin764. Capturing network and file accesses.

=====

For some reason, I am seeing this error:

X: 1>------ Build started: Project: Driver, Configuration: Release x64 ------ 1>Build started 12/17/2013 1:41:51 PM. 1>ClCompile: 1> Filter.cpp 1>D:\CL\DDK\Windows\80\Include\km\ndis.h(1413): error C2011: '_NDIS_REQUEST_TYPE' : 'enum' type redefinition 1> D:\CL\DDK\Windows\80\Include\shared\ntdd\ndis.h(334) : see declaration of '_NDIS_REQUEST_TYPE' 1> Filter.cpp: Optimizer disabled. 1> ndis.h:1411 #if NDIS_SUPPORT_NDIS6 typedef enum _NDIS_REQUEST_TYPE {

There is some weird discrepency going on here. Hopindis.h is 13950 lines, while ChipProndis.h is 14185 lines. Both are advertised as WDK 8.0 but one works and the other doesn't. This conspires with Microsoft completely fucking up the integration of the DDK into Visual Studio. It is just a bunch of include files and libraries, it doesn't need to be complicated at all!!! Instead, there are hidden registry settings, directories scattered in esoteric locations (Who was the idiot who came up with "C:Program Files (x86)Windows Kits8.1" for fuck's sake!), and all sorts of hidden incompatibilities between versions. The WDK refuses to install or uninstall on Hopi for some unknown reason.

My only recourse is to zip up the WDK directory from a Known Good Reference machine, unzip it on Hopi, and use that. In fact, the WDK should be distributed as a zipfile -- no installer at all!

After unzipping WDK80_Combined.zip to D:CLDDKWindows80 , (almost) everything built. I need to import my nlited signing certificate.

Hopi: 1>------ Build started: Project: Driver, Configuration: Release x64 ------ 1>Build started 12/17/2013 2:09:32 PM. 1>ClCompile: 1> Filter.cpp 1> Note: including file: s:srcnlitedhqdevsbchipwin8netmondriverNtBuild.h 1> Note: including file: D:CLDDKWindows80IncludekmNtDDK.h 1> Note: including file: D:CLDDKWindows80Includekmwdm.h 1> Note: including file: D:CLVS12VCincludeexcpt.h 1> Note: including file: D:CLVS12VCincludecrtdefs.h 1> Note: including file: D:CLDDKWindows80Includesharedsal.h 1> Note: including file: D:CLDDKWindows80IncludesharedConcurrencySal.h 1> Note: including file: D:CLVS12VCincludevadefs.h 1> Note: including file: D:CLDDKWindows80Includesharedntdef.h 1> Note: including file: D:CLVS12VCincludectype.h 1> Note: including file: D:CLVS12VCincludecrtdefs.h 1> Note: including file: D:CLDDKWindows80Includesharedwinapifamily.h 1> Note: including file: D:CLDDKWindows80Includesharedspecstrings.h 1> Note: including file: D:CLDDKWindows80Includesharedspecstrings_strict.h 1> Note: including file: D:CLDDKWindows80Includesharedspecstrings_undef.h 1> Note: including file: D:CLDDKWindows80Includeshareddriverspecs.h 1> Note: including file: d:clddkwindows80includesharedsdv_driverspecs.h 1> Note: including file: D:CLDDKWindows80Includesharedkernelspecs.h 1> Note: including file: D:CLDDKWindows80Includesharedbasetsd.h 1> Note: including file: D:CLDDKWindows80Includesharedguiddef.h 1> Note: including file: D:CLVS12VCincludestring.h 1> Note: including file: D:CLVS12VCincludecrtdefs.h 1> Note: including file: D:CLDDKWindows80Includesharedsdkddkver.h 1> Note: including file: D:CLDDKWindows80Includesharedntstatus.h 1> Note: including file: D:CLDDKWindows80Includesharedbugcodes.h 1> Note: including file: D:CLDDKWindows80Includesharedntiologc.h 1> Note: including file: D:CLDDKWindows80Includekmmce.h 1> Note: including file: D:CLDDKWindows80Includesharedpshpack4.h 1> Note: including file: D:CLDDKWindows80Includesharedpoppack.h 1> Note: including file: D:CLDDKWindows80Includesharedpshpack4.h 1> Note: including file: D:CLDDKWindows80Includesharedpoppack.h 1> Note: including file: D:CLDDKWindows80Includesharedpshpack1.h 1> Note: including file: D:CLDDKWindows80Includesharedpoppack.h 1> Note: including file: D:CLDDKWindows80Includesharedguiddef.h 1> Note: including file: D:CLDDKWindows80Includeshareddpfilter.h 1> Note: including file: D:CLDDKWindows80Includesharedapiset.h 1> Note: including file: D:CLDDKWindows80Includesharedktmtypes.h 1> Note: including file: D:CLDDKWindows80Includesharedevntprov.h 1> Note: including file: D:CLDDKWindows80Includeshareddevpropdef.h 1> Note: including file: D:CLDDKWindows80Includesharedpshpack1.h 1> Note: including file: D:CLDDKWindows80Includesharedpoppack.h 1> Note: including file: D:CLDDKWindows80Includesharedpshpack1.h 1> Note: including file: D:CLDDKWindows80Includesharedpoppack.h 1> Note: including file: D:CLDDKWindows80Includesharedpshpack1.h 1> Note: including file: D:CLDDKWindows80Includesharedpoppack.h 1> Note: including file: D:CLDDKWindows80IncludesharedWinDef.h 1> Note: including file: D:CLDDKWindows80Includesharedminwindef.h 1> Note: including file: D:CLDDKWindows80Includekmndis.h 1> Note: including file: D:CLDDKWindows80Includesharednetevent.h 1> Note: including file: D:CLDDKWindows80Includesharedntddndis.h 1> Note: including file: D:CLDDKWindows80Includesharedifdef.h 1> Note: including file: D:CLDDKWindows80Includesharedipifcons.h 1> Note: including file: D:CLDDKWindows80Includeshareddevpkey.h 1> Note: including file: D:CLDDKWindows80Includeshareddevpropdef.h 1> Note: including file: D:CLDDKWindows80Includesharedpciprop.h 1> Note: including file: D:CLDDKWindows80Includesharedwindot11.h 1> Note: including file: D:CLDDKWindows80IncludesharedWlanTypes.h 1>D:CLDDKWindows80Includekmndis.h(1413): error C2011: '_NDIS_REQUEST_TYPE' : 'enum' type redefinition 1> D:CLDDKWindows80Includesharedntddndis.h(334) : see declaration of '_NDIS_REQUEST_TYPE' 1> Note: including file: D:CLDDKWindows80Includekmxfilter.h 1> Note: including file: D:CLDDKWindows80Includesharedqos.h 1> Note: including file: D:CLDDKWindows80IncludesharedFwpsk.h 1> Note: including file: d:clddkwindows80includesharedfwpstypes.h 1> Note: including file: d:clddkwindows80includesharedrpc.h 1> Note: including file: D:CLDDKWindows80Includesharedpshpack8.h 1> Note: including file: D:CLDDKWindows80Includesharedrpcdce.h 1> Note: including file: D:CLDDKWindows80Includesharedrpcdcep.h 1> Note: including file: D:CLDDKWindows80Includesharedrpcnterr.h 1> Note: including file: D:CLDDKWindows80Includesharedrpcasync.h 1> Note: including file: D:CLDDKWindows80Includesharedpshpack8.h 1> Note: including file: D:CLDDKWindows80Includesharedpoppack.h 1> Note: including file: D:CLDDKWindows80Includesharedpoppack.h 1> Note: including file: d:clddkwindows80includesharedrpcndr.h 1> Note: including file: D:CLDDKWindows80Includesharedpshpack8.h 1> Note: including file: D:CLDDKWindows80Includesharedrpcsal.h 1> Note: including file: D:CLDDKWindows80Includesharedpoppack.h 1> Note: including file: d:clddkwindows80includesharedfwptypes.h 1> Note: including file: d:clddkwindows80includesharedwtypes.h 1> Note: including file: d:clddkwindows80includesharedwtypesbase.h 1> Note: including file: d:clddkwindows80includesharedguiddef.h 1> Note: including file: d:clddkwindows80includesharedws2def.h 1> Note: including file: D:CLDDKWindows80Includesharedinaddr.h 1> Note: including file: D:CLDDKWindows80Includesharedws2ipdef.h 1> Note: including file: D:CLDDKWindows80Includesharedin6addr.h 1> Note: including file: D:CLDDKWindows80Includesharednetioapi.h 1> Note: including file: D:CLDDKWindows80Includesharednldef.h 1> Note: including file: d:clddkwindows80includesharedfwpvi.h 1> Note: including file: D:CLDDKWindows80IncludesharedFwpmk.h 1> Note: including file: d:clddkwindows80includesharedfwpmtypes.h 1> Note: including file: d:clddkwindows80includesharediketypes.h 1> Note: including file: d:clddkwindows80includesharedipsectypes.h 1> Note: including file: d:clddkwindows80includesharedfwpvi.h 1> Note: including file: S:SrcnlitedHQDevSBChipWin8NetMonIncludeStdTypes.h 1> Note: including file: S:SrcnlitedHQDevSBChipWin8NetMonIncludeErrors.h 1> Note: including file: S:SrcnlitedHQDevSBChipWin8NetMonIncludeVerID.h 1> Note: including file: S:SrcnlitedHQDevSBChipWin8NetMonIncludeOS.h 1> Note: including file: s:srcnlitedhqdevsbchipwin8netmonincludeHandles.h 1> Note: including file: S:SrcnlitedHQDevSBChipWin8NetMonIncludeStrUtil.h 1> Note: including file: S:SrcnlitedHQDevSBChipWin8NetMonIncludeNetMon.h 1> Note: including file: s:srcnlitedhqdevsbchipwin8netmondriverGlobals.h 1> Filter.cpp: Optimizer disabled. 1> 1>Build FAILED. 1> 1>Time Elapsed 00:00:00.27 ========== Build: 0 succeeded, 1 failed, 0 up-to-date, 0 skipped ==========

ChipPro: 1>------ Build started: Project: Driver, Configuration: Release x64 ------ 1> Filter.cpp 1> Note: including file: c:srcnlitedhqdevsbchipwin8netmondriverNtBuild.h 1> Note: including file: C:CLWinDDK80IncludekmNtDDK.h 1> Note: including file: C:CLWinDDK80Includekmwdm.h 1> Note: including file: C:CLVS12VCincludeexcpt.h 1> Note: including file: C:CLVS12VCincludecrtdefs.h 1> Note: including file: C:CLWinDDK80Includesharedsal.h 1> Note: including file: C:CLWinDDK80IncludesharedConcurrencySal.h 1> Note: including file: C:CLVS12VCincludevadefs.h 1> Note: including file: C:CLWinDDK80Includesharedntdef.h 1> Note: including file: C:CLVS12VCincludectype.h 1> Note: including file: C:CLVS12VCincludecrtdefs.h 1> Note: including file: C:CLWinDDK80Includesharedwinapifamily.h 1> Note: including file: C:CLWinDDK80Includesharedspecstrings.h 1> Note: including file: C:CLWinDDK80Includesharedspecstrings_strict.h 1> Note: including file: C:CLWinDDK80Includesharedspecstrings_undef.h 1> Note: including file: C:CLWinDDK80Includeshareddriverspecs.h 1> Note: including file: c:clwinddk80includesharedsdv_driverspecs.h 1> Note: including file: C:CLWinDDK80Includesharedkernelspecs.h 1> Note: including file: C:CLWinDDK80Includesharedbasetsd.h 1> Note: including file: C:CLWinDDK80Includesharedguiddef.h 1> Note: including file: C:CLVS12VCincludestring.h 1> Note: including file: C:CLVS12VCincludecrtdefs.h 1> Note: including file: C:CLWinDDK80Includesharedsdkddkver.h 1> Note: including file: C:CLWinDDK80Includesharedntstatus.h 1> Note: including file: C:CLWinDDK80Includesharedbugcodes.h 1> Note: including file: C:CLWinDDK80Includesharedntiologc.h 1> Note: including file: C:CLWinDDK80Includekmmce.h 1> Note: including file: C:CLWinDDK80Includesharedpshpack4.h 1> Note: including file: C:CLWinDDK80Includesharedpoppack.h 1> Note: including file: C:CLWinDDK80Includesharedpshpack4.h 1> Note: including file: C:CLWinDDK80Includesharedpoppack.h 1> Note: including file: C:CLWinDDK80Includesharedpshpack1.h 1> Note: including file: C:CLWinDDK80Includesharedpoppack.h 1> Note: including file: C:CLWinDDK80Includesharedguiddef.h 1> Note: including file: C:CLWinDDK80Includeshareddpfilter.h 1> Note: including file: C:CLWinDDK80Includesharedapiset.h 1> Note: including file: C:CLWinDDK80Includesharedktmtypes.h 1> Note: including file: C:CLWinDDK80Includesharedevntprov.h 1> Note: including file: C:CLWinDDK80Includeshareddevpropdef.h 1> Note: including file: C:CLWinDDK80Includesharedpshpack1.h 1> Note: including file: C:CLWinDDK80Includesharedpoppack.h 1> Note: including file: C:CLWinDDK80Includesharedpshpack1.h 1> Note: including file: C:CLWinDDK80Includesharedpoppack.h 1> Note: including file: C:CLWinDDK80Includesharedpshpack1.h 1> Note: including file: C:CLWinDDK80Includesharedpoppack.h 1> Note: including file: C:CLWinDDK80IncludesharedWinDef.h 1> Note: including file: C:CLWinDDK80Includesharedminwindef.h 1> Note: including file: C:CLWinDDK80Includekmndis.h 1> Note: including file: C:CLWinDDK80Includesharednetevent.h 1> Note: including file: C:CLWinDDK80Includesharedntddndis.h 1> Note: including file: C:CLWinDDK80Includesharedifdef.h 1> Note: including file: C:CLWinDDK80Includesharedipifcons.h 1> Note: including file: C:CLWinDDK80Includeshareddevpkey.h 1> Note: including file: C:CLWinDDK80Includeshareddevpropdef.h 1> Note: including file: C:CLWinDDK80Includesharedpciprop.h 1> Note: including file: C:CLWinDDK80Includesharedwindot11.h 1> Note: including file: C:CLWinDDK80IncludesharedWlanTypes.h 1> Note: including file: C:CLWinDDK80Includekmxfilter.h 1> Note: including file: C:CLWinDDK80Includesharedqos.h 1> Note: including file: C:CLWinDDK80IncludekmFwpsk.h 1> Note: including file: C:CLWinDDK80Includesharedfwpstypes.h 1> Note: including file: c:clwinddk80includesharedrpc.h 1> Note: including file: C:CLWinDDK80Includesharedpshpack8.h 1> Note: including file: C:CLWinDDK80Includesharedrpcdce.h 1> Note: including file: C:CLWinDDK80Includesharedrpcdcep.h 1> Note: including file: C:CLWinDDK80Includesharedrpcnterr.h 1> Note: including file: C:CLWinDDK80Includesharedrpcasync.h 1> Note: including file: C:CLWinDDK80Includesharedpshpack8.h 1> Note: including file: C:CLWinDDK80Includesharedpoppack.h 1> Note: including file: C:CLWinDDK80Includesharedpoppack.h 1> Note: including file: c:clwinddk80includesharedrpcndr.h 1> Note: including file: C:CLWinDDK80Includesharedpshpack8.h 1> Note: including file: C:CLWinDDK80Includesharedrpcsal.h 1> Note: including file: C:CLWinDDK80Includesharedpoppack.h 1> Note: including file: c:clwinddk80includesharedfwptypes.h 1> Note: including file: c:clwinddk80includesharedwtypes.h 1> Note: including file: c:clwinddk80includesharedwtypesbase.h 1> Note: including file: c:clwinddk80includesharedguiddef.h 1> Note: including file: C:CLWinDDK80Includesharedws2def.h 1> Note: including file: C:CLWinDDK80Includesharedinaddr.h 1> Note: including file: C:CLWinDDK80Includesharedws2ipdef.h 1> Note: including file: C:CLWinDDK80Includesharedin6addr.h 1> Note: including file: C:CLWinDDK80Includesharednetioapi.h 1> Note: including file: C:CLWinDDK80Includesharednldef.h 1> Note: including file: C:CLWinDDK80Includesharedfwpvi.h 1> Note: including file: C:CLWinDDK80IncludekmFwpmk.h 1> Note: including file: C:CLWinDDK80Includesharedfwpmtypes.h 1> Note: including file: c:clwinddk80includesharediketypes.h 1> Note: including file: c:clwinddk80includesharedipsectypes.h 1> Note: including file: C:CLWinDDK80Includesharedfwpvi.h 1> Note: including file: C:SrcnlitedHQDevSBChipWin8NetMonIncludeStdTypes.h 1> Note: including file: C:SrcnlitedHQDevSBChipWin8NetMonIncludeErrors.h 1> Note: including file: C:SrcnlitedHQDevSBChipWin8NetMonIncludeVerID.h 1> Note: including file: C:SrcnlitedHQDevSBChipWin8NetMonIncludeOS.h 1> Note: including file: c:srcnlitedhqdevsbchipwin8netmonincludeHandles.h 1> Note: including file: C:SrcnlitedHQDevSBChipWin8NetMonIncludeStrUtil.h 1> Note: including file: C:SrcnlitedHQDevSBChipWin8NetMonIncludeNetMon.h 1> Note: including file: c:srcnlitedhqdevsbchipwin8netmondriverGlobals.h 1> Filter.cpp: Optimizer disabled. ========== Build: 1 succeeded, 0 failed, 0 up-to-date, 0 skipped ==========

20131216 - Chip - Dec 16 2013

I am working on KOFY today. I rebuilt the NetMon project and noticed that the GeoDB project failed to link, with "can't find OsUser.lib". I had noticed this before, but since I don't need to run GeoDB I ignored it. Today I decided to figure out what was going on. This led to spending about an hour unwinding the build configuration and WDK compiler constants...

The GeoDB problem was because somehow the configuration manager had mixed x32 and x64 builds together. The configuration manager should look like this:

After I straightened this out, I did a batch build for everything. This failed on the 32bit version of the driver, which led to sorting out the compiler constants...

The compiler constants are a tricky thing to get right when building Windows device drivers. This is where the target version of Windows and the WDK are specified, and must be set correctly for both the build environment and the target system. The "normal" method is to load these constants into the command line options for the project ( Driver > Properties > C/C++ > Preprocessor > Definitions ). This can lead to very long strings that differ only slightly between configurations, but since they are stored as strings any difference causes the whole sequence to be stored separately.

My 32bit build problem was caused by a set of constants being repeated in the project properties page. The first set, which was visible in the edit box, set the target OS correctly. The duplicate set, which was scrolled way off to the right, reset the target OS incorrectly. This was very easy to overlook and very difficult to manage since it is buried deep in the project configuration XML files. The 32bit debug settings were:

KERNEL;_WINDOWS;DBGOUT;_DEBUG;NTDDI_VERSION=0x06010000; STD_CALL;CONDITIONAL_HANDLING;NT_UP;NT_INST=0;WIN32=100;_NT1X_=100; WINNT;_WIN32_WINNT=0x0601;WIN32_LEAN_AND_MEAN;DEVL;FPO;_IDWBUILD; _X86_;i686;STD_CALL;CONDITIONAL_HANDLING;NT_UP;NT_INST=0;WIN32=100; _NT1X_=100;WINNT;_WIN32_WINNT=0x0600;WIN32_LEAN_AND_MEAN;DEVL;FPO; _IDWBUILD

A much better method is to move all those constants to a normal header file (NtBuild.h) and manage it just like any other source file. Then the project settings simplify down to Debug/Release and 32bit/64bit.

NtBuild.h: /*************************************************************************/ /** NtBuild.h: Defines WDK constants for selecting the target **/ /** OS and machine. This file is automatically included **/ /** first by Driver > Properties > Advanced > Force include **/ /** (C)2013 nlited systems inc, Chip Doran **/ /*************************************************************************/ #ifndef __NTBUILD_H__ #define __NTBUILD_H__ 0x0101 #define _WIN32_WINNT 0x0601 //Windows7 #define NTDDI_VERSION 0x06010000 //Windows7 #define KERNEL 1 #define _WINDOWS 1 #define STD_CALL 1 #define CONDITIONAL_HANDLING 1 #define NT_UP 1 #define NT_INST 0 #define WIN32 100 #define _NT1X_ 100 #define WINNT 1 #define WIN32_LEAN_AND_MEAN 1 #define DEVL 1 #define FPO 1 #define _IDWBUILD 1 #define NDIS_SUPPORT_NDIS6 1 #define i686 1 #ifdef _DEBUG #define DBGOUT 1 #elif defined(_RELEASE) #else #error("Must define either _DEBUG or _RELEASE !") #endif #ifdef _X86_ #elif defined(_AMD64_) #else #error("Must define either _X86_ or _AMD64_ !") #endif #endif

The key to making this work is to force NtBuild.h to be included first into every source file. This is set in Driver > Properties > C/C++ > Advanced > Forced include File.

20131215 - Chip - Dec 15 2013

I have finally resumed adding the Installable File System filter, after a bit of winding road through converting the driver to DEVICE_OBJECT - centric and fixing the mysterious "Crash on Exit" bug.

I am copying the example from CodeProject.com to start.

SUCCESS! It took about 3 hours to get it working right, coexisting with all the other devices in the driver. I am now able to see all the file opens. Crashes on exit; this is because I had to enable the default IRP handler for all functions for the IfsFilter.

CRASH! The driver would always crash within a few seconds. I noticed this note on the CodeProject page: "A file system filter driver must have the fast-IO dispatch table. If you’ve forgot to set up the fast-IO dispatch table, it will lead to system crash.".

Fortunately, the FastIO.c code dropped into place mostly untouched. Plus, I would still have crashes after a few seconds. The FastIo code as-is assumed the device was created by the IfsFilter. In my project there are four distinct device types all handled by the same driver, and all share the same IRP handler and FastIO vectors. This is a design error with NT, the IRP and FastIO handlers should be a feature of the Device, not the Driver objects. So I had to add some boilerplate to each FastIo function to first determine whether the device was an IfsFilter device before passing it on.

In order to pass all the IFS functions through, I had to switch back to using a default IRP handler for all functions. This rewoke the "Crash on Exit" bug.

Once I did this, NetMon was stable. All this took until about 5pm, seven hours after beginning to add the IFS code.

20131214 - Chip - Dec 14 2013

NetMon.sys now creates three distinct device objects, Control, Stream, and IfsFilter. The original simple approach of creating the DEVICE_OBJECT as part of the DRIVER_OBJECT was causing problems. So I moved all the device code into the individual classes. Turns out the device management code isn't as complex as it originally seemed to be. So now each object (Control, Stream, IfFilter) creates its own DEVICE_OBJECT within its own context.

This was a rewrite of a significant portion of the NetMon.sys driver. I started around 9:30am and had the first draft done by 1pm. Another 45 minutes for the first complete compile and link. Another 2 hours to get it running again. (It crashes on exit.)

I spent about 3 hours trying to find the "Crash on Exit" bug. I found some problems with inappropriate calls to constructors and destructors; because the memory for the device objects is being allocated inside IoCreateDevice(), calling the class constructors and destructors results in a double-alloc or double-free. Even after fixing these, the driver still crashes on exit. I have narrowed it down to CloseHandle() on the Control file, but I cannot see the bug.

FIXED: The problem was setting IoCreateDevice(DeviceType) to a made-up value (0xCD10). I set it to FILE_DEVICE_UNKNOWN and the driver unloaded without any problems. Apparently 0xCD11 just happened to work, and when I changed it to 0xCD10 it crashes.

UNFIXED: Still crashes. DeviceType has nothing to do with it.

REFIXED: I removed the code that assigned a default IRP handler to every possible code, which was a "helpful hint" from the MSDN docs. I am now just assigning CREATE, CLOSE, DEVICE_CONTROL, and READ -- which is the way I originally wrote it. Then the driver unloads properly. I suppose there is a code that is being sent before or after the CLOSE that the default handler is screwing up. Now that I know what is causing the crash I can log all the unhandled IRPs to find which is causing problems. This bug took about six hours to find.

20131212 - Chip - Dec 12 2013

I don't like the idea of using an INF, so I am trying to create an IFS Filter rather than an IFS MiniFilter.

I am working on Kofy today, which introduces another round of "Find the WDK". On Kofy, the WDK81 libraries are located at D:CLWDK81kmx64 . There is no WHL directory, so I had to change (again) the library search path.

Problem: I can't find a attachment point that covers the entire file system; IFS Filters attach to volumes, not systems. This means I need to either enumerate all the volumes, create a device object for each, then attach. Otherwise, I can follow the samples and wait for volume registration callbacks and attach then. Either way, I seems I need to create a separate device object for each volume. This means a rewrite of NetMon.sys that will touch nearly every part, as I will need to create the device objects from a usage perspective, not a driver perspective.

The alternative is to go back to the MiniFilter and resign myself to using an INF file.

20131211 - Chip - Dec 11 2013

The good news: I have not seen a BSoD since the last build.

I started adding a file operation filter to NetMon. It compiles, links, and runs, but fails to install. At least it did not crash. I believe there is something wrong with my REGISTRATION structure, probably in the Contexts list.

I downloaded the MSDN MiniSpy example. I will need to study it for a bit.

The problem is that the registry entries for the minifilter have not been created. The Microsoft Way uses an INF file to install the driver, which also creates the registry entries. I want to programatically install the driver, which MS is declaring taboo. I believe this is part of their strategy to lock down the installation of malware drivers by forcing the installation through the INF parser.

It looks like I will need to install (at least once) with an INF. So tomorrow's task will be to adapt the INF from the MiniSpy project.

Links:
OSR thread
File systems blog
Minifilter project
EaseFilter.com
Tech Kaizen, good list of links

20131205 - Chip - Dec 05 2013

Another stab at the BSoD...

I read this in the MSDN: Using Automatic Synchronization

Almost all of the code in a framework-based driver resides in event callback functions. The framework automatically synchronizes most of a driver's callback functions, as follows:

The framework always synchronizes general device object, functional device object (FDO), and physical device object (PDO) event callback functions with each other so that only one of the callback functions (except EvtDeviceSurpriseRemoval, EvtDeviceQueryRemove, and EvtDeviceQueryStop) can be called at a time for each device. These callback functions support Plug and Play (PnP) and power management events and are called at IRQL = PASSIVE_LEVEL.

This made me think of two things:

I may not need to synchronize at all.
The BSoD may be caused by stalling from within a callback that is already within a kernel lock. Instead of stalling on a mutex, I need to schedule a DPC.
UPDATE 20140105: Removing the mutex introduced log buffer corruption. The code was definitely being re-entered on multiple threads. The problem was confusion (on my part) about the parameters to KeWaitForMutex(). This was fixed on 20140104.

I commented out the mutex calls, and it seems to be running... I left it running overnight, and it is still runnng the next morning.

BSoD: NetMon.sys PAGE_FAULT_IN_NONPAGED_AREA. So sad. This was after running all night and interactively this morning for almost an hour. On the bright side, this is probably a bad (or misused) pointer or buffer overrun. It is (probably) not a synchronization problem.

Finally, a dump with a clue!

X: MODULE_NAME: NetMon FAULTING_MODULE: fffff80002a1f000 nt DEBUG_FLR_IMAGE_TIMESTAMP: 52a168cd WRITE_ADDRESS: unable to get nt!MmSpecialPoolStart unable to get nt!MmSpecialPoolEnd unable to get nt!MmPagedPoolEnd unable to get nt!MmNonPagedPoolStart unable to get nt!MmSizeOfNonPagedPoolInBytes fffffa8103c8b050 FAULTING_IP: NetMon+59a6 fffff880`03ded9a6 f3a4 rep movs byte ptr [rdi],byte ptr [rsi] MM_INTERNAL_CODE: 5 CUSTOMER_CRASH_COUNT: 1 DEFAULT_BUCKET_ID: WIN7_DRIVER_FAULT BUGCHECK_STR: 0x50 CURRENT_IRQL: 0 ANALYSIS_VERSION: 6.3.9600.16384 (debuggers(dbg).130821-1623) amd64fre LAST_CONTROL_TRANSFER: from fffff80002b115b3 to fffff80002a94bc0 STACK_TEXT: fffff880`05fbd158 fffff800`02b115b3 : 00000000`00000050 fffffa81`03c8b050 00000000`00000001 fffff880`05fbd2c0 : nt+0x75bc0 fffff880`05fbd160 00000000`00000050 : fffffa81`03c8b050 00000000`00000001 fffff880`05fbd2c0 00000000`00000005 : nt+0xf25b3 fffff880`05fbd168 fffffa81`03c8b050 : 00000000`00000001 fffff880`05fbd2c0 00000000`00000005 00000000`00000000 : 0x50 fffff880`05fbd170 00000000`00000001 : fffff880`05fbd2c0 00000000`00000005 00000000`00000000 00000000`00000040 : 0xfffffa81`03c8b050 fffff880`05fbd178 fffff880`05fbd2c0 : 00000000`00000005 00000000`00000000 00000000`00000040 00000000`00000000 : 0x1 fffff880`05fbd180 00000000`00000005 : 00000000`00000000 00000000`00000040 00000000`00000000 00000000`00000000 : 0xfffff880`05fbd2c0 fffff880`05fbd188 00000000`00000000 : 00000000`00000040 00000000`00000000 00000000`00000000 00000000`00000000 : 0x5

Pull up the NetMon.map and look up $59A6 in the Rva+Base column (not Address):

X: 0001:00004870 ?BufUnlock@Log@@AEAAXXZ 0000000140005870 f Log.obj 0001:00004880 ?BufWrite@Log@@AEAAHPEAULogHdr_s@@I@Z 0000000140005880 f Log.obj 0001:00004a30 ?Create2@Log@@AEAAHI@Z 0000000140005a30 f Log.obj

NetMon+59a6 is ... Log::BufWrite() [5880+0126]. Now pull up DriverOutWinx64DebugLog.cod and go to Log::BufWrite+0126:

X: ; 301 : memcpy(&pBuf[BufWr],pSrc1,ChunkSz); 00102 8b 04 24 mov eax, DWORD PTR ChunkSz$[rsp] 00105 48 8b 4c 24 40 mov rcx, QWORD PTR this$[rsp] 0010a 8b 49 50 mov ecx, DWORD PTR [rcx+80] 0010d 48 8b 54 24 40 mov rdx, QWORD PTR this$[rsp] 00112 48 8b 52 60 mov rdx, QWORD PTR [rdx+96] 00116 48 03 d1 add rdx, rcx 00119 48 8b ca mov rcx, rdx 0011c 48 8b f9 mov rdi, rcx 0011f 48 8b 74 24 18 mov rsi, QWORD PTR pSrc1$[rsp] 00124 8b c8 mov ecx, eax 00126 f3 a4 rep movsb ; 302 : pSrc1+= ChunkSz;

Aha! There is my offending line! So now I know where it crashed but not why.

The bugcheck code is:

Code	0x50	PAGE_FAULT_IN_NONPAGED_AREA
Referenced	0xFFFFF81`03C8B050	pSrc
Mode	1	Write
IP	0xFFFF880`03DED9A6	Log::BufWrite+0126

Since `3C8B050 appears in the stack twice, I am assuming this is pSrc. Since it is also the referenced bad address, I am assuming it is the bad pointer. Since a '1' appears in the stack I am assuming it is the Type argument to Log:WriteItem(). The only function that calls WriteItem(Type=LOG_TEXTA) is Log::WriteA(). WriteA() passes pSrc as the buffer returned by TmpBufGet(). Since pSrc is nonzero, it can only be a buffer overrun of TmpBuf::pBuf[]. Probably by a single byte.

Just as a quick hack, I added some padding to the byte count requested in TmpBuf::Get(). I ran this for a few minutes on my naked ChipPro system and it did not crash. I need to let it soak for a while. I can believe this is the source of the crash, because it would happen very rarely (when the end of the requested buffer exactly matches the end of TmpBuf::pBuf[] AND StrFormatA() fills the buffer. But this does not explain why the rep movsb apparently failed on the first byte. (But I don't know the value of ECX, so it could have actually been the last byte? I don't think so, the CPU exception catches the actual address, not the value of the registers.) Still a bit of a mystery...

UPDATE 20140105: This crash was most likely due to the UNICODE_STRING bug.

=====

SXS debug spam

Following up on a hint about SXS spewing debug crap... I created zipfiles containing the sxs dll's from retail installations. I will try replacing the dll's on WinDebug. The zipfile includes a .bat file to copy the files. PendMoves is the SysInternals utility to schedule file moves during reboot. I need to be able to debug my drivers!

SXS7.zip (Windows7 64bit): [FILE 7032]
SXS8.zip (Windows8 64bit): Missing
PendMoves.zip: [FILE 7031]

20131129 - Chip - Nov 29 2013

Chasing the BSoD...

It seems to be triggered by calls to StreamWrite().

My OsMutexAcquire() seems to blow up with "driver attempted a wait from a DPC."

Stream capture is not enabled until a connection is selected in NetMonUI and the command "capture in out mask 255.0.0.0". A debug line in CbStream4() causes all streams to be then captured.

If I comment out the BufRead code I am able to capture streams without any problems. I suspect the crash is happening when there is a collision on the mutex.

Next attempt: Replace the mutex with a spin lock.

I think I may have it working; I have recorded over 100MB and it is still going.... Crash! After recording 148MB. The fix was to use KeInitializeMutex(), KeWaitForMutexObject(), KeReleaseMutex().

I made similar changes to Log.cpp and was able to record 120MB before a BSoD. This bsod was unusual, there was a brief bluescreen message saying "Hardware failure" before the standard bluescreen. I am beginning to wonder if there may be some system instability in the VM when I am thrashing both the network and disk constantly. I will need to run it outside a VM.

Well... that didn't go as expected. :( When I ran NetMon on ChipPro/Win81 Firefox could no longer decode any pages. Somehow I am mangling the data?
FIXED: I needed to not set the actionType and rights unless I want to block the data. Hmmm... This tells me that some other filter is modifying the data. NetMon is now working on Windows 8 (8.1).

X: int Stream::Write(const ADDR *pAddrSrc, const ADDR *pAddrDst, const UINT8 *pSrc, UINT SrcSz) { int Err2,Err= ERR_OK; struct StreamHdr_s Hdr; NTSTATUS Status; LARGE_INTEGER TimeOut; if(IsOpen) { DebugA(DBG_STRM_WR,FUNCA "[%X:%d]",pSrc,SrcSz); RtlZeroMemory(&Hdr,sizeof(Hdr)); Hdr.Signature= SIGNATURE_STREAM_SEGMENT; Hdr.ByteCt= sizeof(Hdr)+SrcSz; KeQuerySystemTime((LARGE_INTEGER*)&Hdr.Time); if(pAddrSrc) Hdr.Src= *pAddrSrc; if(pAddrDst) Hdr.Dst= *pAddrDst; TimeOut.QuadPart= -MSEC(500); Status= KeWaitForMutexObject(&BufMtx,Executive,KernelMode,0,&TimeOut); if(NT_SUCCESS(Status)) { if(!IsErr(Err= BufWrite((UINT8*)&Hdr,sizeof(Hdr)))) { if(IsErr(Err2= BufWrite(pSrc,SrcSz))) { Err= Err2; } else { Err+= Err2; } } KeReleaseMutex(&BufMtx,0); } if(IsErr(Err)) { Err= WarnA(Err,FUNCA "Failed."); } else if(Err>0) { DebugA(DBG_STRM,FUNCA "Sz=%d Rd=%d Wr=%d",BufSz,BufRd,BufWr); } } return(Err); } int Stream::Read(UINT8 *pDst, UINT DstSz) { int Err= ERR_OK; NTSTATUS Status; LARGE_INTEGER TimeOut; DebugA(DBG_STRM_RD,FUNCA "pDst[%X:%d] BufUsed=%d",pDst,DstSz,BufUsed()); TimeOut.QuadPart= -MSEC(500); Status= KeWaitForMutexObject(&BufMtx,UserRequest,KernelMode,0,&TimeOut); if(NT_SUCCESS(Status)) { Err= BufRead(pDst,DstSz); KeReleaseMutex(&BufMtx,0); } if(Err>0) { DebugA(DBG_STRM,FUNCA "Sz=%d Rd=%d Wr=%d",BufSz,BufRd,BufWr); } DebugA(DBG_STRM_RD,FUNCA "%d",Err); return(Err); }

Wingsuit

20131127 - Chip - Nov 27 2013

NetMon Running on Windows 7 Use FWPM transactions: FwpmTransactionBegin0() FwpmTransactionCommit0() Reordered FwpmCalloutAdd and FwpsCalloutRegister() FwpmCalloutAdd0() first to create the callout. FwpsCalloutRegister0() to enable callout. Flags=0 (FWPM_CALLOUT_FLAG_REGISTERED fails) calloutKey uses same GUID as FwpmCalloutAdd0()

I found some WFP documents on the wild and wooly web that helped in making NetMon run on Windows 7.

Jonathan Levin
MSDN Forums
Komodia.com
CodeMachine.com
GetCodeSamples.com : Nice collection and organization of the WDK samples.
Stream inspection and modification: NetMon: OOB_EDIT
Support functions: NetMon Notes: Stream CallOut

NetMon now runs on Windows 7.

Trying to install VS2013 remote kernel debugger tools on Win764D. I seem to still be stuck at this screen:

The installer never connected. I clicked Cancel.

VS > Debug > Attach to process > Kernel Transport > WinDbg764. Reboot WinDbg. VS then connects, but now WinDbg764 is running so slow that it is impossible to use. Kernel debugging is horribly broken.

20131124 - Chip - Nov 24 2013

It has been another frustrating day with VS13 and Windows81. I am working on ChipPro2 (Surface Pro) trying to debug NetMon running on VMware/Windows81/64bit/Debug. I resurrected the VM that had been thoroughly hosed by attempting to install the VS13 remote debugging tools. (Won't boot with a BSoD INVALID_BOOT_DEVICE).

I was able to bring the VM back to life using the same product key by performing a "Reset" from the emergency boot mode. This is essentially a new installation. However, every 10 seconds the system locks up for about 40 seconds. This is with nothing running, just boot and go to the desktop. I suspect, but do not know for sure, that this is something to do with being a debug build. My next step will be to create a completely new VM, this time using the Release Windows 8.1 installer.

I eventually gave up on W864D and reverted back to my Win832 (32bit) VM. The first time everything worked fine. The second time, the driver silently refused to load. The only clue was a Windows/System event that reported an error, "Unable to find file." I flailed around a while before I finally tried copying NetMon.sys to C:WindowsSystem32NetMon.sys and invoked NetMonUI --driver C:WindowsSystem32NetMon.sys and voila! the driver loaded.

I am trying to find and fix the BSoD that seems to always eventually occur. I added a try/except block around the Classify handler that should output "EXCEPTION" to the debugger if it triggers. I have been using IE10 on Win832 for over twenty minutes witout any problem. After about 30 minutes a BSoD IRQL_NOT_LESS_OR_EQUAL.

I added IrqlChk() calls around the Device file Open, Ioctl, and Read handlers. There are a small number of entry points to NetMon.sys, and I am running out of ideas about where the crash is occurring. This version ran over 5200 seconds before crashing. I started capturing to a file (CAPTURE IN OUT MASK 255.0.0.0) about 10 minutes before the crash. Note that the error changes, this time it is ATTEMPTED_SWITCH_FROM_DPC.

BUG: The IpHelper resolver seems to get stuck after a while. PendingCt ratchets up to the limit of 100.

20131008 - Chip - Oct 10 2013

Trying to rebuild on Win832 using VS12.
VS13 set the toolset to v12.0; VS12 is toolset v11.0.
I set the toolset version in the project configurations to v11.0 (VisualStudio 2012)
VS12 still complained about v12.0.
The top line of the .vcxproj file contains the text (which is not changed by setting the toolset version in the configuration dialog): <Project DefaultTargets="Build" ToolsVersion="12.0" xmlns="http://schemas.microsoft.com/developer/msbuild/2003">
I editted the .vcxproj files to set 'ToolsVersion="4.0"' (VisualStudio 2012)
The solution then build using VS2012 on W832.
VS13 then complained about not knowing how to use toolset v11.0, and suggested upgrading the projects.
So it appears Microsoft has decided to make VS12 and VS13 incompatible for no good (apparent) reason, once again forcing everyone another step down Microsoft Way.
If I want to use VS13 I will need to upgrade across the board to VS13.
Do not uninstall VS12, leave it in place. This will use an additional 4-10GB of disk space.
The attractions of VS13:
Built-in remote Driver debug tools
I am having trouble getting this to work.
Microsoft Store support
Windows Phone support
The disadvantages of VS13:
Yet another VisualStudio to configure and manage.
Another 10GB of disk space consumed.
W832 (Windows 8.0, 32bit Retail) consumed 24GB of disk space on C:
Minimal VS2012 installation (no help files)
Basic utilities (Perl, P4, Epsilon)
Builds for the NetMon project. (250MB)
I would budget 30GB for any Windows installation
This portends trouble for Win8164D (20GB)...
Notes on configuring Safe Mode (no longer enabled by default!) http://www.eightforums.com/tutorials/2758-safe-mode-add-windows-boot-manager-windows-8-a.html
Windows 8.1 now supports UEFI by default, meaning that F8 no longer works as an emergency option unless previously configured. So do it immediately after installing the OS!
From elevated DOS box:
> bcdedit /enum /v > Copy the identifier for the current boot configuration > bcdedit /c /d "Windows 8.1 Safe Mode"
Win-R / msconfig.exe
Boot tab
Select "Windows 8.1 Safe Mode"
Check Safe Boot, Minimal, No GUI boot
Enter Timeout 10 seconds
Check "Make all boot settings permanent"
Click OK, click Yes
Provisioning WIN8164D for Driver debugging using VS13.
Set VMware network to Bridged
Verify that WIN8164D is visible in Network explorer
Authenticate as WIN8164DSetup
Copying driver test automation files will take a *long* time
Absolutely no indication of progress, how long it will take, or whether the copy is actually happening. *Very* frustrating.
Copying from host to a VM over virtual Ethernet to an SSD drive over USB3.0. In other words, the fastest data transfer rate possible.
(4:28) At least 40 minutes so far...
After 45 minutes the copy fails!
Trying again.
This time the file copy takes 2 minutes
WIN8164D reboots, acts like a new user install.
This means a *long* time staring at RGB shifts with the ever-so-helpful text
You can get new apps at the Store
And my all time favorite, "Taking care of a few things"
+5 minutes, VS13 shows "Installing VC Redist"
+6 minutes, Target pops a series of "Installing..." progress dialogs. Each takes no more than 10 seconds, but there are several (5?).
+8 minutes, target reboots. (17:40)
My boot configurations are gone, no choice for boot.
Reboot takes a unusually long time
Again with zero feedback, just a black screen.
VS13 prints "Attempting to connect..." every few minutes.
After ten minutes of black screen and no disk activity, I am pulling the plug on the Target.
17:52: I reset the Target, it boots into "Automatic Repair..."
17:53: "Diagnosing your PC" No option to abort
Windows 8.1 appears to be the "Windows for Idiots" version.
17:54: "Windows couldn't load correctly"
Only options are System Restore and Cancel.
I am sure Cancel will put me right back here after wasting 4 minutes.
I am sure Restore will undo the past hour's work and I will be right back here an hour from now.
I click cancel
Black Flag screen
Back to Automatic Repair Epic Fail screen, but this time the choices are "Shutdown" and "Advanced Options"
Advanced Options > Use another operating system brings up my boot configurations
I select "Windows 8.1 Original"
18:05: Black Flag with Twirls
18:06: Twirls freeze, no disk activity
16:07: BSOD: 0xFFFFD000207734F0, 0xFFFFFFFFC00000034, 0, 0, INACCESSIBLE_BOOT_DEVICE
16:07: Reset, Preparing Automatic Repair
18:08: Blackness (briefly), Black Flag "Diagnosing your PC" with Twirls
18:09: Automatic Repair, Your PC did not start correctly: Restart or Advanced Options
18:10: Advanced Options > Use another operating system > Windows 8.1 Safe Mode
18:10: Reboot, Black Flag no Twirls, no disk activity (I can browse to VM files on the host)
18:12: BSOD INACCESSIBLE_BOOT_DEVICE.
18:16: Hail Mary play, upgrading from VMware Player 5.0.2 to Player 6.0
18:17: Started download (94MB)
18:24: Download complete
18:25: Starting Player 6.0 installation...
18:26: Uninstalling Player 5.0
18:27: Installing Player 6.0
18:30: Installation complete
18:31: Running Player 6.0
18:32: Downloading VMware tools
18:32: Rebooting WIN8164D
18:33: Automatic Repair, PC did not start correctly.
18:35: Black Flag, no Twirls
18:35: BSOD, INACCESSIBLE_BOOT_DEVICE
18:36: Waiting for VMware tools to download, then I will reboot the host.
18:36: Download complete, updating...
18:37: Update finished.
18:50: I rebooted ChipPro and the Bluetooth host failed to start. Restarting triggered a major Windows Update installation.
18:51: Starting WIN8164D
18:53: PC did not start correctly
18:54: Rebooting into (supposedly) Safe Mode
18:55: BSOD
18:55 Officially declaring WIN8164D dead.

20131007 - Chip - Oct 7 2013

I upgraded ChipPro to VisualStudio 2013RC.
After I (re)installed WDK8.1 VS presented the "Driver" menu item.
I created a new Win81 64bit Debug VM (this took a long time!)
Tried to configure the target
Need to enable network discovery and file sharing on remote
Control > Network > Sharing Center > Advanced sharing
Need to install remote tools first: C:Program FilesWindows Kits8.1Remotex64WDK Test Target Setup x64-x64_en-us.msi
VS13 > Driver > Configure
Automatic settings
Copying to remote computer takes a *LONG* time (20-30 minutes)
After installing the remote tools, Win8164D fails to boot and VS13 fails to connect. (Typical)
Overall experience was a dismal failure. The target ran extremely slowly, much slower than Win832. I was never able to connect using VS13. The target eventually "reset" itself, which was a complete reinstall from installer (which took well over an hour).

20131005 - Chip - Oct 5 2013

Added ARIN queries as alternative to getaddrinfo().

20131002 - Chip - Oct 2 2013

I now have my Authenticode code signing key.
I'm IN!
NetMon.sys x64 loaded on Surface Pro (Win8 Retail x64)!
Trying to remember the process ...
I bought the driver signing certificate from Symantec for $99.
Use Internet Explorer
Symantec and GeoTrust must be added to the "Trusted Websites"
The certificate must be requested on the same computer used to later retrieve the certificate.
Verification required printing a certificate request on company letterhead, having it notarized, and sending a scan to Symantec. Symantec sent the certification about three hours after I sent the scan via email.
After walking through the email/website steps, the certificate can be managed from
IE > Tools > Content > Certificates > Personal
Export the certificate from IE to a .pxf file. Use a disposable password, and DO NOT delete the key after exporting. Do include the private keys. This is for backup only, signtool will use the certificate from the local store.
Download the Microsoft cross-certificate, save as "DriverMSCV_VSClass3.cer"
SignTool:
> signtool sign /v /ac "MSCV_VSClass3.cer" /s MY /n "nlited systems inc." /t http://timestamp.VeriSign.com/scripts/timstamp.dll NetMon.sys
Help hints:
Verisign
Verisign

20130930 - Chip - Sep 30 2013

Added code to capture data streams.
Ran into an aggravating linker problem, NdisGetDataBuffer() would not link. A Google search popped out the solution:
extern "C" { #include "ndis.h" }
This fixed it.
I struggled with trying to capture the packet stream using the FWPS_LAYER_OUTBOUND_IPPACKET_V4 layer, but I could only find the IP headers, not the packet payload.
I switched to FWPS_LAYER_STREAM_V4 and was able to capture the payloads, both inbound and outbound. However this does not give me access to the packet headers. Most of the header info can be extracted from the metadata and other sources.

20130929 - Chip - Sep 29 2013

Moved IP address text conversions to IpHelper.cpp
Added printf-style formatter to IpFormatAddr()
Added background host name resolver.
Added Main window with connection listbox.
Added Connection manager.

20130928 - Chip - Sep 28 2013

I now have multiple filters running!
It seems that I need to create separate FWPS CallOut, FWPM CallOut, and FWPM Filter structures (with unique GUIDs) for each individual event I want to trap; even though I am able to send the events to the same callback function.
This seems to be overkill and overly complicated, and I suspect it is not actually this difficult. However, this seems to work...
I am able to trap connect and disconnect events for IP4.
Added connect event for IP6, but I am not interpreting the IP6 addresses correctly.
FIXED.

20130927 - Chip - Sep 27 2013

Created individual Log items for events.
Trying to install multiple filters, but it is not working yet.
FWPM_LAYER_ALE_ENDPOINT_CLOSURE_V4 returns C022014:STATUS_FWP_INCOMPATIBLE_LAYER
FWPM_LAYER_ALE_AUTH_CONNECT_V6 returns C0220009:STATUS_FWP_ALREADY_EXISTS
Links about elevating privilege level:
MSFT KB 981778
MSDN
CodeProject.com

20130925 - Chip - Sep 25 2013

Wrote GeoDB to convert GeoIP text files into an easily searched binary database with indices.
Wrote ControlGeoDB.cpp to use the database files.

20130924 - Chip - Sep 24 2013

Added user command parser
Added ability to add IPv4 addresses to the Driver/Filter block list.

20130923 - Chip - Sep 23 2013

My barebones firewall is now logging all the connections to the Control app.
FileRead is working
Event log is working
Added GeoIP.cpp, geographic lookup by IPv4 address.

20130922 - Chip - Sep 22 2013

I'm IN! Yippee!!
I am now seeing and logging every IPv4 outbound connection request!
I am able to cleanly uninstall and reinstall.
Catching the events requires a combination of both WFP and NDIS functions.
I downloaded and unpacked the MSDN example WFP example. It is massive. The helper function to extract individual data items from the Classify data is almost 100K of source code, but it is very useful for extracting the (very complicated!) roadmap to the particular data items of interest.
TODO:
Add ability to BLOCK particular remote IP addresses (8.8.8.8) [DONE!]
Obtain a Production Driver signing key so I can build 64bit drivers.
Create a Win8 Pro 32bit target VM
Configure for remote kernel debugging using VS2012
Step through code remotely.
Figure out how to elevate my process so I no longer need to launch as Admin
Create event log buffer that can be pulled by the Control app
Add log viewer window to Control.
I went from knowing nothing about NDIS/WFP to a working (albeit barebones) firewall kernel driver in ~10 days.

20130921 - Chip - Sep 21 2013

Ioctls between the control app and the driver are now working.
Doing a lot of research in the VS12 Help docs, especially the WDK topic.
My NDIS callout is now being called! Yippee!

20130916 - Chip - Sep 16 2013

If my new() function is called with optimizer disabled, it tries to link to __CxxFrameHandler3.
There is a detailed explanation of this.
I do not have any solution for this other than to either not disable optimizations or not call my new().
If I do not call my new() MAKE SURE to zero out the object in its constructor!

20130915 - Chip - Sep 15 2013

I tried to recompile the NetMon project on ChipPro2 (Win8/64bit)
Perforce downloaded ControlResourceresource.h as mangled UTF16.
I had to set the filetype to "text" on Win832 (using an unmangled source file)
UPDATE 20131206: I wrote P4Fix.exe to unmangle the files.
After unmangling:
p4 edit file; p4 reopen -t utf16 file; p4 submit file
The P4Fix project is under \HomeChipAppUtilP4P4Fix
The build could not find NtDDK.h
I was using WinDDK v6.1 (WINDDK) on ChipPro2, WDK v8.0 (WDKPATH) on Win832.
I installed WDK 8.0 on ChipPro2, but it did not include all the various driver models (KM and WDM in particular); it only included UM (Unified Model) and WinRT.
I downloaded WDK 8.1 from MSDN.Microsoft.com
The installer (finally) gave me a choice for destination! I installed to C:CLWinDDK81.
This installed all the driver models.
http://msdn.microsoft.com/en-us/library/windows/hardware/ff557573%28v=vs.85%29.aspx
The build could not find "excpt.h"
Set VS12PATH=C:CLVS12
The build could not find "winapifamily.h" (This is a new one!)
WDK v8.0: C:Program Files (x86)Windows Kits8.0Includesharedwinapifamily.h
It seems to be missing from WDK 8.1, so I copied it: > copy "C:Program Files (x86)Windows Kits8.0Includesharedwinapifamily.h" C:CLWinDDK81Includeshared
The build again failed on "specstrings.h", so I am abandoning the "just copy files" method. I deleted winapifamily.h from the 81 directory.
I am instead just copying the whole WDK 8.0 tree from a know good installation (Win832) to ChipPro C:CLWinDDK80 and WDKPATH pointing to that.
I tried using pkzip to zip up the entire WDK 8.0 tree, but it silently failed to create the final zipfile after compressing all the files!
I used 7z with the same result. Then I realized that Microsoft created the WDK directories using the admin account, so the final writes to the WDK directory fails.
When I created the zipfile from another directory, it included WDK80.zip -- which does not appear in any directory view of the source directory! Somehow an invisible file was created.
After I was (finally!) able to extract the WDK80 tree (to C:CLWinDDK80) I was able to rebuild NetMon.sys on ChipPro.
Even after the project built successfully, Intellicode complained it could not find NtDDK.h.
I closed VS12, deleted NetMon.sdf, reopened the NetMon project, and everything was OK.
TODO: Figure out which Microsoft WDK installer actually works:
Allows me to specify the destination
Installs all driver models
Includes everything required.
For now, just unzip [ColdStore]DisksDevMSDNWin8SDKWDK80.zip and use that.
UPDATE 20131206: VS2013 (not RC) installs well.
Following the instructions in the VS12 help docs for debugging drivers, it refers to "On the host computer, in Microsoft Visual Studio, on the Driver menu, choose Test > Configure Computers." But my installation of VS12 does not have a "Driver" menu.
I guess I am still stuck in printf-debugging mode, which is frustrating. I may need to uninstall, download, reinstall Visual Studio 2012 + WDK81 on ChipPro. (Not today)
VS12 on Win832 does have the "Drivers" menu, so I believe reinstalling a later version of VS2012 should do the trick.
I need to create a new Win832 VM anyway, this time installing Win8 Pro so I can join the domain.
Now the service installs, and reports that it was started OK, but I see no signs of life from the driver in DbgView.
The answer is buried in Event Viewer > Windows Logs > Security: "Audit Failure: Code integrity determined that the image hash of a file is not valid. The file could be corrupt due to unauthorized modification or the invalid hash could indicate a potential disk device error."
So apparently I need to sign the driver even for 32bit Windows.
Frustration mounts, I am unable to load the driver any more, even when signed!
FOUND: This was a FALSE NEGATIVE from DbgView! NetMon.sys was indeed being loaded!
> bcdedit /set testsigning on
ARRRGH!! DbgView has been lying to me!
I found another tool, DriverView (http://www.nirsoft.net/utils/driverview.html) that shows NetMon.sys is indeed loaded even though I am seeing no output in DbgView. NetMon.sys disappeared from the list when the driver was unloaded.
I rebuilt without the signing step, and the driver failed to load and generated an event in th Windows security log.
I rebuilt with the signing step and the driver loaded.
WARNING: DriverView does not re-sort new entries! NetMon.sys appeared out of order at the bottom of the list.
I am backing out the bcdedit testsigning flag:
> bcdedit /deletevalue testsigning
I am disbling IntegrityServices, see if it makes any difference: > bcdedit /set integrityservices disable
Lessons learned:
Even 32bit versions of Win8 require the driver to be signed.
UNLESS bcdedit /set integrityservices disable
Then unsigned drivers will load (but only for 32bit?)
DbgView is not as reliable as I had thought.
FIX: Immediately after starting DbgView, check/uncheck/recheck both "Capture Kernel" and "Verbose Kernel" settings.
DriverView is a useful tool to verify whether a driver has been loaded and the confirm that the version info matches the latest build.
WARN: DriverView does not automatically sort new entries.
VM will run incredibly slowly if the memory is set to more than the available physical memory.
Mysteries:
I don't understand why the first NetMon.sys driver loaded yesterday. My only guess is that it had been built on ChipPro, which has the DevNlited certificate installed.
I don't understand why DbgView worked for a while, then stopped printing DbgPrint messages.
Either (or both) "Capture Kernel" and "Verbose Kernel" settings are *NOT* operationally persistent even though the UI indicators are. (The UI checkboxes may be lying.)
This false indicator wasted an entire afternoon. >:{|
Poking around on the NirSoft.net site I found a shitload of stuff, very similar to SysInternals. One of the utilities is cports.exe, which looks a *lot* like TcpView with some enhancements and functionally the same as what I want NetMon to do. Cports.exe only takes a snapshot of the open sockets and is a user-mode app. It parses a database of the geographic locations of IP addresses.

20130914 - Chip - Sep 14 2013

Authenticode signing seems to be a long-term project. I read in the MSDN docs that supposedly 32bit versions of Windows, including Win8, do not enforce the Authenticode requirement. I will test this, if it works it will allow me to continue working on the driver in advance of being able to Authenticode sign.
I built a Win8/32bit VM (Win832)
Installed VS2012
This was the cleanest and most problem-free installation I have had in a long time.
ANNOYING: The DDK is installed to ugliest directory I can imagine: "C:Program FilesWindows Kits8.0"
Installed Perforce 2013
Installed Epsilon, basic Bin tools.
Installed ActiveState Perl
Building the x86 version of NetMon.sys:
Unresolved externals on 4 NtDDK functions, including __imp__KeInitializeEvent
I need to set the following preprocessor definitions:
X: KERNEL;_WINDOWS; _X86_; //Must match the build i686; STD_CALL; CONDITIONAL_HANDLING; NT_UP;NT_INST=0;WIN32=100;_NT1X_=100;WINNT; NTDDI_VERSION=0x06000000; //This is for Win8 _WIN32_WINNT=0x0600; //Must match NTDDI_VERSION WIN32_LEAN_AND_MEAN;DEVL;FPO;_IDWBUILD
I also need to set the default calling convention to __stdcall C/C++ > Advanced > Calling convention: __stdcall This is not required for x64 but causes problems for x86.
Ran into the Perforce UTF16 problem with the resource.h files again. The files need to be checked out for edit, the "p4 reopen -t utf16"
I'm in! NetMon.sys has successfully loaded into Win832.

20130912 - Chip - Sep 12 2013

Trying to sign my driver... http://msdn.microsoft.com/en-us/library/windows/hardware/hh801887
Symantec order 10267736
Once the certification is assigned, continue with MSFT.
Symantec denied the request because they could not find a listing in any business directories (Yellow Pages, etc).
I also need to include real phone numbers in the certificate. So I will need to create a throwaway Skype number for this.

20130911 - Chip - Sep 11 2013

Started the NetMon project
I want to experiment with "Windows Filtering Platform Callout Drivers"
Described in the VS12 Help documentation under
VS12 Help Windows Driver Development (Windows Drivers) Device and Driver Technologies (Windows Drivers) Network Devices (Windows Drivers) Network Drivers Starting with Windows Vista (Windows Drivers) Network Driver Design Guide (Windows Drivers) Windows Filtering Platform Callout Drivers (Windows Drivers)
I want to detect, log, reject TCP/UDP connections.
Build environment
Compiler: VS2012
Include/Lib: WDK (Using KM model)
WDKPATH=D:CLWDK
VS12PATH=D:CLVS12
Created the NetMon Driver shell, borrowing from DbgOut 11.1.
Came together very quickly, I was able to compile and link a complete shell in about 2.5 hours.
Ran into and fixed a weird linker warning: System.obj : warning LNK4217: locally defined symbol memmove imported in function OsMemMove The offending source code:
void *OsMemMove(void *pDst, const void *pSrc, UINT ByteCt) { RtlMoveMemory(pDst,pSrc,ByteCt); return(pDst); }
RtlMoveMemory is just a macro around memmove().
FIXED: Remove "_DLL" from the C preprocessor definitions. This macro forces everything to be declared as __declspec(dllimport), but in the KM driver model memmove is assumed to be an intrinsic so the linkage flags don't match.
Post-Build event
Executes the script to self-sign the driver.
For some reason, VS12 does not display the description text.
The script does execute, proven by looking at the SignTool.log file.
I cannot type the file from the script because VS12 uses paths that are a mix of forward and back slashes.