These instructions are for setting up a Windows build machine from scratch.
For subsequent machines, the setup can be simplified by copying components from the first machine - see windows_simple.md.
On Windows, the build system runs within Git for Windows. This provides a bash shell that enables our Unix build system to work on Windows without alteration.
The build system assumes Windows itself is installed on the C:
drive. If this is not the case on your machine, set the ROOT
environment variable to the appropriate drive.
If you use Gradle to build Machine Learning you do not need to work in Git Bash, as Gradle tasks will run it when necessary. In order to develop interactively on Windows, you must work in Git Bash. You will need the following environment variables to be defined:
CPP_SRC_HOME
- This must contain the absolute path of the top level source code directory.
You will also need to add several directories to your PATH
environment variable. These need to be in the MinGW format, i.e. beginning with /drive letter/ followed by the directory path using forward slashes instead of backslashes, and with no spaces in the directory names. Where a Windows directory name contains a space, you must use the 8.3 version of that directory name (which you can find using dir /x
at a command prompt). The directories that need to be added are:
/c/PROGRA~1/MICROS~1/2022/Professional/VC/Tools/MSVC/14.41.34120/bin/Hostx64/x64/
/c/PROGRA~1/MICROS~1/2022/Professional/Common7/IDE
/c/PROGRA~2/WI3CF2~1/8.0/bin/x64
/c/PROGRA~2/WI3CF2~1/8.0/bin/x86
/c/PROGRA~1/MICROS~1/2022/Professional/TEAMTO~1/PERFOR~1/x64
/c/PROGRA~1/MICROS~1/2022/Professional/TEAMTO~1/PERFOR~1
/c/PROGRA~1/Java/jdk1.8.0_121/bin
/c/PROGRA~1/CMake/bin
/c/usr/local/bin
/c/usr/local/lib
For example, you might create a .bashrc
file in your home directory containing this:
export CPP_SRC_HOME=$HOME/ml-cpp
VCVER=`/bin/ls -1 /c/PROGRA~1/MICROS~1/2022/Professional/VC/Tools/MSVC | tail -1`
VCBINDIR=/c/PROGRA~1/MICROS~1/2022/Professional/VC/Tools/MSVC/$VCVER/bin/Hostx64/x64:/c/PROGRA~1/MICROS~1/2022/Professional/Common7/IDE:/c/PROGRA~2/WI3CF2~1/8.0/bin/x64:/c/PROGRA~2/WI3CF2~1/8.0/bin/x86:/c/PROGRA~1/MICROS~1/2022/Professional/TEAMTO~1/PERFOR~1/x64:/c/PROGRA~1/MICROS~1/2022/Professional/TEAMTO~1/PERFOR~1
export JAVA_HOME=/c/PROGRA~1/Java/jdk1.8.0_121
export PATH="$CPP_SRC_HOME/build/distribution/platform/windows-x86_64/bin:$VCBINDIR:/mingw64/bin:$JAVA_HOME/bin:/c/usr/local/bin:/c/usr/local/lib:/bin:/c/Windows/System32:/c/Windows:/c/PROGRA~1/CMake/bin"
64 bit Windows is required.
It is possible to build on Windows Server 2012r2, Windows Server 2016, 2019 & 2022 and Windows 10. Other versions may encounter problems.
Download sdksetup.exe
from https://developer.microsoft.com/en-us/windows/downloads/windows-8-sdk and run it. Accept the default installation location, opt out of the customer experience improvement program and accept the license agreement. Then install with all features selected.
Visit https://my.visualstudio.com/downloads and download the DVD image, VisualStudioSetup.exe
, by clicking on the Download
button for Windows SDK for Windows 11, version 21H2
. Once downloaded, double click on the iso
image to mount it, then navigate to the newly mounted drive in File Explorer
and run WinSDKSetup.exe
by double clicking on it.
Make sure you install the Windows 8 SDK before this, as it cannot be installed afterwards.
The Professional edition requires an MSDN subscription. Visit https://my.visualstudio.com/downloads and download the installer, VisualStudioSetup.exe
, by clicking on the Download
button for Visual Studio Professional 2022
, and run it.
On the "Workloads" page that is displayed after a short while, check "Desktop development with C++". Then click on the "Individual Components" tab and check "Windows Universal CRT SDK" (about half way down the list). Then click "Install".
Download Git-2.46.2-64-bit.exe
from https://github.com/git-for-windows/git/releases/download/v2.46.2.windows.1/Git-2.46.2-64-bit.exe.
Install it using mainly the default options suggested by the installer, but on the feature selection dialog also check "On the desktop" in the "Additional icons" section.
Do not change the suggested default autocrlf setting (true), as Unix line endings do not work for unit test test files.
As well as providing a Git implementation, Git for Windows comes with Windows ports of a variety of common Unix commands, e.g. grep
, sed
, awk
, find
, and many others.
CMake version 3.19.2 is the minimum required to build ml-cpp. Download the MSI installer for version 3.27.9 from https://cmake.org/files/v3.27/cmake-3.27.9-windows-x86_64.msi (or get a more recent version).
Install it mainly using the default options except on the "Install Options" dialog check "Add CMake to the system PATH for all users".
The instructions below relate to building 3rd party dependencies from source. These steps can either be performed
manually or, if preferred, automatically by running the PowerShell script build_windows_third_party_deps.ps1
in the
dev-tools
directory.
Whilst it is possible to download a pre-built version of zlib1.dll
, for consistency we want one that links against the Visual Studio 2022 C runtime library. Therefore it is necessary to build zlib from source.
Download the source code from http://zlib.net/ - the file is called zlib131.zip
. Unzip this file under C:\tools
, so that you end up with a directory called C:\tools\zlib-1.3.1
.
To build, start a command prompt using Start Menu -> Apps -> Visual Studio 2022 -> x64 Native Tools Command Prompt for VS 2022, then in it type:
cd \tools\zlib-1.3.1
nmake -f win32/Makefile.msc LOC="-D_WIN32_WINNT=0x0601"
nmake -f win32/Makefile.msc test
All the build output will end up in the top level C:\tools\zlib-1.3.1
directory. Once the build is complete, copy zlib1.dll
and minigzip.exe
to C:\usr\local\bin
. Copy zlib.lib
and zdll.lib
to C:\usr\local\lib
. And copy zlib.h
and zconf.h
to C:\usr\local\include
.
Download libxml2-2.9.14.tar.xz
from https://download.gnome.org/sources/libxml2/2.9/libxml2-2.9.14.tar.xz.
Extract it in a Git bash shell using the GNU tar that comes with Git for Windows, e.g.:
cd /c/tools
tar Jxvf /z/cpp_src/libxml2-2.9.14.tar.xz
Edit C:\tools\libxml2-2.9.14\win32\Makefile.msvc
and change the following lines:
CFLAGS = $(CFLAGS) /D "NDEBUG" /O2
to:
CFLAGS = $(CFLAGS) /D "NDEBUG" /O2 /Zi /D_WIN32_WINNT=0x0601
(because we might as well generate a PDB file and we want to limit Windows API functions to those that exist in Windows Server 2008r2).
Start a command prompt using Start Menu -> Apps -> Visual Studio 2022 -> x64 Native Tools Command Prompt for VS 2022, then in it type:
cd \tools\libxml2-2.9.14\win32
cscript configure.js iconv=no prefix=C:\usr\local
nmake
nmake install
Download version 1.86.0 of Boost from https://archives.boost.io/release/1.86.0/source/boost_1_86_0.tar.bz2 . You must get this exact version, as the Machine Learning build system requires it.
Assuming you chose the .bz2
version, extract it in a Git bash shell using the GNU tar that comes with Git for Windows, e.g.:
cd /c/tools
tar jxvf /z/cpp_src/boost_1_86_0.tar.bz2
Edit boost/unordered/detail/prime_fmod.hpp
and change line 37 from:
constexpr static std::size_t const sizes[] = {13ul, 29ul, 53ul, 97ul,
to:
constexpr static std::size_t const sizes[] = {3ul, 13ul, 29ul, 53ul, 97ul,
Start a command prompt using Start Menu -> Apps -> Visual Studio 2022 -> x64 Native Tools Command Prompt for VS 2022, then in it type:
cd \tools\boost_1_86_0
bootstrap.bat
b2 -j6 install --prefix=C:\usr\local --layout=versioned --disable-icu --toolset=msvc-14.2 cxxflags="-std:c++17" linkflags="-std:c++17" --build-type=complete -sZLIB_INCLUDE="C:\tools\zlib-1.2.13" -sZLIB_LIBPATH="C:\tools\zlib-1.2.13" -sZLIB_NAME=zdll --without-context --without-coroutine --without-graph_parallel --without-mpi --without-python architecture=x86 address-model=64 optimization=speed inlining=full define=BOOST_MATH_NO_LONG_DOUBLE_MATH_FUNCTIONS define=BOOST_LOG_WITHOUT_DEBUG_OUTPUT define=BOOST_LOG_WITHOUT_EVENT_LOG define=BOOST_LOG_WITHOUT_SYSLOG define=BOOST_LOG_WITHOUT_IPC define=_WIN32_WINNT=0x0601
The Boost headers and appropriate libraries should end up in C:\usr\local\include
and C:\usr\local\lib
respectively.
Visual Studio's C runtime library has a strftime()
function but no strptime()
function, and unfortunately it's needed for our date handling functionality.
There is an open source strptime()
implementation from NetBSD. It needs some hacking about to get it to compile on Windows, but this is not an insurmountable challenge. Download strptime.c
and private.h
from http://cvsweb.netbsd.org/bsdweb.cgi/src/lib/libc/time/strptime.c?rev=HEAD and http://cvsweb.netbsd.org/bsdweb.cgi/src/lib/libc/time/private.h?rev=HEAD respectively. These files are also checked in in the strptime
sub-directory of the directory containing these instructions, as are some patch files that modify the original files so that they'll compile on Windows.
In a Git bash shell, copy the original source files and patches to a temporary directory and apply the patches, e.g.:
cp -r strptime /c/tools
cd /c/tools/strptime
patch -i strptime.ucrt.patch
patch -i private.patch
Start a command prompt using Start Menu -> Apps -> Visual Studio 2022 -> x64 Native Tools Command Prompt for VS 2022, then in it type:
cd C:\tools\strptime
cl -c -nologo -O2 -MD -DWIN32_LEAN_AND_MEAN -D_WIN32_WINNT=0x0601 -DNDEBUG strptime.c
lib -NOLOGO strptime.obj
copy strptime.lib C:\usr\local\lib
PyTorch currently requires Python 3.7 or higher; we use version 3.10.
Download the executable installer for Python 3.10.10 from https://www.python.org/ftp/python/3.10.10/python-3.10.10-amd64.exe.
Right click on the installer and "Run as administrator". (Note that elevating privileges during the install is not sufficient for the Python 3.10.10 installer, it needs to have elevated privileges when first run. Obviously this is bad practice, but that's the way it is in version 3.10.10.)
On the first installer screen click "Customize installation". (Although "Install Now" seems like it would do the job, the "Install launcher for all users" option literally only installs the launcher for all users, not Python itself.)
Click "Next" on the "Optional Features" screen.
On the "Advanced Options" screen, check "Install for all users" and "Add Python to environment variables". Then click "Install".
For the time being, do not take advantage of the option on the final installer screen to reconfigure the machine to allow paths longer than 260 characters. We still support Windows versions that do not have this option.
(This step requires a lot of memory. It failed on a machine with 12GB of RAM. It just about fitted on a 20GB machine. 32GB RAM is recommended.)
PyTorch requires that certain Python modules are installed. Start a command prompt "cmd.exe" using "Run as administrator". In it run:
pip install numpy ninja pyyaml mkl mkl-include setuptools cmake cffi typing_extensions future six requests dataclasses
Next, in a Git bash shell run:
cd /c/tools
git clone --depth=1 --branch=v2.5.1 https://github.com/pytorch/pytorch.git
cd pytorch
git submodule sync
git submodule update --init --recursive
Edit torch/csrc/jit/codegen/fuser/cpu/fused_kernel.cpp
and replace:
std::unique_ptr<FILE, decltype(&_pclose)> pipe(
_wpopen(cmd.c_str(), L"r"), _pclose);
with:
std::unique_ptr<FILE> pipe;
Also replace:
intptr_t r = _wspawnve(_P_WAIT, comspec, a, e.data());
return r;
with:
return -1;
This file is used to compile fused CPU kernels, which we do not expect to be
doing and never want to do for security reasons. Replacing the calls to
_wpopen()
and _wspawnve()
ensures that a heuristic virus scanner looking for
potentially dangerous function calls in our shipped product will not encounter
these functions that run external processes.
In C:\Program Files (x86)\Microsoft Visual Studio\2022\Professional\VC\Tools\MSVC\14.29.30133\lib\x64
rename libomp.lib
to libomp.lib.bak
and rename libompd.lib
to libompd.lib.bak
.
This is to prevent CMake's FindOpenMP module from linking to Microsoft's experimental
library that we are not allowed to redistribute.
Start a command prompt using Start Menu -> Apps -> Visual Studio 2022 -> x64 Native Tools Command Prompt for VS 2022, then in it type:
cd \tools\pytorch
set BUILD_TEST=OFF
set BUILD_CAFFE2=OFF
set USE_NUMPY=OFF
set USE_DISTRIBUTED=OFF
set USE_MKLDNN=ON
set USE_QNNPACK=OFF
set USE_PYTORCH_QNNPACK=OFF
set USE_XNNPACK=OFF
set MSVC_Z7_OVERRIDE=OFF
set PYTORCH_BUILD_VERSION=2.5.1
set PYTORCH_BUILD_NUMBER=1
python setup.py install
When it completes, at the very end there is an error message that the Python modules could not be installed. This can be ignored as we only want the headers, DLLs and import libraries.
To finish off, in a Git bash shell run:
cd /c/tools/pytorch
rm -rf /c/usr/local/include/pytorch
mkdir /c/usr/local/include/pytorch
cp -r torch/include/* /c/usr/local/include/pytorch/
cp torch/lib/torch_cpu.dll /c/usr/local/bin/
cp torch/lib/torch_cpu.lib /c/usr/local/lib/
cp torch/lib/c10.dll /c/usr/local/bin/
cp torch/lib/c10.lib /c/usr/local/lib/
cp torch/lib/fbgemm.dll /c/usr/local/bin/
cp torch/lib/fbgemm.lib /c/usr/local/lib/
cp torch/lib/asmjit.dll /c/usr/local/bin/
cp torch/lib/asmjit.lib /c/usr/local/lib/
Finally, in C:\Program Files (x86)\Microsoft Visual Studio\2022\Professional\VC\Tools\MSVC\14.29.30133\lib\x64
rename libomp.lib.bak
to libomp.lib
and rename libompd.lib.bak
to libompd.lib
.