Posts

从源码编译OpenCV 4.5.x：深度学习与CUDA加速实战指南

从源码编译OpenCV 4.5.x：深度学习与CUDA加速实战指南在这篇图文并茂的指南中，我们着重讲解如何在Windows 10环境下通过CMake构建OpenCV 4.5.x的定制版本，特别关注深度学习模块（DNN）与CUDA加速的配置方法。我们将通过结构化流程图和关键配置点标注，帮助开发者快速完成编译环境搭建，确保DNN推理模块能够充分利用NVIDIA GPU资源。 🔧 环境准备硬件要求 - NVIDIA GPU（CUDA兼容设备） - >= 8GB显存（建议16GB以上）软件依赖 1. Visual Studio 2019（需安装C++桌面开发套件） 2. CUDA Toolkit 11.3+ (**需与cuDNN版本对应**) 3. cuDNN 8.2+（下载路径示例：https://developer.nvidia.com/cudnn） 4. Git命令行工具 📦 源码下载与工程初始化源码获取（Git操作） :: 进入工作目录 cd /d D:/WORK/opencv-github :: 克隆核心库 git clone https://github.com/opencv/opencv.git -b 4.5.2 # 注意添加tag或branch指定版本 :: 克隆扩展模块库 git clone https://github.com/opencv/opencv_contrib.git -b 4.5.2 TIP 直接在GitHub页面切换对应版本更稳妥（commit版本记录见下图） 🔧 CMake配置流程详解配置界面流程图 graph TD A[启动CMake-GUI] --> B[设置源目录] B --> C[设置构建目录] C --> D[首次Configure选择VS2019x64] D --> E[等待IPPICV自动下载] E --> F[再次Configure应用结果] F --> G[设置OPENCV_EXTRA_MODULES_PATH] G --> H[启用CUDA相关选项] H --> I[勾选必要编译选项] I --> J[Generate生成VS工程] 关键配置项解析 1. 模块路径设置 Name: OPENCV_EXTRA_MODULES_PATH Path: D:/WORK/opencv-github/opencv_contrib/modules 2. CUDA加速相关配置 FolderPath[CUDA选项区]: - WITH_CUDA → ON - OPENCV_DNN_CUDA → ON - WITH_CUDNN → ON - CUDA_FAST_MATH → ON 注意点： - 禁用 BUILD_CUDA_STUBS（防止生成无效模拟代码） - CUDA gfx平台无需启用（Windows默认支持x64） 3. 高级优化选项优化选项配置组: - BUILD_opencv_world → ON（唯一动态库模式） - BUILD_DOCS → OFF（禁用文档生成） - ENABLE_NONFREE → ON（启用专利算法模块）

Thursday, March 27, 2025 Read

在Docker中安装和配置OpenVino 2022.3

从代码安装官方推荐从代码安装开局不利选择通过gitee clone安装，但是反复clone失败。报错如下： # git clone -b 2022.3.0 https://gitee.com/openvinotoolkit-prc/openvino.git Cloning into 'openvino'... remote: Enumerating objects: 345561, done. remote: Counting objects: 100% (59023/59023), done. remote: Compressing objects: 100% (33490/33490), done. error: RPC failed; curl 56 GnuTLS recv error (-9): Error decoding the received TLS packet. fatal: the remote end hung up unexpectedly fatal: early EOF fatal: index-pack failed 解决方法 git config --global http.postBuffer 500M git config --global http.maxRequestBuffer 100M git config --global core.compression 0

Tuesday, April 11, 2023 Read

无需sudo运行docker命令

Run Docker commands without sudo 新安装的Docker，有可能需要使用sudo才能运行docker命令。比如，运行一个GPU docker镜像，需要使用sudo，否则会报错： $ docker run --rm --runtime=nvidia --gpus all nvidia/cuda:11.6.2-base-ubuntu20.04 nvidia-smi docker: permission denied while trying to connect to the Docker daemon socket at unix:///var/run/docker.sock: Post "http://%2Fvar%2Frun%2Fdocker.sock/v1.24/containers/create": dial unix /var/run/docker.sock: connect: permission denied. See 'docker run --help'. 报错信息就是说，没有权限访问docker daemon。这是因为docker默认情况下是不允许普通用户使用的，需要使用sudo才能运行docker命令。这样很不方便，下面介绍一下如何让docker不需要sudo就能运行。 1. Add the docker group if it doesn’t already exist 首先创建一个Docker 用户组，如果用户组已经存在，可以跳过这一步。 $ sudo groupadd docker 2. Add the connected user $USER to the docker group 然后把当前用户加入到docker用户组中。

Saturday, April 8, 2023 Read

开启face_recognition的CUDA支持以及人脸识别package的一点讨论

开启face_recognition的CUDA支持的方法 face_recognition是一个python包，用于人脸识别。属于基础的人脸识别包，支持人脸检测、人脸编码、人脸比对等功能。 face_recognition包支持使用GPU进行加速，但是默认情况下是不开启的。也就是说，直接用Pip或这conda方式进行安装，装上的包基本上是不支持GPU计算的。 face_recognition的核心计算部分依赖dlib，所以需要安装dlib的GPU版本。如需要使用GPU进行加速的时候，需要安装CUDA和cuDNN。安装dlib的GPU版本，需要下载代码，开启cuda和cudnn支持，然后编译安装。下面介绍一下安装过程。核心：编译安装dlib with CUDA 官方文档介绍开启cuda支持的dlib的安装过程如下： Installing dlib using conda with CUDA enabled Prerequisite: conda and/or miniconda are already installed Create a conda environment. $ conda create -n dlib python=3.8 cmake ipython Activate the environment. $ conda activate dlib Install CUDA and cuDNN with conda using nvidia channel $ conda install cuda cudnn -c nvidia Then find the path to the nvcc of this environment. We will use this path for the build step below

Friday, March 31, 2023 Read

使用Docker辅助图像识别程序开发：在Docker中访问GPU和、USB相机以及网络

引言在操作系统中发行应用程序，尤其是python应用程序，其环境配置常常是分发过程中的重要一环。如果像开发的时候那样手动构建，一方面工作量难以承受，另一方面经常会出现各种各样的问题。在不同的目标主机上手动构建环境，会受到目标操作系统的版本、文件系统、所安装软件包的情况影响。而且开发时所使用的一些默认安装包，到了发布的时候可能已经都被更新过，所以手动构建要求使用的包版本号也精确记录。安装和配置安装GPU docker，首先需要安装docker，然后在docker的基础上安装nvidia-docker。安装docker 参考链接 https://docs.docker.com/engine/install/ubuntu 安装nvidia-docker 参考链接 https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html#docker 在docker中显示GUI 如果在Docker中开发的是带有GUI的应用程序，也就是在docker中显示GUI，需要启动支持GUI的docker镜像。首先配置一下xhost xhost +local:docker 或者 xhost + 启动docker，由于需要docker中显示GUI，所以加入参数 -v /tmp/.X11-unix:/tmp/.X11-unix -e DISPLAY=$DISPLAY 完整指令如下： docker run --name mydocker --gpus all --shm-size=1g --ulimit memlock=-1 -it -v /tmp/.X11-unix:/tmp/.X11-unix -e DISPLAY=$DISPLAY snn-server:basic 注意，这里是启动了GPU docker，–gpus all是指定使用所有的GPU，如果只使用一块GPU，可以指定为–gpus 0。在Docker中访问usb相机如果需要在docker中访问usb相机，需要在启动docker的时候，追加以下参数 -v /dev/video0:/dev/video0 --device=/dev/video0 这样就把宿主机的/dev/video0映射到docker中的/dev/video0，然后在docker中就可以访问到相机了。 docker run --name mydocker --gpus all --shm-size=1g --ulimit memlock=-1 -v /dev/video0:/dev/video0 --device=/dev/video0 -it -v /tmp/.X11-unix:/tmp/.X11-unix -e DISPLAY=$DISPLAY myimage:latest 在Docker镜像中开放端口如果需要从docker中对外提供服务，需要在docker中向宿主机进行端口映射，才可以从宿主机访问到docker中的服务。 Docker中的端口映射，需要在启动docker的时候，加入参数 -p 8080:8080 这样，宿主机的8080端口就映射到docker中的8080端口了。

Thursday, November 3, 2022 Read

Build FFMpeg with CUDA on CentOS 8

Prepareing CUDA toolkits need to be installed. Then the following listed packages should be installed too. dnf -y install automake autoconf libtool make gcc gcc-c++ dnf --enablerepo=powertools -y install giflib-devel dnf --enablerepo=powertools -y install libexif-devel dnf -y install bison pkgconfig glib2-devel gettext make libpng-devel libjpeg-devel libtiff-devel libexif-devel giflib-devel libX11-devel freetype-devel fontconfig-devel cairo-devel fribidi-devel dnf -y install openssl openssl-devel Note that, powertools might be PowerTools according to different configurations, just replace the name.

Monday, July 4, 2022 Read

Installation of GNUPlot 5.4.3 on Linux with png/jpg export support

We need to install libpng and libgd before the compilation of gnuplot-5.4.3 We can install libpng using yum install yum install libpng-devel The libgd-devel package provided by yum is not compatable with gnuplot-5.4.3 on the aws-linux-2, we download and compile the officially new version. wget https://github.com/libgd/libgd/releases/download/gd-2.3.3/libgd-2.3.3.tar.gz tar zxvf ./libgd-2.3.3.tar.gz cd libgd-2.3.3 ./configure make sudo make install Then compile gnuplot. cd libs wget https://sourceforge.net/projects/gnuplot/files/gnuplot/5.4.3/gnuplot-5.4.3.tar.gz tar zxvf ./gnuplot-5.4.3.tar.gz cd gnuplot-5.4.3 ./configure make sudo make install

Monday, July 4, 2022 Read

C++中正则表达式的用法

C++11引入了 <regex> 标准库，为开发人员提供了强大的正则表达式支持。正则表达式（RegEx）可用于字符串匹配、搜索、替换等操作，是处理文本数据的利器。本文将从基础语法到高级用法，逐步介绍C++正则表达式如何高效使用。一、基础准备 1. 引入库和命名空间在使用正则表达式之前，需包含头文件并使用相关命名空间： #include <regex> #include <string> #include <iostream> using namespace std; 2. 构建正则表达式使用 std::regex 对象存储正则表达式： regex re("\\d+"); // 匹配数字，注意转义字符二、正则表达式语法基础 1. 常见元字符元字符含义 . 匹配任意单个字符（除换行符） * 匹配前一个字符 0 次或多次 + 匹配前一个字符 1 次或多次 ? 匹配前一个字符 0 次或 1 次 ^ 匹配字符串开头 $ 匹配字符串结尾 [] 匹配括号内的任意一个字符（如 [abc]） \d 匹配数字 [0-9] \D 匹配非数字 [^0-9] \w 匹配字母、数字、下划线 [A-Za-z0-9_] 2. 分组和捕获（Groups） regex re("(\\d{4})-(\\d{2})-(\\d{2})"); // 解析日期格式 YYYY-MM-DD smatch result; if(regex_search("2023-04-01", result, re)) { cout << "年份: " << result[1] << endl; // 输出 "2023" cout << "月份: " << result[2] << endl; // 输出 "04" } 三、核心函数与用法 1. 检查匹配（regex_match）检查整个字符串是否匹配：

Wednesday, June 15, 2022 Read

How to Implement FTP Upload by C++ and POCO

POCO POCO is a lightweight and flexible network library for C++ users. You can refer to the POCO library at its homepage: https://pocoproject.org/ or its github project page: https://github.com/pocoproject/poco . You can simply git clone from the POCO repository, and build it follows its official mannual. For me, I built it simply with the CMake-GUI tool on Windows, with all default settings. How to Upload Includes Stuffs Most of the FTP related APIs are in the Poco/Net/FTPClientSession.h header file. The Exception Processing tools are used all the time, so that we include it too. When you need to do the uploading to FTP server, Poco/StreamCopier.h is must be included.

Wednesday, June 15, 2022 Read

OpenCV 4.X 使用CvxText在图片显示汉字

最近又需要在图像上实时绘制汉字。一般来讲如果绘制汉字的需求绕不过的话，直接绘制在图片总归是最easy的实现方式。因为不然的话可能要额外调用GUI组件来实现。一般都是用freetype+cvxtext，老生常谈。且不说实际实现起来是否最easy，主要是这种方法多年来实践了无数次了，不过今次切换到OpenCV4.5，突然发现可能又要修改CvxText代码才可以，因为直接使用，不work。准备需要的依赖有： C/C++ 编译环境（似乎是废话） OpenCV (仍然废话) freetype的lib：提前编译好，官网是 https://freetype.org/，我使用的版本是2.9.1 字体文件，一般用simhei.ttf。在操作系统的字体里面哦。修改 CvxText 代码我这里有一份CvxText代码，在旧版本的OpenCV下可以使用（OpenCV3.X)。如今更换到了OpenCV4.5，这份代码直接使用会有些小问题，不过都很容易修改。 OpenCV头文件包含方式首先需要重写头文件包含方法。在OpenCV4以前，include下有两个子目录，分别是opencv，和opencv2。在OpenCV4.X后，include下只剩一个opencv2文件夹了。涉及到opencv的头文件包含代码，改为如下形式： #include "opencv2/core/core.hpp" #include "opencv2/core/core_c.h" #include "opencv2/highgui/highgui.hpp" #include "opencv2/imgproc/imgproc.hpp" 这里特别说明，引入core_c.h这个头文件很重要。因为我手里这份CvxText代码类型都是基于旧式的C类型，core_c.h 提供了对C类型的兼容。 CvScalar类型问题下一处需要修改的是和CvScalar相关的代码。尽管我们重新写了头文件包含，引入了C类型，但是有些代码仍然不能直接编译通过，因为CvScalar不能隐式的转为C++类型的cv::Scalar。下面的puttext函数代码中，我修改了显式的手工转换替代了注释中的代码。样子很丑，但是简单好用（总共花费了不到1分钟）。 int CvxText::putText(cv::Mat &frame, const char *text, CvPoint pos) { //return putText(frame, text, pos, CV_RGB(255, 255, 255)); CvScalar s = {255, 255, 255}; return putText(frame, text, pos, s); } int CvxText::putText(cv::Mat &frame, const wchar_t *text, CvPoint pos) { //return putText(frame, text, pos, CV_RGB(255, 255, 255)); CvScalar s = {255, 255, 255}; return putText(frame, text, pos, s); } cv::Mat转为IplImage 另一处就是比较老生常谈的问题，cv::Mat转为IplImage。这里之前的实现是直接采用C形式的强制转换，如下所示：

Wednesday, June 15, 2022 Read

Curved Text Detection by PaddleOCR

Text Detection Task is an old topic of Detction tasks. Curved text is much more free form. in this Post, we show you how to run the curved text detection by PaddleOCR. Setup Environment Create a virtual environment named paddle_env. conda create --name paddle_env python=3.8 Then activate it. conda activate paddle_env Install paddlepaddle and paddle ocr packages through pip. python -m pip install paddlepaddle pip install "paddleocr>=2.0.1" # Recommend to use version 2.0.1+ Test Environment Test the installations.

Thursday, April 14, 2022 Read

Tips of Migrating Numpy to C++ by OpenCV

Sigmoid A sigmoid function is a mathematical function having a characteristic "S"-shaped curve or sigmoid curve. From wikipedia: Sigmoid_function It’s formula: $S(t)=\frac {1}{1+e^{-t}}$ We can implement sigmoid function with python and numpy like this: import numpy as np z = 1/(1 + np.exp(-x)) Now let’s implement it in OpenCV with cpp. Suppose we’ve got a output from a neural network as a [1, h, w] cv::Mat named pred. We then do something like pred = sigmoid(pred).

Saturday, March 26, 2022 Read