# DynaSLAM

**Repository Path**: myypen/DynaSLAM

## Basic Information

- **Project Name**: DynaSLAM
- **Description**: Combines semantic segmentation with dynamic/static segmentation derived from geometric information to discard unreliable keypoints and make tracking more robust. Mask R-CNN provides the semantic masks; a moving-point criterion produces the dynamic/static mask; the two are combined into the mask of regions to remove; during frame construction the extracted keypoints are filtered against this mask.
- **Primary Language**: Unknown
- **License**: Not specified
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2020-04-09
- **Last Updated**: 2024-10-14

## Categories & Tags

**Categories**: Uncategorized
**Tags**: None

## README

# DynaSLAM

[[Project]](https://bertabescos.github.io/DynaSLAM/) [[Paper]](https://arxiv.org/pdf/1806.05620.pdf)

# Main Idea

Semantic segmentation is combined with dynamic/static segmentation derived from geometric information to discard unreliable keypoints and make tracking more robust:

- Mask R-CNN provides the semantic segmentation masks.
- A moving-point criterion (described below) produces the dynamic/static mask.
- The semantic mask and the dynamic/static mask are combined into the mask of regions to remove.
- During frame construction, the extracted keypoints are filtered against this mask and unreliable points are discarded, which makes tracking more reliable.

# Thoughts

1. Could optical flow be used to generate the dynamic/static mask? The flow induced by the camera's own motion would have to be compensated.
2. For navigation, ORB keypoints alone may be too sparse; could an edge-based keypoint detection algorithm be added?

DynaSLAM is a visual SLAM system that is robust in dynamic scenarios for monocular, stereo and RGB-D configurations. Having a static map of the scene allows inpainting the frame background that has been occluded by such dynamic objects.

DynaSLAM: Tracking, Mapping and Inpainting in Dynamic Scenes
[Berta Bescos](http://bertabescos.github.io), [José M. Fácil](http://webdiis.unizar.es/~jmfacil/), [Javier Civera](http://webdiis.unizar.es/~jcivera/) and [José Neira](http://webdiis.unizar.es/~jneira/)
RA-L and IROS, 2018

We provide examples to run the SLAM system in the [TUM dataset](http://vision.in.tum.de/data/datasets/rgbd-dataset/download) as RGB-D or monocular, and in the [KITTI dataset](http://www.cvlibs.net/datasets/kitti/eval_odometry.php) as stereo or monocular.

# Moving-Point Criterion

Reference frames for the current frame are selected from the keyframe database (at most 20 keyframes):

- Dissimilarity to the current frame: squared Euler-angle difference plus squared translation difference, each normalized by its minimum/maximum, then combined as a weighted sum `vDist = 0.7*vDist + 0.3*vRot`.
- Sort by dissimilarity in DESCENDING order and take the most dissimilar frames (at most 5) as reference frames.

Dynamic points are then extracted as follows (could this step use optical flow instead?):

1. Take reference-frame keypoints with depth in 0–6 m, back-project them to 3D points in the reference frame, and transform them into world coordinates.
2. Keep points whose viewing rays from the current frame and from the reference frame subtend an angle below 30°, i.e., points that are not too close.
3. Keep world points whose depth, re-projected into the current camera frame, is below 7 m.
4. Keep world points that re-project into a shrunken image region (20–620 in u, 20–460 in v) and whose current-frame depth at that pixel is nonzero.
5. Compare the projected depth with the current-frame depths in a 20×20 neighbourhood around the projection, and use the neighbourhood depths with a small depth difference to refine the current-frame depth value.
6. If the projected depth and the (refined) current-frame depth still differ by a large margin while the local depth variance is small, label the point as moving.
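To make steps 5 and 6 concrete, here is a minimal C++/OpenCV sketch of the per-point depth-consistency test. It is a hedged illustration, not the repository's actual geometry code: the function name `IsDynamicPoint` and the threshold values are hypothetical, chosen only to mirror the 20×20 neighbourhood and the large-gap/low-variance logic described above.

```cpp
// Hypothetical sketch of the dynamic-point test (steps 5-6); not DynaSLAM's own code.
#include <cmath>
#include <vector>
#include <opencv2/core.hpp>

// Returns true if a map point projected at pixel `uv` with depth `projDepth`
// is inconsistent with the depths measured in the current frame.
bool IsDynamicPoint(const cv::Mat& depthMap,   // CV_32F depth image, metres
                    const cv::Point& uv,       // projection into the current frame
                    float projDepth,           // depth of the world point in the current camera
                    float depthGap = 0.4f,     // illustrative gap threshold (m)
                    float maxVar   = 0.02f,    // illustrative variance bound (m^2)
                    int   radius   = 10)       // 20x20 neighbourhood
{
    const float centre = depthMap.at<float>(uv.y, uv.x);
    if (centre <= 0.f)
        return false;                          // no depth measurement: cannot decide

    // Step 5: gather neighbourhood depths close to the centre-pixel measurement
    // to obtain a denoised estimate of the measured depth.
    std::vector<float> support;
    for (int v = uv.y - radius; v <= uv.y + radius; ++v)
        for (int u = uv.x - radius; u <= uv.x + radius; ++u) {
            if (u < 0 || v < 0 || u >= depthMap.cols || v >= depthMap.rows)
                continue;
            const float d = depthMap.at<float>(v, u);
            if (d > 0.f && std::fabs(d - centre) < depthGap)
                support.push_back(d);
        }

    float mean = 0.f;
    for (float d : support) mean += d;
    mean /= static_cast<float>(support.size());  // never empty: the centre pixel qualifies

    float var = 0.f;
    for (float d : support) var += (d - mean) * (d - mean);
    var /= static_cast<float>(support.size());

    // Step 6: a large projected-vs-measured gap on a locally consistent
    // (low-variance) surface indicates that something moved.
    return std::fabs(projDepth - mean) > depthGap && var < maxVar;
}
```

In the DynaSLAM paper these per-point labels are subsequently grown into a dense dynamic/static mask; the sketch covers only the per-point decision.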
# Code Changes

- `Frame.cc`, `Frame.h`: filter the extracted keypoints against the provided mask and discard unreliable points (a sketch follows the usage examples below).
- `Tracking.cc`: stereo/monocular filter keypoints with the semantic mask only; RGB-D combines the semantic mask with the dynamic/static mask. Concretely, a lightweight motion-model tracking step first estimates the current-frame pose, after which the moving-point criterion yields the dynamic/static mask.
- Other: files were added for calling the Python code from C++.
- Stereo: the semantic detection of the left and right images is done by concatenating the two into a single image, feeding it to the network once, and splitting the resulting segmentation back in two. Since inputs are resized to the network's fixed resolution anyway, detection time barely increases; accuracy drops slightly, but the speedup makes this a nice idea (see the second sketch after the usage examples).

## Getting Started
- Install ORB-SLAM2 prerequisites: C++11 or C++0x Compiler, Pangolin, **OpenCV 2.4.11** and Eigen3 (https://github.com/raulmur/ORB_SLAM2).
- Install boost libraries with the command `sudo apt-get install libboost-all-dev`.
- Install python3, keras and tensorflow, and download the `mask_rcnn_coco.h5` model from this GitHub repository: https://github.com/matterport/Mask_RCNN/releases.
- Clone and build this repo:
```bash
git clone git@github.com:BertaBescos/DynaSLAM.git
cd DynaSLAM
chmod +x build.sh
./build.sh
```
- Place the `mask_rcnn_coco.h5` model in the folder `DynaSLAM/src/python/`.

## RGB-D Example on TUM Dataset

- Download a sequence from http://vision.in.tum.de/data/datasets/rgbd-dataset/download and uncompress it.
- Associate RGB images and depth images executing the python script [associate.py](http://vision.in.tum.de/data/datasets/rgbd-dataset/tools):
```
python associate.py PATH_TO_SEQUENCE/rgb.txt PATH_TO_SEQUENCE/depth.txt > associations.txt
```
These association files are given in the folder `./Examples/RGB-D/associations/` for the TUM dynamic sequences.
- Execute the following command. Change `TUMX.yaml` to TUM1.yaml, TUM2.yaml or TUM3.yaml for freiburg1, freiburg2 and freiburg3 sequences respectively. Change `PATH_TO_SEQUENCE_FOLDER` to the uncompressed sequence folder. Change `ASSOCIATIONS_FILE` to the path to the corresponding associations file. `PATH_TO_MASKS` and `PATH_TO_OUTPUT` are optional parameters.
```
./Examples/RGB-D/rgbd_tum Vocabulary/ORBvoc.txt Examples/RGB-D/TUMX.yaml PATH_TO_SEQUENCE_FOLDER ASSOCIATIONS_FILE (PATH_TO_MASKS) (PATH_TO_OUTPUT)
```

If `PATH_TO_MASKS` and `PATH_TO_OUTPUT` are **not** provided, only the geometrical approach is used to detect dynamic objects.

If `PATH_TO_MASKS` is provided, Mask R-CNN is used to segment the potential dynamic content of every frame. These masks are saved in the provided folder `PATH_TO_MASKS`. If this argument is `no_save`, the masks are used but not saved. If it finds the Mask R-CNN computed dynamic masks in `PATH_TO_MASKS`, it uses them but does not compute them again.

If `PATH_TO_OUTPUT` is provided, the inpainted frames are computed and saved in `PATH_TO_OUTPUT`.

## Stereo Example on KITTI Dataset

- Download the dataset (grayscale images) from http://www.cvlibs.net/datasets/kitti/eval_odometry.php
- Execute the following command. Change `KITTIX.yaml` to KITTI00-02.yaml, KITTI03.yaml or KITTI04-12.yaml for sequence 0 to 2, 3, and 4 to 12 respectively. Change `PATH_TO_DATASET_FOLDER` to the uncompressed dataset folder. Change `SEQUENCE_NUMBER` to 00, 01, 02, ..., 11. By providing the last argument `PATH_TO_MASKS`, dynamic objects are detected with Mask R-CNN.
```
./Examples/Stereo/stereo_kitti Vocabulary/ORBvoc.txt Examples/Stereo/KITTIX.yaml PATH_TO_DATASET_FOLDER/dataset/sequences/SEQUENCE_NUMBER (PATH_TO_MASKS)
```

## Monocular Example on TUM Dataset

- Download a sequence from http://vision.in.tum.de/data/datasets/rgbd-dataset/download and uncompress it.
- Execute the following command. Change `TUMX.yaml` to TUM1.yaml, TUM2.yaml or TUM3.yaml for freiburg1, freiburg2 and freiburg3 sequences respectively. Change `PATH_TO_SEQUENCE_FOLDER` to the uncompressed sequence folder. By providing the last argument `PATH_TO_MASKS`, dynamic objects are detected with Mask R-CNN.
```
./Examples/Monocular/mono_tum Vocabulary/ORBvoc.txt Examples/Monocular/TUMX.yaml PATH_TO_SEQUENCE_FOLDER (PATH_TO_MASKS)
```

## Monocular Example on KITTI Dataset

- Download the dataset (grayscale images) from http://www.cvlibs.net/datasets/kitti/eval_odometry.php
- Execute the following command. Change `KITTIX.yaml` to KITTI00-02.yaml, KITTI03.yaml or KITTI04-12.yaml for sequence 0 to 2, 3, and 4 to 12 respectively. Change `PATH_TO_DATASET_FOLDER` to the uncompressed dataset folder. Change `SEQUENCE_NUMBER` to 00, 01, 02, ..., 11. By providing the last argument `PATH_TO_MASKS`, dynamic objects are detected with Mask R-CNN.
```
./Examples/Monocular/mono_kitti Vocabulary/ORBvoc.txt Examples/Monocular/KITTIX.yaml PATH_TO_DATASET_FOLDER/dataset/sequences/SEQUENCE_NUMBER (PATH_TO_MASKS)
```
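As referenced in the code-changes section above, frame construction filters the extracted ORB keypoints against the combined mask. Below is a hedged, self-contained sketch of that idea; the function name `FilterKeypointsByMask` and the mask convention (0 = discard) are hypothetical stand-ins for what `Frame.cc` does with its keypoints and descriptors.

```cpp
// Hypothetical illustration of the Frame.cc keypoint filtering, assuming a
// CV_8U mask in which 0 marks pixels to discard (dynamic regions).
#include <vector>
#include <opencv2/core.hpp>

void FilterKeypointsByMask(std::vector<cv::KeyPoint>& keys,
                           cv::Mat& descriptors,   // one row per keypoint
                           const cv::Mat& mask)    // CV_8U, 0 = discard
{
    std::vector<cv::KeyPoint> keptKeys;
    cv::Mat keptDesc;
    for (size_t i = 0; i < keys.size(); ++i) {
        const cv::Point2f& p = keys[i].pt;
        // Keep the keypoint only if it falls on a static (non-zero) mask pixel.
        if (mask.at<uchar>(cvRound(p.y), cvRound(p.x)) != 0) {
            keptKeys.push_back(keys[i]);
            keptDesc.push_back(descriptors.row(static_cast<int>(i)));
        }
    }
    keys = std::move(keptKeys);
    descriptors = keptDesc;
}
```

Keypoints and descriptor rows are kept in lockstep so that later matching stages still see a consistent frame.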
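The stereo trick from the code-changes section, one network pass over the concatenated pair, can be sketched as follows. The `segment` callback stands in for the repository's C++-to-Python Mask R-CNN bridge and is assumed, for illustration, to return a mask at the same resolution as its input.

```cpp
// Hypothetical sketch of segmenting a stereo pair in one network pass.
#include <functional>
#include <opencv2/core.hpp>

void SegmentStereoPair(const cv::Mat& left, const cv::Mat& right,
                       const std::function<cv::Mat(const cv::Mat&)>& segment,
                       cv::Mat& leftMask, cv::Mat& rightMask)
{
    // Concatenate left|right side by side; the network resizes its input to a
    // fixed resolution anyway, so one pass costs roughly the same as before.
    cv::Mat pair;
    cv::hconcat(left, right, pair);

    const cv::Mat mask = segment(pair);   // one network pass for both images

    // Split the returned mask back into its two halves (assumes the mask has
    // the same width as the concatenated input).
    leftMask  = mask(cv::Rect(0, 0, left.cols, mask.rows)).clone();
    rightMask = mask(cv::Rect(left.cols, 0, right.cols, mask.rows)).clone();
}
```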
## Citation

If you use DynaSLAM in an academic work, please cite:

```
@article{bescos2018dynaslam,
  title={{DynaSLAM}: Tracking, Mapping and Inpainting in Dynamic Environments},
  author={Bescos, Berta and F\'acil, Jos\'e M. and Civera, Javier and Neira, Jos\'e},
  journal={IEEE RA-L},
  year={2018}
}
```

## Acknowledgements
Our code builds on [ORB-SLAM2](https://github.com/raulmur/ORB_SLAM2).