============== Takeshi Ishita ============== Work Experience =============== `CyberAgent AI Lab `__ ------------------------------------------------------- July 2024 - `Tier IV, Inc. `__ --------------------------------------------------------------------------- R&D of vehicle localization | July 2020 - June 2024 `Mitou Program `__ --------------------------------------------------------------------------- Development of a Visual SLAM framework | April 2019 - March 2020 `DeNA Co., Ltd. `__ -------------------------------------- | April 2018 - March 2020 | Part-time job `Cookpad Inc. `__ ---------------------------------------------- Design and implementation of machine learning methods for ingredient recognition from food images. Patent ~~~~~~ The model I proposed is granted as a patent `#6306770 `__. | Dec 2016 - Jul 2017 | Part-time job `Usagee Inc. `__ -------------------------------------- - Research and development of Machine Learinng & Computer Vision methods - Providing effective solutions to customers | May 2014 - Jan 2017 | Part-time job Education ========= | National Institute of Technology, Tokyo College, Advanced Course | April 2017 - March 2019 | Student exchange with Metropolia University of Applied Sciences | August 2017 - December 2017 | National Institute of Technology, Tokyo College | April 2012 - March 2017 My works ======== My works are available on `GitHub `__ 1. `Tadataka (under development) `__ ------------------------------------------------------------------------------- This project aims to develop a Visual SLAM framework that is flexible and simple to use. Currently implemented algorithms: DVO (Dense Visual Odometry) [#Steinbrucker_et_al_2011]_ [#Kerl_et_al_2013]_ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Estimating camera motion from RGB-D video sequence (`YouTube video `__). .. raw:: html Feature Based Visual Odometry ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Estimating camera motion and 3D structure from a single RGB camera (`YouTube video `__). .. raw:: html 2. `RoadDamageDetector `__ ------------------------------------------------------------------------------- .. image:: images/road-damage-1.png :width: 800 | Road damage detector based on SSD (Single Shot Multibox Detector). | The detailed explanation is at `my Qiita blog page (in Japanese) `__. | Trained models are published along with the source code. What I did ~~~~~~~~~~ - Trained SSD(VGG16) on the RoadDamageDataset provided by Maeda et al. (2018) [#Maeda_et_al_2018]_ - Replaced VGG16 with ResNet-101 and evaluated the performance 3. `PCANet `__ ------------------------------------------------------- | PCANet is a neural network for image classification that trains its weights with PCA [#Chan_et_al_2015]_. | PCANet requires histogram calculation in the pooling layer. Although there was no GPU support for histogram calculation in CuPy. | I implemented the histogram calculation in CUDA and sent a pull request, which has been merged into the CuPy repository. `#298 `__ `Ensemble PCANet `__ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | PCANet can train quickly. On the other hand, its representation ability is not strong. | I combined PCANet with Bagging and succeeded to increase the representation ability while keeping the training speed. | This idea is proposed to `JSAI 2017 `__. 4. `SCW `__ ------------------------------------------------- | Implementation of SCW (Soft Confidence-Weighted Learning) [#Wang_et_al_2012]_. | SCW is an online supervised learning algorithm which utilizes all the four salient properties: - Large margin training - Confidence weighting - Capability to handle non-separable data - Adaptive margin Blog ==== - `The Zen of Python `__ - `Kalman Filter `__ - `Tomasi-Kanade 3D reconstruction `__ Article ======= - `日経ソフトウエア 2017年8月号「Pythonで機械学習」 `__ - `日経ソフトウエア 2020年5月号「撮影した物体を3次元データで復元」 `__ - Journal of the Japan society of photogrammetry and remote sensing, November 2023, "Utilization of SLAM for Autonomous Driving in Urban Areas" Presentations ============= - `Sparse Bundle Adjustment `__ - `3D rotation representation in so(3) `__ - `Histogram calculation in CuPy `__ References ========== .. [#Steinbrucker_et_al_2011] Steinbrücker Frank, Jürgen Sturm, and Daniel Cremers. "Real-time visual odometry from dense RGB-D images." Computer Vision Workshops (ICCV Workshops), 2011 IEEE International Conference on. IEEE, 2011. .. [#Kerl_et_al_2013] Kerl, Christian, Jürgen Sturm, and Daniel Cremers. "Robust odometry estimation for RGB-D cameras." Robotics and Automation (ICRA), 2013 IEEE International Conference on. IEEE, 2013. .. [#Maeda_et_al_2018] Maeda, Hiroya, et al. "Road damage detection using deep neural networks with images captured through a smartphone." arXiv preprint arXiv:1801.09454 (2018). .. [#Chan_et_al_2015] Chan, Tsung-Han, et al. "PCANet: A simple deep learning baseline for image classification?." IEEE transactions on image processing 24.12 (2015): 5017-5032. .. [#Wang_et_al_2012] Wang, Jialei, Peilin Zhao, and Steven CH Hoi. "Exact soft confidence-weighted learning." arXiv preprint arXiv:1206.4612 (2012).