==============
Takeshi Ishita
==============
Work Experience
===============
`CyberAgent AI Lab `__
-------------------------------------------------------
July 2024 -
`Tier IV, Inc. `__
---------------------------------------------------------------------------
R&D of vehicle localization
| July 2020 - June 2024
`Mitou Program `__
---------------------------------------------------------------------------
Development of a Visual SLAM framework
| April 2019 - March 2020
`DeNA Co., Ltd. `__
--------------------------------------
| April 2018 - March 2020
| Part-time job
`Cookpad Inc. `__
----------------------------------------------
Design and implementation of machine learning methods for ingredient recognition from food images.
Patent
~~~~~~
The model I proposed is granted as a patent
`#6306770 `__.
| Dec 2016 - Jul 2017
| Part-time job
`Usagee Inc. `__
--------------------------------------
- Research and development of Machine Learinng & Computer Vision methods
- Providing effective solutions to customers
| May 2014 - Jan 2017
| Part-time job
Education
=========
| National Institute of Technology, Tokyo College, Advanced Course
| April 2017 - March 2019
| Student exchange with Metropolia University of Applied Sciences
| August 2017 - December 2017
| National Institute of Technology, Tokyo College
| April 2012 - March 2017
My works
========
My works are available on `GitHub `__
1. `Tadataka (under development) `__
-------------------------------------------------------------------------------
This project aims to develop a Visual SLAM framework that is flexible and simple to use.
Currently implemented algorithms:
DVO (Dense Visual Odometry) [#Steinbrucker_et_al_2011]_ [#Kerl_et_al_2013]_
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Estimating camera motion from RGB-D video sequence (`YouTube video `__).
.. raw:: html
Feature Based Visual Odometry
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Estimating camera motion and 3D structure from a single RGB camera (`YouTube video `__).
.. raw:: html
2. `RoadDamageDetector `__
-------------------------------------------------------------------------------
.. image:: images/road-damage-1.png
:width: 800
| Road damage detector based on SSD (Single Shot Multibox Detector).
| The detailed explanation is at `my Qiita blog page (in Japanese) `__.
| Trained models are published along with the source code.
What I did
~~~~~~~~~~
- Trained SSD(VGG16) on the RoadDamageDataset provided by Maeda et al. (2018) [#Maeda_et_al_2018]_
- Replaced VGG16 with ResNet-101 and evaluated the performance
3. `PCANet `__
-------------------------------------------------------
| PCANet is a neural network for image classification that trains its weights with PCA [#Chan_et_al_2015]_.
| PCANet requires histogram calculation in the pooling layer. Although there was no GPU support for histogram calculation in CuPy.
| I implemented the histogram calculation in CUDA and sent a pull request, which has been merged into the CuPy repository.
`#298 `__
`Ensemble PCANet `__
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
| PCANet can train quickly. On the other hand, its representation ability is not strong.
| I combined PCANet with Bagging and succeeded to increase the representation ability while keeping the training speed.
| This idea is proposed to `JSAI 2017 `__.
4. `SCW `__
-------------------------------------------------
| Implementation of SCW (Soft Confidence-Weighted Learning) [#Wang_et_al_2012]_.
| SCW is an online supervised learning algorithm which utilizes all the four salient properties:
- Large margin training
- Confidence weighting
- Capability to handle non-separable data
- Adaptive margin
Blog
====
- `The Zen of Python `__
- `Kalman Filter `__
- `Tomasi-Kanade 3D reconstruction `__
Article
=======
- `日経ソフトウエア 2017年8月号 「Pythonで機械学習」 `__
- `日経ソフトウエア 2020年5月号 「撮影した物体を3次元データで復元」 `__
- Journal of the Japan society of photogrammetry and remote sensing, November 2023, "Utilization of SLAM for Autonomous Driving in Urban Areas"
Presentations
=============
- `Sparse Bundle Adjustment `__
- `3D rotation representation in so(3) `__
- `Histogram calculation in CuPy `__
References
==========
.. [#Steinbrucker_et_al_2011] Steinbrücker Frank, Jürgen Sturm, and Daniel Cremers. "Real-time visual odometry from dense RGB-D images." Computer Vision Workshops (ICCV Workshops), 2011 IEEE International Conference on. IEEE, 2011.
.. [#Kerl_et_al_2013] Kerl, Christian, Jürgen Sturm, and Daniel Cremers. "Robust odometry estimation for RGB-D cameras." Robotics and Automation (ICRA), 2013 IEEE International Conference on. IEEE, 2013.
.. [#Maeda_et_al_2018] Maeda, Hiroya, et al. "Road damage detection using deep neural networks with images captured through a smartphone." arXiv preprint arXiv:1801.09454 (2018).
.. [#Chan_et_al_2015] Chan, Tsung-Han, et al. "PCANet: A simple deep learning baseline for image classification?." IEEE transactions on image processing 24.12 (2015): 5017-5032.
.. [#Wang_et_al_2012] Wang, Jialei, Peilin Zhao, and Steven CH Hoi. "Exact soft confidence-weighted learning." arXiv preprint arXiv:1206.4612 (2012).