Seito Kasai

MS Student, Computer Science and Electronic Engineering

About Me

My name is Seito, and I study computer vision and machine learning. I’m currently an MS student working on vision and language at the Aoki Media Sensing Lab at Keio University, and I live in Tokyo, Japan. Although I used to code mainly for my research, my recent interest is in creating software that has an impact on many people.

I am currently looking for engineering or research internships, especially ones in which I can apply and improve my coding skills. Please don’t hesitate to contact me!

Programming Languages: Python, C/C++, R, MATLAB, Java

Skills: Machine Learning, Computer Vision, Natural Language Processing, Data Science



Mar 2019 - present

This organization consists of members from various universities in Japan, all studying computer vision. Over 100 members participate in the projects that we hold at cvpaper.challenge, such as the CVPR 2018 survey.

To follow the latest research, I participate as a survey member for top conferences in computer vision and natural language processing. Discussing computer vision with fellow researchers helps me a lot with my research and brainstorming.

ActivityNet Large-Scale Activity Recognition Challenge (Dense-Captioning Events in Videos)

Apr 2019 - Jun 2019

Most natural videos contain numerous events. For example, in a video of a 'man playing a piano', the video might also contain another 'man dancing' or 'a crowd clapping'. This challenge studies the task of dense-captioning events, which involves both detecting and describing events in a video.

I will participate in this competition at the CVPR 2019 workshop. Details will be added after the challenge.


Feb 2019 - Mar 2019

A lecture on AI programming using PyTorch, given to students in the Keio University MBA program.

I conducted the whole lecture, which included writing the code, preparing the slides and presentation, and teaching the students. I explained the capabilities and limits of AI to beginners as accurately as possible, without relying on advanced mathematics.

Work Experience

National Institute of Advanced Industrial Science and Technology (AIST)

Research Assistant, Computer Vision Research Group

Apr 2019 - present

A Japanese research facility aiming to apply scientific knowledge and technology to meet socio-economic needs.

I’m conducting research in computer vision. My aim at AIST is to publish my work on video recognition using deep learning.

Pre-Startup in Keio University MBA

Software Engineer, Teacher

Jan 2019 - present

A pre-startup run by just four people. We are building the business from scratch and have produced two MVPs for our commercial products.

I worked on establishing a startup from scratch, which greatly helped me develop my skills for working with people from different specialties. My work includes a course on Deep Learning for Beginners conducted entirely in Japanese, including code written in Google Colaboratory. Our next project is iOS development of a social networking application. Details are to be updated.


Keio University, Faculty of Science and Technology

BS in Electronic Engineering

2015 - 2019

Courses taken include Image Processing, Neural Networks, Electromagnetic Physics, Information Theory, Electronic Circuits, and Semiconductors. My graduation thesis is on vision and language. Outside of my studies, I was captain of the tennis club, which strengthened my management, problem-solving, and leadership skills.

Graduate School in Keio University, Faculty of Science and Technology

MS in Computer Science

2019 - present

Mainly taking courses related to computer science, such as compilers, networks, algorithms and data structures, and software engineering.



  • (under review)

International Conferences

  • N. Akimoto, S. Kasai, M. Hayashi, Y. Aoki, 360-Degree Image Completion by Two-Stage Conditional GANs, IEEE International Conference on Image Processing (ICIP), Sep 2019.
  • K. Yoshida, K. Okayasu, K. Tsubura, M. Minoguchi, T. Ikeya, A. Nakamura, S. Kasai, H. Kataoka, Which generates better jokes, hand-crafted features or deep features, CVPR 2019 Language & Vision Workshop, Jun 2019.

Domestic Conferences

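  • 笠井誠斗, 林昌希, 秋本直郁, 青木義満, Facial Image Attribute Transformation Using Natural Language Instructions (自然言語指示文を用いた顔画像属性変換), 動的画像処理実利用化ワークショップ (DIA), OS5-3, Mar 2019.
  • 笠井誠斗, 中込優, 黒澤郁音, 矢代達希, 西川真史, 相澤宏旭, 福井宏, [Invited Short Survey] A Coexistence Map of Computer Vision and Natural Language Processing (コンピュータビジョンと自然言語処理の共存マップ), IEICE (電子情報通信学会) Technical Committee on Pattern Recognition and Media Understanding (PRMU), Oct 2018.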
  • S. Kasai, Y. Ishikawa, T. Wakamiya, K. Hara, H. Kataoka, Exploring the Best Model for Dense Captioning Events in Videos, The 22nd Meeting on Image Recognition and Understanding (MIRU), Aug 2019.

A Little More About Me

I lived in Miami, FL, in the U.S. for about six years when I was young, so I’m comfortable communicating in both Japanese and English. Alongside my interests in machine learning research and software engineering, some of my other interests and hobbies are:

  • Playing Tennis (I’ve played for more than 15 years!)
  • Travelling