Sitemap

A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.

Pages

Posts

Future Blog Post

less than 1 minute read

Published:

This post will show up by default. To disable scheduling of future posts, edit _config.yml and set future: false.

Blog Post number 4

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 3

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 2

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 1

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Portfolio

BigGorilla

BigGorilla is a set of open-source components for data integration and preparation, started in 2016 as a joint project between Recruit and the University of Wisconsin-Madison. It documents both existing technologies and our original technologies for solving data integration and preparation problems. I created a couple of BigGorilla components and evangelized them. I also applied these technologies at 8 companies within Recruit and showed that BigGorilla is effective across the company’s diverse range of businesses, for tasks such as extracting store names (or person names and location information) from unstructured data and merging lists from multiple data sources. For example, with BigGorilla we obtained 98.9% accuracy on the task of de-duplicating approximately 10,000 store names (here is the press release from that time).
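
As a rough illustration of what de-duplicating a list of store names involves, here is a minimal Python sketch. It does not use BigGorilla's components or API; it swaps in plain string similarity from the standard library, and the function names, the 0.85 threshold, and the sample store names are all assumptions made for this example.

```python
from difflib import SequenceMatcher


def normalize(name: str) -> str:
    """Lowercase and collapse whitespace so trivial variants compare equal."""
    return " ".join(name.lower().split())


def similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1] between two normalized names."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def deduplicate(names, threshold=0.85):
    """Greedy grouping: a name joins the first group whose first member
    is at least `threshold` similar; otherwise it starts a new group.
    The threshold is a hypothetical value, not a tuned one."""
    groups = []
    for name in names:
        for group in groups:
            if similarity(name, group[0]) >= threshold:
                group.append(name)
                break
        else:
            groups.append([name])
    return groups


# Hypothetical sample data, not taken from the actual project.
stores = ["Sakura Cafe", "sakura  cafe", "Sakura Café",
          "Blue Ocean Sushi", "Green Hill Bakery"]
groups = deduplicate(stores)
for group in groups:
    print(group)
```

A greedy pass like this compares each name only against one representative per group, which keeps the sketch short; a production pipeline would typically add blocking and a learned matcher.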

Spysee2

Spysee2 is one of the biggest Japanese search engines for people and their relationships on the web. I extracted people’s social networks from the web using heuristic techniques, and developed contemporary machine learning and NLP-based algorithms for extracting various types of information about people (bio-data, profile picture, social network, related web pages, etc.) from unstructured and noisy web data. I also developed a solution to the name disambiguation problem in short texts using a probabilistic classifier. (Here are the press release, the article, and the blog post from that time.)

Pedestrian Congestion Visualization

I developed a pedestrian congestion visualization algorithm using users’ GPS logs and OpenStreetMap road information. It enables companies to analyze the times and days when an area of interest will be crowded, and to check on a map how congested each road is.
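
The aggregation step behind this kind of analysis can be sketched as follows. This is a simplified illustration, not the actual algorithm: it assumes each GPS log point has already been matched to a road segment (the `road_id` values are hypothetical, and the matching against OpenStreetMap data is not shown), and it simply counts points per road, weekday, and hour.

```python
from collections import Counter
from datetime import datetime


def congestion_by_slot(gps_log):
    """Count log points per (road_id, weekday, hour).

    `gps_log` is an iterable of (road_id, ISO 8601 timestamp) pairs.
    Weekday follows datetime.weekday(): 0 is Monday, 6 is Sunday.
    """
    counts = Counter()
    for road_id, timestamp in gps_log:
        t = datetime.fromisoformat(timestamp)
        counts[(road_id, t.weekday(), t.hour)] += 1
    return counts


def busiest_slot(counts, road_id):
    """Return the (road_id, weekday, hour) key with the most points, or None."""
    slots = {key: n for key, n in counts.items() if key[0] == road_id}
    return max(slots, key=slots.get) if slots else None


# Hypothetical, already map-matched log points.
log = [
    ("road_a", "2024-01-05T18:05:00"),
    ("road_a", "2024-01-05T18:40:00"),
    ("road_a", "2024-01-06T09:10:00"),
    ("road_b", "2024-01-05T18:15:00"),
]
counts = congestion_by_slot(log)
print(busiest_slot(counts, "road_a"))
```

The resulting counts could then be drawn on a map by coloring each road segment according to its count for the selected day and hour.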

Smartphone Application’s Competitiveness Visualization

I developed an algorithm and approach for visualizing the competitiveness of smartphone apps, which shows how competitive an app is and how long it is likely to survive on smartphones, based on how long users keep their applications. (This work was a collaboration with Fuller, which operates App Ape, the biggest mobile app market analytics service in Japan. This technology was recently released as a real application; here is the press release.)
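
One simple way to turn "how long users keep an app" into something that can be plotted is a retention curve. The sketch below is an assumption-laden illustration rather than the method used in this work: the function name and the sample durations are hypothetical, and it treats every duration as complete, ignoring apps that are still installed.

```python
def retention_curve(possession_days, horizon):
    """Fraction of installs kept for more than `day` days, for day 0..horizon.

    `possession_days` lists, per install, how many days the user kept the app.
    """
    total = len(possession_days)
    if total == 0:
        return []
    return [
        sum(1 for d in possession_days if d > day) / total
        for day in range(horizon + 1)
    ]


# Hypothetical durations (in days) for two apps.
app_a = [1, 3, 3, 10]
app_b = [7, 8, 9, 10]
print(retention_curve(app_a, 3))
print(retention_curve(app_b, 3))
```

Plotting the curves of several apps on the same axes gives a direct visual comparison: a curve that stays higher for longer indicates an app that users keep around.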

Publications

Talks

Teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.