このエントリーをはてなブックマークに追加

Nov

10

WebHack#40 Introduction to Japanese tokenizers

Registration info

Free Admission

Free

FCFS
69/80

Description

WebHack

If you believe building relationships with people who are in the same industry helps you know yourself and achieve what you couldn't do alone, this month's event is for you!

If you value continuously learning, read WebHack Monthly and follow @WebHackMeetup to keep receiving valuable contents & earlier updates 👨‍🎓

Event Url

https://indeed.zoom.us/s/95236959786

PASSWORD: bekind

To gain the best experience, please

  • Join on PC (not on mobile phones)
  • Use Chrome browser (not Safari or Firefox)
  • Start at 19:30 in Tokyo time (GMT+9)

Feel free to write real-time collaborative notes at real-time event notes!

Language

English

Description

In this talk, Wanasit will share what he learn about Japanese NLP after trying to build a Japanese tokenizer from scratch.

Doing Natural Language Processing (NLP) or text processing for Japanese has many challenges. One of the most basic and obvious problems is tokenization (aka. splitting text into a list of words).

Unlike English that the words typically separated by space, splitting Japanese text (e.g. 日本語の自然言語処理を行うには…) doesn’t have such a rule-of-thumb. It requires the tokenizers and NLP tools to be a lot more sophisticated.

Speaker

Wanasit Tanakitrungruang, Engineering Manager, Indeed

Wanasit works an Engineering Manager in Search Quality team at Indeed. His team is focused on improving NLP for job descriptions and helps people get jobs by making Indeed's search better.

He also works on language and text processing projects on his free time (and this talk is related to one of his personal projects).

Schedule

Time Session
19:30 - 19:35 Opening
19:35 - 20:00 Talk
20:00 - 20:15 Q & A
20:15 - 20:30 Networking
20:30 Good night!

Venue

This event will be live streamed, and the link will be sent to attendees three days before. Ensure to register in order to receive the link, please.

Media View all Media

If you add event media, up to 3 items will be shown here.

Feed

WesleyHuang

WesleyHuang published WebHack#40 Introduction to Japanese tokenizers.

10/23/2020 11:11

WebHack#40 Introduction to Japanese tokenizers has been published!

Group

WebHack Meetup

Number of events 42

Members 1251

Ended

2020/11/10(Tue)

19:30
21:00

You cannot RSVP if you are already participating in another event at the same date.

Registration Period
2020/10/19(Mon) 00:00 〜
2020/11/10(Tue) 20:30

Location

password: bekind

https://indeed.zoom.us/s/95236959786

Attendees(69)

HARUYAMA Seigo

HARUYAMA Seigo

I joined WebHack#40 Introduction to Japanese tokenizers!

NatsuZeref

NatsuZeref

I joined WebHack#40 Introduction to Japanese tokenizers!

Eiji Shinohara

Eiji Shinohara

WebHack#40 Introduction to Japanese tokenizers に参加を申し込みました!

t2hnd

t2hnd

WebHack#40 Introduction to Japanese tokenizersに参加を申し込みました!

LeonStrife

LeonStrife

I joined WebHack#40 Introduction to Japanese tokenizers!

Rajanikant Deshmukh

Rajanikant Deshmukh

I joined WebHack#40 Introduction to Japanese tokenizers!

kylechen

kylechen

I joined WebHack#40 Introduction to Japanese tokenizers!

Shreyansh Pandey

Shreyansh Pandey

I joined WebHack#40 Introduction to Japanese tokenizers!

rnpk

rnpk

WebHack#40 Introduction to Japanese tokenizers に参加を申し込みました!

sadasad

sadasad

I joined WebHack#40 Introduction to Japanese tokenizers!

Attendees (69)

Canceled (2)