Jason S. Kessler

Jason Kessler

Bio

Jason Kessler is an Applied Scientist at Amazon in Seattle, WA. Previously, he was a lead data scientist at CDK Global, where he analyzed language use and consumer behavior in the online auto-shopping ecosystem. Before CDK, Jason was the founding data scientist at PlaceIQ and worked as a research scientist for JD Power and Associates. He has published peer-reviewed papers on algorithms and corpora for sentiment and belief analysis, and has sat on program committees and reviewed for several AI and NLP conferences. Most recently, he has conducted research on identifying persuasive and influential language and the visualization of differing corpora.

Mistaken Idenity. If you've ever wondered what it's like to have the name Jason Kessler, check out this December 2017 New Yorker article.

Video of my talk on Scattertext

Publications:

Vin Sachidananda, Jason Scott Kessler and Yi-An Lai. Efficient Domain Adaptation of Language Models via Adaptive Tokenization. SustaiNLP: Workshop on Simple and Efficient Natural Language Processing at EMNLP. 2021.
- PDF via arXiv
Jason S. Kessler. Scattertext: a Browser-Based Tool for Visualizing how Corpora Differ. ACL System Demonstrations. Vancouver, BC. 2017.
- PDF via arXiv | Code | Poster | Video of PyData Talk
Jason S. Kessler and Nicolas Nicolov. The JDPA Sentiment Corpus for the Automotive Domain. The Handbook of Linguistic Annotation. Berlin. Nancy Ide and James Pustejovsky, eds., Springer. 2017.
- PDF | BibTeX
Jason S. Kessler, Miriam Eckert, Lyndsay Clark and Nicolas Nicolov. The ICWSM 2010 JDPA Sentiment Corpus for the Automotive Domain. 4th International AAAI Conference on Weblogs and Social Media Data Challenge Workshop. Washington, D.C. 2010.
Jason S. Kessler and Nicolas Nicolov. Targeting Sentiment Expressions through Supervised Ranking of Linguistic Configurations. 3rd International AAAI Conference on Weblogs and Social Media (ICWSM 2009). San Jose, CA. 2009.
- Video | PDF | Slides | BibTeX
Jason S. Kessler. Polling the Blogosphere: a rule-based approach to belief classification. International Conference on Weblogs and Social Media (ICWSM 2008), Seattle, WA. 2008.
- Video | PDF | Slides | BibTeX
- Download the veridicality lexicon here.
Theresa Wilson, Paul Hoffmann, Swapna Somasundaran, Jason Kessler, Janyce Wiebe, Yejin Choi, Claire Cardie, Ellen Riloff and Siddharth Patwardhan. OpinionFinder: A System for Subjectivity Analysis. Human Language Technology Conference/Conference on Empirical Methods in Natural Language Processing (HLT-EMNLP 2005). Austin, TX. 2005.

Blog posts:

How to write a persuasive ICLR review: visualizing the ICLR 2018 open review dataset. Medium. 2018.

Talks:

Classy words: formulas for finding category-associated terms. Advanced Topics in Machine Learing Discussion Group. Seattle, WA. 2018
- GitHub Repo | Class Association Scores Notebook | Notebook on Toxic Comment Exploration
Lexicon Mining, Language Visualization and Semiotic Squares in Python. Puget Sound Python Meetup. Seattle, WA. 2018
- GitHub Repo | Introductory Slides | Notebook on Semiotic Squares for Facebook Engagemet
Lexicon Mining for Semiotic Squares: Exploding Binary Classification. Data Day Texas. Austin, TX. 2018
- Slides | Jupyter Notebook
Understanding Cultures and Perspectives through Text and Emoji Visualization. Data Day Seattle. Seattle, WA. 2017
- Notebooks
Using Scattertext and the Python NLP Ecosystem for Text Visualization. PyData. Seattle, WA. 2017
- Video | Slides and Code
Discovering Persuasive Language through Observing Customer Behavior. Predictive Analytics World. San Francisco, CA. 2017.
- Slides
Scattertext: A Tool for Visualizing Differences in Language. Data Day Texas. Austin, TX. 2017.
- Slides | Code
Turning Unstructured Content into Kernels of Ideas. Data Day Seattle. Seattle, WA. 2016.
- Video Interview | Slides | Code
From Sentiment to Persuasion Analysis: A Look at Idea Generation Tools. Talk presented at Texas NLP Day. Austin, TX. 2016.
- Slides
From Sentiment to Persuasion Analysis. Talk presented at The Sentiment Analysis Symposium. New York, NY. 2015.
- Video | Slides

Unpublished Papers:

Miriam Eckert, Lyndsie Clark and Jason Kessler. Structural Sentiment and Entity Annotation Guidelines. J.D. Power and Associates. Boulder CO. 2009.
Jason Kessler. A Corpus for Comparing Belief Classification Systems. Research conducted at Indiana University. Bloomington, IN. 2008.
Ralf Frieser and Jason Kessler. HelpAHobo: Panhandling in the 21st century. Research conducted at Indiana University. Bloomington, IN. 2008.

Popular and Trade Press

Scattertext
- MSDN Magazine. Cognitive Services - Improving LUIS Intent Classifications. July 2018.
- INFORMS. The important role visualization plays in presenting data in a powerful and credible way. December 2017.
- O'Reilly. Four short links: 9 June 2017. Text Analysis, Specific Phones, AI Copyright, and Minecraft for R. 2017-06-09.
- Global Investigative Journalism Network. Top Ten #ddj: This Week's Top Data Journalism. 2017-06-14.
- CapTech Consulting. PyData Seattle 2017 - What's New?. 2017-08-15.
Demographic-Specific Language of Closers: Reviews at CDK Global
- Wards Auto. Different Words Matter to Different Car Shoppers. 2017-06-02.
- Press Release. New research reveals different words seal the deal with different types of car buyers. 2017-03-28.
- Automotive News. Leave 'warranty' out of the service contract pitch. 2017-03-29.
- Auto Remarketing. Car description words most enticing to key demographics. 2017-03-29.
- Jalopnik. Dealers Who Stop Using The Word 'Warranty' Will Still Sell You Bullsh*t Extras. 2017-03-29.
- Canadian Auto Dealer. Language Matters: make or break a sale. 2017-03-29.
- DBusiness Magazine. Report: Different Words Seal the Deal with Automotive Buyers. 2017-03-29.
Language of Closers: Reviews at CDK Global
- (Recommended) Wards Auto. 'This Car Is Amazing!' So What?. 2016-10-20.
- CDK Global. How to Use Words to Convert Customers. 2016-10-11.
- Press release. CDK Global Identifies the Words that Matter Most in Dealership Reviews. 2016-10-13.
- Auto Remarketing. Dealers, choose your reviews carefully. 2016-10-13.
- Glencen. Study: These Top 5 Words Help To Convert At Dealerships. 2016-10-13.
- Autotalk. Closing more leads with right words. 2016-10-13.
- Press release. What if a Word Could Help You Make a Sale?. 2016-12-15.
Language of Closers: Emails at CDK Global
- CDK Global's white paper and press release 2016-06-06.
- DrivingSales' press release. DrivingSales Announces the Most Valuable Insight of 2016 at Presidents Club in Miami. 2016-06-13.
- Auto Dealer Today. Top Closers Avoid Industry Jargon, CDK Global Finds. 2016-06-09.
- Auto Remarketing. Top Closers Choose Their Words Carefully. 2016-06-09.
- The Auto Channel. New Study Reveals Leading Language Car Dealers Use to Close Deals. 2016-06-09.
- Autotalk. Can't close? You're speaking the wrong language. 2016-06-10.
- Behind the Wheel. Clear language found to be a car salespersons best friend. 2016-06-12.
- Columbus Dispatch. For good salespeople, the magic word is 'provide'. 2016-06-15.
- Dealership Daily. Study reveals 'leading language' dealers use to close deals. 2016-06-21.
- DrivingSales News. CDK Global's 'Language of Closers' Wins Most Valuable Insight Competition. 2016-07-22.
- Inside Lane. Do you speak the language of closers - or dozers - to email shoppers? 2016-07-28.

Data and Software

Scattertext, a Python term importance and text visualization package.
Age from Name, a Python package to estimate a person's age and generation from their name and gender.
You may request the JDPA Sentiment Corpus (used in Kessler and Nicolov [2009] and Kessler et al. [2010]) through the official website.
The lexicon of terms and multi-word units organized by part-of-speech, veridicality (including facticity) can be found here. These terms, when selected for by syntactic templates outlined in the ICWSM 2008 paper can be used to accurately predict the veridicity of an embedded, finite clause. This an important step in recognizing textual entailment and paraphrase.

Industrial Activities:

2018-present: Applied Scientist, Amazon. Seattle.
2013-2018: Lead (as of Sept. 2016) data scientist at CDK Global, Seattle.
2010-2012: Founding data scientist at PlaceIQ, NYC.
2010-2013: Adviser to Votizen, Mountain View, CA. (acquired by Causes)
2008-2010:: Scientist at J.D. Power and Associates, Boulder, CO. Working with Dr. Nicolas Nicolov, I helped to guide the construction of a corpus for structural sentiment analysis and researched ways of automatically annotating structural sentiment relations. Please see our ICWSM 200 and, 2010 papers, as well as our recent Handbook of Linguistic Annotation chapter for details on this effort.
Summer of 2009:: Research Intern at Palo Alto Research Center (formally Xerox PARC). I worked on a project in sentiment analysis as a member of the Computing Sciences Lab.

Service:

Reviewer, RANLP 2011
Program committee member, ICWSM 2011
Session chair, Microblogging, ICWSM 2010
Reviewer, ACL System Demos 2010
Program committee member, SocInfo 2010
Program committee member, IEEE SocCom-2010 Workshop on Finding Synergies Between Texts and Networks, 2010
Reviewer, Journal of Natural Language Engineering, 2010
Reviewer, CICLing 2010
Program committee member, ICWSM 2010
Reviewer, International Journal of Computers and Applications, 2009
Reviewer, RANLP 2009
Program committee member, ICWSM 2009
External reviewer, MICAI 2008
External reviewer, CITII 2008

Contact:

E-mail: [first name].[last name]@gmail.com
Twitter: @jasonkessler
Linkedin: here
Github: here

Academic Activities:

2005-2010:: Ph.D. candidate in Computer Science at Indiana University, Bloomington. My research focused on applying statistical natural language processing techniques for sentiment analysis. Specifically, I explore the compositional way that evaluations is expressed toward discourse entities, a topic my collaborators and I call "structural sentiment."
2001-2005:: In 2005, I received a B.S. in Computer Science from the University of Pittsburgh. While an undergrad, I worked on OpinionFinder, a publicly available system for sentence and expression-level subjectivity analysis.

Tutorials:

My colleague Nicolas Nicolov put together an excellent, engineering-orientied tutotorial on machine learning. It touches on some projects we've worked on that haven't made this web site.

Distractions:

A map of Rome showing the location of each Borromini building in the city.
Richard Serra on Charlie Rose (14 December 2001)
The Linguist's Search Engine.
The Gallery of "Misused" Quotation Marks
SIL's glossary of linguistic terms.
My Erdös number is less than or equal to 5 (via Claire Cardie ~ Raymond J. Mooney ~ Wolfgang Maass ~ Andras Hajnal)
A slightly outdated guide by John "Verm" Sherman to the Hueco Scale for grading boulder problems.

This is a personal web site, produced on my own time and solely reflecting my personal opinions. Statements on this site do not represent the views or policies of my employer, past or present, or any other organization with which I may be affiliated.