Short version:
Is there something similar to PHP parser tokens in other interpreted languages (Python, Ruby, etc.) and compiled languages (the C family, Java, etc.)?
Long Version:
On the CPP Rocks website there is an article showing a visual comparison of language complexity by means of a graph that breaks down the various building blocks of a language into categories. The graph for Coffeescript looks like this:
I wanted to make such a graph for PHP using the parser tokens as a starting point (to make sure I don't miss anything and because I'm lazy). I was wondering if there is something similar to these tokens in other in other interpreted languages (Python, Ruby, etc.) and compiled languages (the C family, Java, etc.).
Findings thus far:
- Java: the Chapters of the Language Specification describing Syntax and Lexical Structure seem a good place to start.
- Python: Chapter 2 of the manual does describe Python's lexical structure.
- Ruby: the token list for Ruby.