MyCompiler: Custom Lexer and Tokenizer

Overview

This project implements a lexical analyzer (lexer) for a custom programming language. The lexer uses Non-Deterministic Finite Automata (NFA) and Deterministic Finite Automata (DFA) to recognize different token types, including integers, decimals, identifiers, booleans, operators, and delimiters.

Features

Converts NFAs to DFAs for efficient token recognition.
Supports multiple token types with priority-based classification.
Can process input code and classify tokens accordingly.

Installation

Prerequisites

Java Development Kit (JDK 11 or later)
Any Java IDE (e.g., IntelliJ IDEA, Eclipse, or VS Code)

Steps to Run

Clone the repository:

git clone https://github.com/yourusername/your-repo.git
cd your-repo

Compile and run the project:

javac -d out src/**/*.java
java -cp out main.Main

Token Rules

The lexer follows the following rules to classify tokens:

1. Identifiers

Must start with a lowercase letter (a-z).
Can contain only lowercase letters.
Example: varname, hello, testvar

2. Integers

Consist of one or more digits (0-9).
Example: 42, 12345

3. Decimals

Consist of an integer part followed by a . and a fractional part (up to 5 digits).
Example: 3.14, 42.001

4. Booleans

Recognized keywords: true and false.
Example: true, false

5. Operators

Supported: +, -, *, /, %, ^.
Example: a + b, x * y

6. Assignment Operator

Single equal sign (=) for assignment.
Example: x = 10;

7. Delimiters

Supported: ;, ,, (, ), {, }, [, ].
Example: if (true) {}

Example Code

x = 10;
y = 3.14159;
z = x + y;
if (true) { z = z * 2; }

The lexer will classify the tokens accordingly and output their types.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
dfa		dfa
errorHandler		errorHandler
lexer		lexer
main		main
nfa		nfa
InstructionManual.pdf		InstructionManual.pdf
README.md		README.md
module-info.java		module-info.java

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MyCompiler: Custom Lexer and Tokenizer

Overview

Features

Installation

Prerequisites

Steps to Run

Token Rules

1. Identifiers

2. Integers

3. Decimals

4. Booleans

5. Operators

6. Assignment Operator

7. Delimiters

Example Code

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

sanamir23/MyCompiler

Folders and files

Latest commit

History

Repository files navigation

MyCompiler: Custom Lexer and Tokenizer

Overview

Features

Installation

Prerequisites

Steps to Run

Token Rules

1. Identifiers

2. Integers

3. Decimals

4. Booleans

5. Operators

6. Assignment Operator

7. Delimiters

Example Code

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages