Simple Grammar for Chinese

The project runs in a Python environment because we found that Prolog does not support Chinese well. All characters in the files are encoded in UTF-8.

The file dictionary.utf-8 defines the words of the dictionary: one word per line, with an empty line separating different types of words.
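As a rough illustration of that layout, the sketch below groups words by their blank-line-separated blocks. The function name `parse_dictionary` and the sample words are hypothetical and not taken from the project; MySplitter.py may read the file differently.

```python
def parse_dictionary(text):
    """Return a list of word groups; a blank line starts a new group."""
    groups = []
    current = []
    for line in text.splitlines():
        word = line.strip()
        if word:
            current.append(word)
        elif current:
            # A blank line closes the current group of words.
            groups.append(current)
            current = []
    if current:
        groups.append(current)
    return groups


# Hypothetical dictionary content: three types of words, one word per line.
sample = "我\n你\n\n跑步\n吃\n\n操场\n"
groups = parse_dictionary(sample)
print(groups)
```

To read the real file, the text could be loaded with `open("dictionary.utf-8", encoding="utf-8").read()` before being passed to the function.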

The file sentence.utf-8 contains an ordinary Chinese sentence. The sentence is split into words by the splitter defined in MySplitter.py and then analysed by the interpreter defined in MyCharty.py.

If a word is not defined in the dictionary, the splitter falls back to treating each Chinese character as a separate word. For example, if "今天" is not defined in the dictionary, the sentence "我今天在操场跑步" is split so that "今" and "天" appear as two separate one-character words.
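The fallback behaviour can be sketched with a greedy longest-match splitter. This is only an assumption about how such a splitter might work, not the actual algorithm in MySplitter.py; the function name `split_sentence` and the small word set are hypothetical.

```python
def split_sentence(sentence, words):
    """Greedy longest-match split; unknown characters become one-character words."""
    max_len = max((len(w) for w in words), default=1)
    tokens = []
    i = 0
    while i < len(sentence):
        # Try the longest candidate first, shrinking down to a single character.
        for size in range(min(max_len, len(sentence) - i), 0, -1):
            piece = sentence[i:i + size]
            if size == 1 or piece in words:
                tokens.append(piece)
                i += size
                break
    return tokens


# Hypothetical dictionary that does not contain "今天".
words = {"我", "在", "操场", "跑步"}
tokens = split_sentence("我今天在操场跑步", words)
print(" ".join(tokens))
```

With this word set, "操场" and "跑步" are kept whole because they are in the dictionary, while "今" and "天" come out as separate tokens.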

MySplitter

python3 MySplitter.py dictionary.utf-8 sentence.utf-8 result.utf-8

dictionary.utf-8 is the file that defines the dictionary of words.
sentence.utf-8 is the file containing the sentences to be split.
result.utf-8 is the file that receives the split words as output.

MyCharty

python3 MyCharty.py -g PSG3.txt -i result.utf-8

PSG3.txt is the file containing the grammar.
result.utf-8 is the file containing the split sentence to be analysed.
