Project for CS421 – University of Illinois at Chicago

Name 1: vsakur2@uic.edu
Name 2: ufaiz2@uic.edu

Setup

Extract the .zip file
Install Maven using

Ubuntu - “sudo apt install maven”
Windows - https://maven.apache.org/install.html

Add all the files in input/testing/essays folder
Run ./run.sh in executable directory in terminal

Technique

Length of Essay: We use parse tree to obtain length of the essay. We use three cues for this:

No. of root sentences ([ROOT [S) tags in the parse tree.
No. of main verbs in the essay - We increment the sentence count if there more than one verb is used in the essay and the sentence does not have clause (SBAR tag).
No. of conjunctions in the essay - If a sentence tag follows a conjunction tag ([CC … [S) then it implies that it connecting one sentence with another. We keep an account of this too.

Spelling Mistakes: For counting the number of spelling mistakes, a dictionary.txt file is created which has a large list of words.

The word is considered as a spelling mistake if the POS tagging of the word comes out as NN and if the word is not present in the dictionary.txt file.
An additional file called wordStopper.txt is introduced just to make sure that all the stop words are not considered as a spelling mistake.

Agreement: Agreement between the nouns and verb were calculated in the following way

Using the Sentence Splitter from the StanfordParser, we are breaking the complete essay into sentences.
We are then checking whether each sentence has a subject using dependency parsing.
If a subject exists then we check if the correct POS tags assigned to all it’s words in that sentence so that they won’t violate the Singular-Plural property between the Nouns and Verbs.
For singular: {“VBZ”, “NN”, “NNP”, “VBG”, “VBN”, “VB”, “VBD”-”was”} and other tags can occur.
For Plural: {“VBP”, “NNS”, “NNPS”, “VBG”, “VBN”, “VB”, “VBD”-”were”} and other tags can occur

Missing Verb: Using the Sentence Splitter from the StanfordParser, we are breaking the complete essay into sentences.

We are then checking whether each sentence has all the six main verb tag in them i.e. {“VBZ”, “VBP”, “VBG”, “VBN”, “VB”, “VBD”}

Sentence Formation:

Getting count of fragments in an essay
Patterns found from the training set like "the my" in sentence counted as an invalid sentence
Counting sentences with only subordinate clause and no main clause
SINV sentences checked for validity

Text Coherency:

Collecting all the pronouns from each sentence.
Checking each pronoun in the sentence is available in the output of CoReferenceChain output.

Essay Validness - Topic Coherence: Using wordnet to identify synonyms of nouns used in essay and topic

Get list of synonyms to the nouns in the topic sentence
Get list of synonyms to the nouns in complete essay
If the essay synonym intersects with noun then we say that the noun used is in line with the topic of essay and hence increment count
The score is mapped as ratio of nouns related to essay/ total nouns in essay

##FinalScore

The equation used for calculating the final score is:

     FinalScore = (0.27720  * a) - (0.09944 * b) + (0.17453 * ci) - (0.01569  * cii) + (0.03670 *ciii) - (0.08948 * di) + (0.11589 * dii));

Using this equation the recall value of low essays in training corpus was 0.92 and recall for high essays increased to 0.98.

Error Patterns observed:

Some essays contained patterns of the form “ true.This ” i.e., the sentences did not have a space after a full stop. This gave rise to the problem of that pattern being tagged with a single POS tag. In order to overcome this problem, we preprocessed our file content to include a space after a full stop.
Some stop words are not recognized by the dictionary and hence a file name “wordStopper.txt” was created to ensure no stop word is being missed.
The verbs “was” and “were” come under the tag VBD and they are the only exceptions considered while looking for Noun-Verb agreement. The rest of the VBD tagged words do not violate the Noun-Verb agreement.
Some sentence had invalid word order of the form "the my"

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
.idea		.idea
executable		executable
input		input
output		output
src/main		src/main
target/classes/com/nlp/autoscoring/sentenceformation		target/classes/com/nlp/autoscoring/sentenceformation
.DS_Store		.DS_Store
.Rhistory		.Rhistory
.gitignore		.gitignore
Final Report.pdf		Final Report.pdf
NLP_ Intermediate Report.pdf		NLP_ Intermediate Report.pdf
README.md		README.md
README.pdf		README.pdf
essayscores.csv		essayscores.csv
pom.xml		pom.xml
properties.xml		properties.xml
regression.R		regression.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project for CS421 – University of Illinois at Chicago

Setup

Technique

Error Patterns observed:

About

Releases

Packages

Contributors 2

Languages

unaizafaiz/automatic-essay-scoring-system

Folders and files

Latest commit

History

Repository files navigation

Project for CS421 – University of Illinois at Chicago

Setup

Technique

Error Patterns observed:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages